The platform for open and practice-oriented research

product

A comprehensive approach for mitigating impersonation in online assessment

The security of online assessments is a major concern due to widespread cheating. One common form of cheating is impersonation, where students invite unauthorized persons to take assessments on their behalf. Several techniques exist to handle impersonation. Some researchers recommend use of integrity policy, but communicating the policy effectively to the students is a challenge. Others propose authentication methods like, password and fingerprint; they offer initial authentication but are vulnerable thereafter. Face recognition offers post-login authentication but necessitates additional hardware. Keystroke Dynamics (KD) has been used to provide post-login authentication without any additional hardware, but its use is limited to subjective assessment. In this work, we address impersonation in assessments with Multiple Choice Questions (MCQ). Our approach combines two key strategies: reinforcement of integrity policy for prevention, and keystroke-based random authentication for detection of impersonation. To the best of our knowledge, it is the first attempt to use keystroke dynamics for post-login authentication in the context of MCQ. We improve an online quiz tool for the data collection suited to our needs and use feature engineering to address the challenge of high-dimensional keystroke datasets. Using machine learning classifiers, we identify the best-performing model for authenticating the students. The results indicate that the highest accuracy (83%) is achieved by the Isolation Forest classifier. Furthermore, to validate the results, the approach is applied to Carnegie Mellon University (CMU) benchmark dataset, thereby achieving an improved accuracy of 94%. Though we also used mouse dynamics for authentication, but its subpar performance leads us to not consider it for our approach.

DOCUMENT

A comprehensive approach for mitigating impersonation in online assessment

product

Prediction of Medical Outcomes with Modern Modelling Techniques

Het doel van dit onderzoek is te onderzoeken onder welke omstandigheden en onder welke condities relatief moderne modelleringstechnieken zoals support vector machines, neural networks en random forests voordelen zouden kunnen hebben in medisch-wetenschappelijk onderzoek en in de medische praktijk in vergelijking met meer traditionele modelleringstechnieken, zoals lineaire regressie, logistische regressie en Cox regressie.

MULTIFILE

Prediction of Medical Outcomes with Modern Modelling Techniques

product

An Audit Framework for Technical Assessment of Binary Classifiers

Multilevel models using logistic regression (MLogRM) and random forest models (RFM) are increasingly deployed in industry for the purpose of binary classification. The European Commission’s proposed Artificial Intelligence Act (AIA) necessitates, under certain conditions, that application of such models is fair, transparent, and ethical, which consequently implies technical assessment of these models. This paper proposes and demonstrates an audit framework for technical assessment of RFMs and MLogRMs by focussing on model-, discrimination-, and transparency & explainability-related aspects. To measure these aspects 20 KPIs are proposed, which are paired to a traffic light risk assessment method. An open-source dataset is used to train a RFM and a MLogRM model and these KPIs are computed and compared with the traffic lights. The performance of popular explainability methods such as kernel- and tree-SHAP are assessed. The framework is expected to assist regulatory bodies in performing conformity assessments of binary classifiers and also benefits providers and users deploying such AI-systems to comply with the AIA.

DOCUMENT

An Audit Framework for Technical Assessment of Binary Classifiers

product

Improving Routine Immunization Coverage Through Optimally Designed Predictive Models

Routine immunization (RI) of children is the most effective and timely public health intervention for decreasing child mortality rates around the globe. Pakistan being a low-and-middle-income-country (LMIC) has one of the highest child mortality rates in the world occurring mainly due to vaccine-preventable diseases (VPDs). For improving RI coverage, a critical need is to establish potential RI defaulters at an early stage, so that appropriate interventions can be targeted towards such population who are identified to be at risk of missing on their scheduled vaccine uptakes. In this paper, a machine learning (ML) based predictive model has been proposed to predict defaulting and non-defaulting children on upcoming immunization visits and examine the effect of its underlying contributing factors. The predictive model uses data obtained from Paigham-e-Sehat study having immunization records of 3,113 children. The design of predictive model is based on obtaining optimal results across accuracy, specificity, and sensitivity, to ensure model outcomes remain practically relevant to the problem addressed. Further optimization of predictive model is obtained through selection of significant features and removing data bias. Nine machine learning algorithms were applied for prediction of defaulting children for the next immunization visit. The results showed that the random forest model achieves the optimal accuracy of 81.9% with 83.6% sensitivity and 80.3% specificity. The main determinants of vaccination coverage were found to be vaccine coverage at birth, parental education, and socio-economic conditions of the defaulting group. This information can assist relevant policy makers to take proactive and effective measures for developing evidence based targeted and timely interventions for defaulting children.

MULTIFILE

Improving Routine Immunization Coverage Through Optimally Designed Predictive Models

product

A Comparison of Different Modeling Techniques in Predicting Mortality With the Tilburg Frailty Indicator

Background: Modern modeling techniques may potentially provide more accurate predictions of dichotomous outcomes than classical techniques. Objective: In this study, we aimed to examine the predictive performance of eight modeling techniques to predict mortality by frailty. Methods: We performed a longitudinal study with a 7-year follow-up. The sample consisted of 479 Dutch community-dwelling people, aged 75 years and older. Frailty was assessed with the Tilburg Frailty Indicator (TFI), a self-report questionnaire. This questionnaire consists of eight physical, four psychological, and three social frailty components. The municipality of Roosendaal, a city in the Netherlands, provided the mortality dates. We compared modeling techniques, such as support vector machine (SVM), neural network (NN), random forest, and least absolute shrinkage and selection operator, as well as classical techniques, such as logistic regression, two Bayesian networks, and recursive partitioning (RP). The area under the receiver operating characteristic curve (AUROC) indicated the performance of the models. The models were validated using bootstrapping. Results: We found that the NN model had the best validated performance (AUROC=0.812), followed by the SVM model (AUROC=0.705). The other models had validated AUROC values below 0.700. The RP model had the lowest validated AUROC (0.605). The NN model had the highest optimism (0.156). The predictor variable “difficulty in walking” was important for all models. Conclusions: Because of the high optimism of the NN model, we prefer the SVM model for predicting mortality among community-dwelling older people using the TFI, with the addition of “gender” and “age” variables. External validation is a necessary step before applying the prediction models in a new setting.

DOCUMENT

A Comparison of Different Modeling Techniques in Predicting Mortality With the Tilburg Frailty Indicator

product

An AI-based Digital Twin Case Study in the MRO Sector

In this work, the concept of an Artificial Intelligence-based (AI) Digital Twin (DT) of an aircraft system is introduced, with the goal to improve the corresponding MRO Operations. More specifically, the current study aims to obtaining knowledge on the optimal placement of sensors in an ideal Power Electronics Cooling System (PECS) of a modern airliner, aiming to improve input data as a basis for an AI-based DT. The three main fluid parameters to be measured directly or indirectly at various physical locations at the PECS are mass flow rate, temperature and static pressure. The physics-based model can then be combined with a Machine Learning (ML) model, such as a Random Forest (RF), with a multitude of decision trees. Following, the AI system determines whether the PECS operations is considered normal, aiming to optimize the performance of the system and to maximize the Useful Remaining Life (URL). The suggested AI-DT approach is based both on data-driven and physics-based models, an approach which results in increased reliability and availability, reducing possible Aircraft on Ground (AOG) events. Subsequently, the enhanced prediction capability results in the optimization of the maintenance processes and in reduced operational costs.

DOCUMENT

An AI-based Digital Twin Case Study in the MRO Sector

product

A Clustering Approach for Personalized Coaching Applications

Insufficient physical activity presents a significant hazard to overall health, with sedentary lifestyles linked to a variety of health issues. Monitoring physical activity levels allows the recognition of patterns of sedentary behavior and the provision of coaching to meet the recommended physical activity standards. In this paper, we aim to address the problem of reducing the time consuming process of fitting classifiers when generating personalized models for a coaching application. The proposed approach consists of evaluating the effects of clustering participants based on their walking patterns and then recommending a unique model for each group. Each model consists of a random forest classifier with a different number of estimators each. The resulting approach reduces the fitting time considerably while keeping nearly the same classification performance as personalized models.

DOCUMENT

product

Predictors of district nursing care utilisation for community-living people in the Netherlands: an exploratory study using claims data

Objective To explore predictors of district nursing care utilisation for community-living (older) people in the Netherlands using claims data. To cope with growing demands in district nursing care, knowledge about the current utilisation of district nursing care is important. Setting District nursing care as a part of primary care. Participants In this nationwide study, claims data were used from the Dutch risk adjustment system and national information system of health insurers. Samples were drawn of 5500 pairs of community-living people using district nursing care (cases) and people not using district nursing care (controls) for two groups: all ages and aged 75+ years (total N=22 000). Outcome measures The outcome was district nursing care utilisation and the 114 potential predictors included predisposing factors (eg, age), enabling factors (eg, socioeconomic status) and need factors (various healthcare costs). The random forest algorithm was used to predict district nursing care utilisation. The performance of the models and importance of predictors were calculated. Results For the population of people aged 75+ years, most important predictors were older age, and high costs for general practitioner consultations, aid devices, pharmaceutical care, ambulance transportation and occupational therapy. For the total population, older age, and high costs for pharmaceutical care and aid devices were the most important predictors. Conclusions People in need of district nursing care are older, visit the general practitioner more often, and use more and/or expensive medications and aid devices. Therefore, close collaboration between the district nurse, general practitioner and the community pharmacist is important. Additional analyses including data regarding health status are recommended. Further research is needed to provide an evidence base for district nursing care to optimise the care for those with high care needs, and guide practice and policymakers’ decision-making.

DOCUMENT

Predictors of district nursing care utilisation for community-living people in the Netherlands: an exploratory study using claims data

product

External Validation of Models for Predicting Disability in Community-Dwelling Older People in the Netherlands

Background: Advanced statistical modeling techniques may help predict health outcomes. However, it is not the case that these modeling techniques always outperform traditional techniques such as regression techniques. In this study, external validation was carried out for five modeling strategies for the prediction of the disability of community-dwelling older people in the Netherlands. Methods: We analyzed data from five studies consisting of community-dwelling older people in the Netherlands. For the prediction of the total disability score as measured with the Groningen Activity Restriction Scale (GARS), we used fourteen predictors as measured with the Tilburg Frailty Indicator (TFI). Both the TFI and the GARS are self-report questionnaires. For the modeling, five statistical modeling techniques were evaluated: general linear model (GLM), support vector machine (SVM), neural net (NN), recursive partitioning (RP), and random forest (RF). Each model was developed on one of the five data sets and then applied to each of the four remaining data sets. We assessed the performance of the models with calibration characteristics, the correlation coefficient, and the root of the mean squared error. Results: The models GLM, SVM, RP, and RF showed satisfactory performance characteristics when validated on the validation data sets. All models showed poor performance characteristics for the deviating data set both for development and validation due to the deviating baseline characteristics compared to those of the other data sets. Conclusion: The performance of four models (GLM, SVM, RP, RF) on the development data sets was satisfactory. This was also the case for the validation data sets, except when these models were developed on the deviating data set. The NN models showed a much worse performance on the validation data sets than on the development data sets.

DOCUMENT

Search results

Products 140

A comprehensive approach for mitigating impersonation in online assessment

Prediction of Medical Outcomes with Modern Modelling Techniques

An Audit Framework for Technical Assessment of Binary Classifiers

Improving Routine Immunization Coverage Through Optimally Designed Predictive Models

A Comparison of Different Modeling Techniques in Predicting Mortality With the Tilburg Frailty Indicator

An AI-based Digital Twin Case Study in the MRO Sector

A Clustering Approach for Personalized Coaching Applications

Predictors of district nursing care utilisation for community-living people in the Netherlands: an exploratory study using claims data

External Validation of Models for Predicting Disability in Community-Dwelling Older People in the Netherlands

Navigate to

Categories

Filters

Products 140

A comprehensive approach for mitigating impersonation in online assessment

Prediction of Medical Outcomes with Modern Modelling Techniques

An Audit Framework for Technical Assessment of Binary Classifiers

Improving Routine Immunization Coverage Through Optimally Designed Predictive Models

A Comparison of Different Modeling Techniques in Predicting Mortality With the Tilburg Frailty Indicator

An AI-based Digital Twin Case Study in the MRO Sector

A Clustering Approach for Personalized Coaching Applications

Predictors of district nursing care utilisation for community-living people in the Netherlands: an exploratory study using claims data

External Validation of Models for Predicting Disability in Community-Dwelling Older People in the Netherlands