The platform for open and practice-oriented research

product

Testing machine learning applications

In this post I give an overview of the theory, tools, frameworks and best practices I have found until now around the testing (and debugging) of machine learning applications. I will start by giving an overview of the specificities of testing machine learning applications.

LINK

product

Understanding the effect of an educational intervention to optimize HIV testing strategies in primary care in Amsterdam – results of a mixed-methods study

Background: In the Netherlands, general practitioners (GPs) play a key role in provider-initiated HIV testing, but opportunities for timely diagnosis are regularly missed. We implemented an educational intervention to improve HIV testing by GPs from 2015 to 2020, and observed a 7% increase in testing in an evaluation using laboratory data. The objective for the current study was to gain a deeper understanding of whether and how practices and perceptions of GPs’ HIV/sexually transmitted infection (STI) testing behaviour changed following the intervention. Methods: We performed a mixed-methods study using questionnaires and semi-structured interviews to assess self-reported changes in HIV/STI testing by participating GPs. Questionnaires were completed by participants at the end of the final educational sessions from 2017 through 2020, and participating GPs were interviewed from January through March 2020. Questionnaire data were analysed descriptively, and open question responses were categorised thematically. Interview data were analysed following thematic analysis methods. Results: In total, 101/103 participants completed questionnaires. Of 65 participants that were included in analyses on the self-reported effect of the programme, forty-seven (72%) reported it had changed their HIV/STI testing, including improved STI consultations, adherence to the STI consultation guideline, more proactive HIV testing, and more extragenital STI testing. Patients’ risk factors, patients’ requests and costs were most important in selecting STI tests ordered. Eight participants were interviewed and 15 themes on improved testing were identified, including improved HIV risk-assessment, more proactive testing for HIV/STI, more focus on HIV indicator conditions and extragenital STI testing, and tools to address HIV during consultations. However, several persistent barriers for optimal HIV/STI testing by GPs were identified, including HIV-related stigma and low perceived risk. Conclusions: Most GPs reported improved HIV/STI knowledge, attitude and testing, but there was a discrepancy between reported changes in HIV testing and observed increases using laboratory data. Our findings highlight challenges in implementation of effective interventions, and in their evaluation. Lessons learned from this intervention may inform follow-up initiatives to keep GPs actively engaged in HIV testing and care, on our way to zero new HIV infections.

DOCUMENT

Understanding the effect of an educational intervention to optimize HIV testing strategies in primary care in Amsterdam – results of a mixed-methods study

product

Towards a reporting guideline for developmental and reproductive toxicology testing in C. elegans and other nematodes

Implementation of reliable methodologies allowing Reduction, Refinement, and Replacement (3Rs) of animal testing is a process that takes several decades and is still not complete. Reliable methods are essential for regulatory hazard assessment of chemicals where differences in test protocol can influence the test outcomes and thus affect the confidence in the predictive value of the organisms used as an alternative for mammals. Although test guidelines are common for mammalian studies, they are scarce for non-vertebrate organisms that would allow for the 3Rs of animal testing. Here, we present a set of 30 reporting criteria as the basis for such a guideline for Developmental and Reproductive Toxicology (DART) testing in the nematode Caenorhabditis elegans. Small organisms like C. elegans are upcoming in new approach methodologies for hazard assessment; thus, reliable and robust test protocols are urgently needed. A literature assessment of the fulfilment of the reporting criteria demonstrates that although studies describe methodological details, essential information such as compound purity and lot/batch number or type of container is often not reported. The formulated set of reporting criteria for C. elegans testing can be used by (i) researchers to describe essential experimental details (ii) data scientists that aggregate information to assess data quality and include data in aggregated databases (iii) regulators to assess study data for inclusion in regulatory hazard assessment of chemicals.

DOCUMENT

product

The importance of effect sizes

KEY MESSAGE: • Statistical significance testing alone is not the most adequate manner to evaluate if there is indeed a clinically relevant effect • Effect sizes should be added to significance testing • Effect sizes facilitate the decision whether a clinically relevant effect is found, helps determining the sample size for future studies, and facilitates comparison between scientific studies

DOCUMENT

product

The Elements of Deformation Analysis

With summaries in Dutch, Esperanto and English. DOI: 10.4233/uuid:d7132920-346e-47c6-b754-00dc5672b437 "The subject of this study is deformation analysis of the earth's surface (or part of it) and spatial objects on, above or below it. Such analyses are needed in many domains of society. Geodetic deformation analysis uses various types of geodetic measurements to substantiate statements about changes in geometric positions.Professional practice, e.g. in the Netherlands, regularly applies methods for geodetic deformation analysis that have shortcomings, e.g. because the methods apply substandard analysis models or defective testing methods. These shortcomings hamper communication about the results of deformation analyses with the various parties involved. To improve communication solid analysis models and a common language have to be used, which requires standardisation.Operational demands for geodetic deformation analysis are the reason to formulate in this study seven characteristic elements that a solid analysis model needs to possess. Such a model can handle time series of several epochs. It analyses only size and form, not position and orientation of the reference system; and datum points may be under influence of deformation. The geodetic and physical models are combined in one adjustment model. Full use is made of available stochastic information. Statistical testing and computation of minimal detectable deformations is incorporated. Solution methods can handle rank deficient matrices (both model matrix and cofactor matrix). And, finally, a search for the best hypothesis/model is implemented. Because a geodetic deformation analysis model with all seven elements does not exist, this study develops such a model.For effective standardisation geodetic deformation analysis models need: practical key performance indicators; a clear procedure for using the model; and the possibility to graphically visualise the estimated deformations."

DOCUMENT

product

Maximal cardiopulmonary exercise testing in laryngectomised patients using different heat and moisture exchangers - Feasibility and exercise responses

Objective. After laryngectomy, the breathing resistance of heat and moisture exchangers may limit exercise capacity. Breathing gas analysis during cardiopulmonary exercise testing is not possible using regular masks. This study tested the feasibility of cardiopulmonary exercise testing with a heat and moisture exchanger in situ, using an in-house designed connector. Additionally, we explored the effect of different heat and moisture exchanger resistances on exercise capacity in this group. Methods. Ten participants underwent two cardiopulmonary exercise tests using their daily life heat and moisture exchanger (0.3 hPa or 0.6 hPa) and one specifically developed for activity (0.15 hPa). Heat and moisture exchanger order was randomised and blinded.Results. All participants completed both tests. No (serious) adverse events occurred. Only four subjects reached a respiratory exchange ratio of more than 1.1 in at least one test. Maximum exercise levels using heat and moisture exchangers with different resistances did not differ. Conclusion. Cardiopulmonary exercise testing in laryngectomees with a heat and moisture exchanger is feasible; however, the protocol does not seem appropriate to reach this group's maximal exercise capacity. Lowering heat and moisture exchanger resistance does not increase exercise capacity in this sample.

DOCUMENT

Maximal cardiopulmonary exercise testing in laryngectomised patients using different heat and moisture exchangers - Feasibility and exercise responses

product

Improving provider-initiated testing for HIV and other STI in the primary care setting in Amsterdam, the Netherlands

Background In the Netherlands, general practitioners (GPs) play a key role in HIV testing. However, the proportion of people diagnosed with late-stage HIV remains high, and opportunities for earlier diagnosis are being missed. We implemented an educational intervention to improve HIV and STI testing in primary care in Amsterdam, the Netherlands. Methods GPs were invited to participate in an educational program between 2015 and 2020, which included repeat sessions using audit and feedback and quality improvement plans. Data on HIV, chlamydia and gonorrhoea testing by GPs were collected from 2011 through 2020. The primary outcome was HIV testing frequency, which was compared between GPs before and after participation using Poisson regression. Secondary outcomes were chlamydia and gonorrhoea testing frequencies, and positive test proportions. Additional analyses stratified by patient sex and age were done. Findings GPs after participation performed 7% more HIV tests compared to GPs before participation (adjusted relative ratio [aRR] 1.07, 95%CI 1.04–1.09); there was no change in the proportion HIV positive tests (aRR 0.87, 95%CI 0.63–1.19). HIV testing increased most among patients who were female and ≤19 or 50–64 years old. After participation, HIV testing continued to increase (aRR 1.02 per quarter, 95%CI 1.01–1.02). Chlamydia testing by GPs after participation increased by 6% (aRR 1.06, 95%CI 1.05–1.08), while gonorrhoea testing decreased by 2% (aRR 0.98, 95%CI 0.97–0.99). We observed increases specifically in extragenital chlamydia and gonorrhoea testing. Conclusions The intervention was associated with a modest increase in HIV testing among GPs after participation, while the proportion positive HIV tests remained stable. Our results suggest that the intervention yielded a sustained effect.

DOCUMENT

Improving provider-initiated testing for HIV and other STI in the primary care setting in Amsterdam, the Netherlands

product

Usefulness of cardiopulmonary exercise testing to predict the development of arterial hypertension in adult patients with repaired isolated coarctation of the aorta.

BACKGROUND: Patients who underwent surgery for aortic coarctation (COA) have an increased risk of arterial hypertension. We aimed at evaluating (1) differences between hypertensive and non-hypertensive patients and (2) the value of cardiopulmonary exercise testing (CPET) to predict the development or progression of hypertension. METHODS: Between 1999 and 2010, CPET was performed in 223 COA-patients of whom 122 had resting blood pressures of <140/90 mmHg without medication, and 101 were considered hypertensive. Comparative statistics were performed. Cox regression analysis was used to assess the relation between demographic, clinical and exercise variables and the development/progression of hypertension. RESULTS: At baseline, hypertensive patients were older (p=0.007), were more often male (p=0.004) and had repair at later age (p=0.008) when compared to normotensive patients. After 3.6 ± 1.2 years, 29/120 (25%) normotensive patients developed hypertension. In normotensives, VE/VCO2-slope (p=0.0016) and peak systolic blood pressure (SBP; p=0.049) were significantly related to the development of hypertension during follow-up. Cut-off points related to higher risk for hypertension, based on best sensitivity and specificity, were defined as VE/VCO2-slope ≥ 27 and peak SBP ≥ 220 mmHg. In the hypertensive group, antihypertensive medication was started/extended in 48/101 (48%) patients. Only age was associated with the need to start/extend antihypertensive therapy in this group (p=0.042). CONCLUSIONS: Higher VE/VCO2-slope and higher peak SBP are risk factors for the development of hypertension in adults with COA. Cardiopulmonary exercise testing may guide clinical decision making regarding close blood pressure control and preventive lifestyle recommendations.

DOCUMENT

Usefulness of cardiopulmonary exercise testing to predict the development of arterial hypertension in adult patients with repaired isolated coarctation of the aorta.

product

The don't know option in progress testing

Formula scoring (FS) is the use of a don't know option (DKO) with subtraction of points for wrong answers. Its effect on construct validity and reliability of progress test scores, is subject of discussion. Choosing a DKO may not only be affected by knowledge level, but also by risk taking tendency, and may thus introduce construct-irrelevant variance into the knowledge measurement. On the other hand, FS may result in more reliable test scores. To evaluate the impact of FS on construct validity and reliability of progress test scores, a progress test for radiology residents was divided into two tests of 100 parallel items (A and B). Each test had a FS and a number-right (NR) version, A-FS, B-FS, A-NR, and B-NR. Participants (337) were randomly divided into two groups. One group took test A-FS followed by B-NR, and the second group test B-FS followed by A-NR. Evidence for impaired construct validity was sought in a hierarchical regression analysis by investigating how much of the participants' FS-score variance was explained by the DKO-score, compared to the contribution of the knowledge level (NR-score), while controlling for Group, Gender, and Training length. Cronbach's alpha was used to estimate NR and FS-score reliability per year group. NR score was found to explain 27 % of the variance of FS [F(1,332) = 219.2, p < 0.0005], DKO-score, and the interaction of DKO and Gender were found to explain 8 % [F(2,330) = 41.5, p < 0.0005], and the interaction of DKO and NR 1.6 % [F(1,329) = 16.6, p < 0.0005], supporting our hypothesis that FS introduces construct-irrelevant variance into the knowledge measurement. However, NR-scores showed considerably lower reliabilities than FS-scores (mean year-test group Cronbach's alphas were 0.62 and 0.74, respectively). Decisions about FS with progress tests should be a careful trade-off between systematic and random measurement error.

DOCUMENT

Search results

Products 2.592

Testing machine learning applications

Understanding the effect of an educational intervention to optimize HIV testing strategies in primary care in Amsterdam – results of a mixed-methods study

Towards a reporting guideline for developmental and reproductive toxicology testing in C. elegans and other nematodes

The importance of effect sizes

The Elements of Deformation Analysis

Maximal cardiopulmonary exercise testing in laryngectomised patients using different heat and moisture exchangers - Feasibility and exercise responses

Improving provider-initiated testing for HIV and other STI in the primary care setting in Amsterdam, the Netherlands

Usefulness of cardiopulmonary exercise testing to predict the development of arterial hypertension in adult patients with repaired isolated coarctation of the aorta.

The don't know option in progress testing

Projects 1

Automated Mobility traIninG prOgrams (AMIGO)

Navigate to

Categories

Filters

Products 2.592

Testing machine learning applications

Understanding the effect of an educational intervention to optimize HIV testing strategies in primary care in Amsterdam – results of a mixed-methods study

Towards a reporting guideline for developmental and reproductive toxicology testing in C. elegans and other nematodes

The importance of effect sizes

The Elements of Deformation Analysis

Maximal cardiopulmonary exercise testing in laryngectomised patients using different heat and moisture exchangers - Feasibility and exercise responses

Improving provider-initiated testing for HIV and other STI in the primary care setting in Amsterdam, the Netherlands

Usefulness of cardiopulmonary exercise testing to predict the development of arterial hypertension in adult patients with repaired isolated coarctation of the aorta.

The don't know option in progress testing

Projects 1

Automated Mobility traIninG prOgrams (AMIGO)