Recently, the job market for Artificial Intelligence (AI) engineers has exploded. Since the role of AI engineer is relatively new, limited research has been done on the requirements as set by the industry. Moreover, the definition of an AI engineer is less established than that of a data scientist or a software engineer. In this study we explore, based on job ads, the requirements from the job market for the position of AI engineer in the Netherlands. We retrieved job ad data between April 2018 and April 2021 from a large job ad database, Jobfeed from TextKernel. The job ads were selected with a process similar to the selection of primary studies in a literature review. We characterize the 367 resulting job ads based on meta-data such as publication date, industry/sector, educational background and job titles. To answer our research questions we have further coded 125 job ads manually. The job tasks of AI engineers are concentrated in five categories: business understanding, data engineering, modeling, software development and operations engineering. Companies ask for AI engineers with different profiles: 1) data science engineer with focus on modeling, 2) AI software engineer with focus on software development, 3) generalist AI engineer with focus on both models and software. Furthermore, we present the tools and technologies mentioned in the selected job ads, and the soft skills. Our research helps to understand the expectations companies have for professionals building AI-enabled systems. Understanding these expectations is crucial both for prospective AI engineers and educational institutions in charge of training those prospective engineers. Our research also helps to better define the profession of AI engineering. We do this by proposing an extended AI engineering life-cycle that includes a business understanding phase.
LINK
Over the past three years we have built a practice-oriented, bachelor-level educational programme for software engineers to specialize as AI engineers. The experience with this programme and the practical assignments our students execute in industry have given us valuable insights into the profession of AI engineer. In this paper we discuss our programme and the lessons learned for industry and research.
MULTIFILE
In my previous post on AI engineering I defined the concepts involved in this new discipline and explained that with the current state of the practice, AI engineers could also be named machine learning (ML) engineers. In this post I would like to 1) define our view on the profession of applied AI engineer and 2) present the toolbox of an AI engineer with tools, methods and techniques to tackle the challenges AI engineers typically face. I end this post with a short overview of related work and future directions. Attached to it is an extensive list of references and additional reading material.
LINK
Over the past two years I have conducted an extensive literature and tool review to answer the question: “What should software engineers learn about building production-ready machine learning systems?”. During my research I noted that because the discipline of building production-ready machine learning systems is so new, it is not easy to get the terminology straight. People write about it from different perspectives and backgrounds and have not yet found each other to join forces. At the same time the field is moving fast and is far from mature. My focus on material that is ready to be used with our bachelor-level students (applied software engineers, profession-oriented education) helped me to consolidate everything I have found into a body of knowledge for building production-ready machine learning (ML) systems. In this post I will first define the discipline and introduce the terminology for AI engineering and MLOps.
LINK
The current set of research methods on ictresearchmethods.nl contains only one research method that refers to machine learning: the “Data analytics” method in the “Lab” strategy. This does not reflect the way of working in ML projects, where Data Analytics is not a method to answer one question but the main goal of the project. For ML projects, the Data Analytics method should be divided into several smaller steps, each becoming a method of its own. In other words, we should treat the Data Analytics (or, more appropriately, ML engineering) process in the same way the software engineering process is treated in the framework. In the remainder of this post I will briefly discuss each of the existing research methods and how they apply to ML projects. The methods are organized by strategy. In the discussion I will give pointers to relevant tools or literature for ML projects.
LINK
This chapter discusses how to build production-ready machine learning systems. There are several challenges involved in accomplishing this, each with its specific solutions regarding practices and tool support. The chapter presents those solutions and introduces MLOps (machine learning operations, also called machine learning engineering) as an overarching and integrated approach in which data engineers, data scientists, software engineers, and operations engineers integrate their activities to implement validated machine learning applications managed from initial idea to daily operation in a production environment. This approach combines agile software engineering processes with the machine learning-specific workflow. Following the principles of MLOps is paramount in building high-quality production-ready machine learning systems. The current state of MLOps is discussed in terms of best practices and tool support. The chapter ends by describing future developments that are bound to improve and extend the tool support for implementing an MLOps approach.
LINK
Background: Digital health is well-positioned in low and middle-income countries (LMICs) to revolutionize health care due, in part, to increasing mobile phone access and internet connectivity. This paper evaluates the underlying factors that can potentially facilitate or hinder the progress of digital health in Pakistan. Objective: The objective of this study is to identify the current digital health projects and studies being carried out in Pakistan, as well as the key stakeholders involved in these initiatives. We aim to follow a mixed-methods strategy and to evaluate these projects and studies through a strengths, weaknesses, opportunities, and threats (SWOT) analysis to identify the internal and external factors that can potentially facilitate or hinder the progress of digital health in Pakistan. Methods: This study aims to evaluate digital health projects carried out in the last 5 years in Pakistan with mixed methods. The qualitative and quantitative data obtained from field surveys were categorized according to the World Health Organization’s (WHO) recommended building blocks for health systems research, and the data were analyzed using a SWOT analysis strategy. Results: Of the digital health projects carried out in the last 5 years in Pakistan, 51 were studied. Of these projects, 46% (23/51) used technology for conducting research, 30% (15/51) used technology for implementation, and 12% (6/51) used technology for app development. The health domains targeted were general health (23/51, 46%), immunization (13/51, 26%), and diagnostics (5/51, 10%). Smartphones and devices were used in 55% (28/51) of the interventions, and 59% (30/51) of projects included plans for scaling up. Artificial intelligence (AI) or machine learning (ML) was used in 31% (16/51) of projects, and 74% (38/51) of interventions were being evaluated.
The barriers faced by developers during the implementation phase included the populations’ inability to use the technology or mobile phones in 21% (11/51) of projects, costs in 16% (8/51) of projects, and privacy concerns in 12% (6/51) of projects.
DOCUMENT
The prevention and diagnosis of frailty syndrome (FS) in cardiac patients requires innovative systems to support medical personnel, patient adherence, and self-care behavior. To do so, modern medicine uses a supervised machine learning (ML) approach to study the psychosocial domains of frailty in cardiac patients with heart failure (HF). This study aimed to determine the absolute and relative diagnostic importance of the individual components of the Tilburg Frailty Indicator (TFI) questionnaire in patients with HF. An exploratory analysis was performed using machine learning algorithms and the permutation method to determine the absolute importance of frailty components in HF. Based on the TFI data, which contain physical and psychosocial components, machine learning models were built based on three algorithms: a decision tree, a random decision forest, and the AdaBoost classifier. The absolute weights were used to make pairwise comparisons between the variables and obtain relative diagnostic importance. The analysis of HF patients’ responses showed that the psychological variable TFI20, diagnosing low mood, was more diagnostically important than the variables from the physical domain: lack of strength in the hands and physical fatigue. The psychological variable TFI21, linked with agitation and irritability, was diagnostically more important than all three physical variables considered: walking difficulties, lack of hand strength, and physical fatigue. In the case of the two remaining variables from the psychological domain (TFI19, TFI22), and for all variables from the social domain, the results do not allow for the rejection of the null hypothesis. From a long-term perspective, the ML-based frailty approach can support healthcare professionals, including psychologists and social workers, in drawing their attention to the nonphysical origins of HF.
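As an illustration of the permutation method named in this abstract, the following is a minimal, generic sketch and not the authors' pipeline. It assumes a hypothetical toy classifier (`toy_model`) and made-up binary features (`low_mood`, `hand_strength`); the importance of a feature is taken to be the average drop in accuracy when that feature's values are shuffled across patients.

```python
import random


def accuracy(model, rows, labels):
    """Fraction of rows for which the model's prediction equals the label."""
    correct = sum(1 for row, label in zip(rows, labels) if model(row) == label)
    return correct / len(labels)


def permutation_importance(model, rows, labels, n_repeats=20, seed=0):
    """Mean accuracy drop per feature after shuffling that feature's column."""
    rng = random.Random(seed)
    baseline = accuracy(model, rows, labels)
    importances = {}
    for feature in rows[0]:
        drops = []
        for _ in range(n_repeats):
            # Shuffle one column while leaving all other columns untouched.
            shuffled = [row[feature] for row in rows]
            rng.shuffle(shuffled)
            permuted = [dict(row, **{feature: value})
                        for row, value in zip(rows, shuffled)]
            drops.append(baseline - accuracy(model, permuted, labels))
        importances[feature] = sum(drops) / n_repeats
    return importances


def toy_model(row):
    """Hypothetical classifier: flags frailty from low mood only."""
    return 1 if row["low_mood"] == 1 else 0


# Made-up data for illustration; labels are generated by the toy model itself.
rows = [{"low_mood": m, "hand_strength": h}
        for m, h in [(1, 0), (0, 1), (1, 1), (0, 0), (1, 0), (0, 1)]]
labels = [toy_model(row) for row in rows]

importances = permutation_importance(toy_model, rows, labels)
```

Because the toy classifier ignores `hand_strength`, shuffling that column cannot change any prediction, so its importance is exactly zero. Comparing such per-feature scores pairwise is one way to read "relative importance", though the study's exact procedure is not given in the abstract.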
DOCUMENT
Routine immunization (RI) of children is the most effective and timely public health intervention for decreasing child mortality rates around the globe. Pakistan, a low- and middle-income country (LMIC), has one of the highest child mortality rates in the world, mainly due to vaccine-preventable diseases (VPDs). To improve RI coverage, it is critical to identify potential RI defaulters at an early stage, so that appropriate interventions can be targeted at children who are at risk of missing their scheduled vaccinations. In this paper, a machine learning (ML) based predictive model is proposed to predict defaulting and non-defaulting children on upcoming immunization visits and to examine the effect of the underlying contributing factors. The predictive model uses data obtained from the Paigham-e-Sehat study, comprising immunization records of 3,113 children. The design of the predictive model is based on obtaining optimal results across accuracy, specificity, and sensitivity, to ensure model outcomes remain practically relevant to the problem addressed. The predictive model is further optimized by selecting significant features and removing data bias. Nine machine learning algorithms were applied to predict defaulting children for the next immunization visit. The results showed that the random forest model achieved the optimal accuracy of 81.9%, with 83.6% sensitivity and 80.3% specificity. The main determinants of vaccination coverage were found to be vaccine coverage at birth, parental education, and socio-economic conditions of the defaulting group. This information can assist relevant policy makers in taking proactive and effective measures to develop evidence-based, targeted, and timely interventions for defaulting children.
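The three evaluation measures named in this abstract can be computed from the counts of a binary confusion matrix. The sketch below is illustrative only: the function name `classification_metrics` and the example labels are hypothetical, with 1 standing for a defaulting child and 0 for a non-defaulting child.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, sensitivity and specificity for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return {
        # Share of all children classified correctly.
        "accuracy": (tp + tn) / len(pairs),
        # Share of actual defaulters that the model catches.
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        # Share of actual non-defaulters that the model clears.
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }


# Made-up labels for illustration, not data from the study.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
metrics = classification_metrics(y_true, y_pred)
```

Reporting sensitivity and specificity alongside accuracy matters when the classes are unbalanced, since a model that labels every child as non-defaulting can still score a high accuracy.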
MULTIFILE
Buildings need to be carefully operated and maintained for optimum health, comfort, energy performance, and utility costs. The increasing use of Machine Learning combined with Big Data in the building services sector has shown the potential to bring energy efficiency and cost-effectiveness. Therefore, upskilling and reskilling the current workforce is required to realize new possibilities. In addition, sharing and preserving knowledge are also required for the sustainable growth of professionals and companies. This formed the basis for the TransAct project, funded by the Dutch Research Council. To increase access to education on the job, online learning is experiencing phenomenal growth. A study was conducted with two focus groups - professionals of a building service company and university researchers - to understand the existing challenges and the ways to improve knowledge sharing and upskilling through learning on the job. This study introduced an Enterprise Social Network platform that connects members and may facilitate knowledge sharing. As a community forum, Yammer from Office 365 was used. For hosting project files, a SharePoint page was created. For online courses, the company’s online learning site was utilized. The log data from the online tools were analysed, semi-structured interviews and webinars were conducted, and feedback was collected with Google Forms. Incentive models like social recognition and innovative project results were used to motivate the professionals for online activities. This paper distinguishes the impacts of initiatives on the behaviour of university researchers vs company employees.
DOCUMENT