Learning analytics is the analysis of student data with the purpose of improving learning. However, the process of data cleaning remains underexposed within learning analytics literature. In this paper, we elaborate on choices made in the cleaning process of student data and their consequences. We illustrate this with a case where data was gathered during six courses taught via Moodle. In this data set, only 21% of the logged activities were linked to a specific course. We illustrate possible choices in dealing with missing data by applying the cleaning process twelve times with different choices on copies of the raw data. Consequently, the analysis of the data shows varying outcomes. As the purpose of learning analytics is to intervene based on analysis and visualizations, it is of utmost importance to be aware of choices made during data cleaning. This paper's main goal is to make stakeholders of (learning) analytics activities aware of the fact that choices are made during data cleaning have consequences on the outcomes. We believe that there should be transparency to the users of these outcomes and give them a detailed report of the decisions made.
DOCUMENT
Citizens regularly search the Web to make informed decisions on daily life questions, like online purchases, but how they reason with the results is unknown. This reasoning involves engaging with data in ways that require statistical literacy, which is crucial for navigating contemporary data. However, many adults struggle to critically evaluate and interpret such data and make data-informed decisions. Existing literature provides limited insight into how citizens engage with web-sourced information. We investigated: How do adults reason statistically with web-search results to answer daily life questions? In this case study, we observed and interviewed three vocationally educated adults searching for products or mortgages. Unlike data producers, consumers handle pre-existing, often ambiguous data with unclear populations and no single dataset. Participants encountered unstructured (web links) and structured data (prices). We analysed their reasoning and the process of preparing data, which is part of data-ing. Key data-ing actions included judging relevance and trustworthiness of the data and using proxy variables when relevant data were missing (e.g., price for product quality). Participants’ statistical reasoning was mainly informal. For example, they reasoned about association but did not calculate a measure of it, nor assess underlying distributions. This study theoretically contributes to understanding data-ing and why contemporary data may necessitate updating the investigative cycle. As current education focuses mainly on producers’ tasks, we advocate including consumers’ tasks by using authentic contexts (e.g., music, environment, deferred payment) to promote data exploration, informal statistical reasoning, and critical web-search skills—including selecting and filtering information, identifying bias, and evaluating sources.
LINK
During the past two decades the implementation and adoption of information technology has rapidly increased. As a consequence the way businesses operate has changed dramatically. For example, the amount of data has grown exponentially. Companies are looking for ways to use this data to add value to their business. This has implications for the manner in which (financial) governance needs to be organized. The main purpose of this study is to obtain insight in the changing role of controllers in order to add value to the business by means of data analytics. To answer the research question a literature study was performed to establish a theoretical foundation concerning data analytics and its potential use. Second, nineteen interviews were conducted with controllers, data scientists and academics in the financial domain. Thirdly, a focus group with experts was organized in which additional data were gathered. Based on the literature study and the participants responses it is clear that the challenge of the data explosion consist of converting data into information, knowledge and meaningful insights to support decision-making processes. Performing data analyses enables the controller to support rational decision making to complement the intuitive decision making by (senior) management. In this way, the controller has the opportunity to be in the lead of the information provision within an organization. However, controllers need to have more advanced data science and statistic competences to be able to provide management with effective analysis. Specifically, we found that an important skill regarding statistics is the visualization and communication of statistical analysis. This is needed for controllers in order to grow in their role as business partner..
DOCUMENT
The focus of the research is 'Automated Analysis of Human Performance Data'. The three interconnected main components are (i)Human Performance (ii) Monitoring Human Performance and (iii) Automated Data Analysis . Human Performance is both the process and result of the person interacting with context to engage in tasks, whereas the performance range is determined by the interaction between the person and the context. Cheap and reliable wearable sensors allow for gathering large amounts of data, which is very useful for understanding, and possibly predicting, the performance of the user. Given the amount of data generated by such sensors, manual analysis becomes infeasible; tools should be devised for performing automated analysis looking for patterns, features, and anomalies. Such tools can help transform wearable sensors into reliable high resolution devices and help experts analyse wearable sensor data in the context of human performance, and use it for diagnosis and intervention purposes. Shyr and Spisic describe Automated Data Analysis as follows: Automated data analysis provides a systematic process of inspecting, cleaning, transforming, and modelling data with the goal of discovering useful information, suggesting conclusions and supporting decision making for further analysis. Their philosophy is to do the tedious part of the work automatically, and allow experts to focus on performing their research and applying their domain knowledge. However, automated data analysis means that the system has to teach itself to interpret interim results and do iterations. Knuth stated: Science is knowledge which we understand so well that we can teach it to a computer; and if we don't fully understand something, it is an art to deal with it.[Knuth, 1974]. The knowledge on Human Performance and its Monitoring is to be 'taught' to the system. To be able to construct automated analysis systems, an overview of the essential processes and components of these systems is needed.Knuth Since the notion of an algorithm or a computer program provides us with an extremely useful test for the depth of our knowledge about any given subject, the process of going from an art to a science means that we learn how to automate something.
Het analyseren van grote gegevensbestanden om de kwaliteit van het onderwijs te verbeteren is een hot item. De toepassing van learning analytics kan het onderwijs verbeteren. Wij doen onderzoek naar learning analytics en de vaardigheden die gebruikers daarbij nodig hebben.Doel Wij onderzoeken wat de gevolgen zijn van databewerking op de uitkomsten van learning analytics. En welke vaardigheden hebben gebruikers nodig om deze systemen zinvol te gebruiken? Learning analytics Learning analytics is het meten, verzamelen, analyseren en rapporteren van data van studenten en hun omgeving om het leren en de leeromgeving te begrijpen en te verbeteren. Het gebruik van learning analyticssystemen Het realiseren van grote delen van de onderwijsvisie van Hogeschool Utrecht is sterk verbonden met de succesvolle uitvoering van analyses op studentniveau. Het gebruik van learning analyticssystemen is niet vanzelfsprekend. De ontwerpers en ontwikkelaars van deze systemen moeten helder zijn over hun ontwerpkeuzes (zoals manieren van databewerking en de werking van algoritmes). Anderzijds moeten studenten en docenten beschikken over datavaardigheden om deze systemen op een zinvolle manier te gebruiken. Resultaten Dit onderzoek loopt. Na afloop vind je hier een samenvatting van de resultaten. In juli 2019 verscheen het volgende artikel van de onderzoekers: Automated Feedback for Workplace Learning in Higher Education. Looptijd 01 september 2017 - 31 december 2020 Aanpak We hebben eerst verkennend onderzoek gedaan door een case study waarin onderzocht is wat de effecten zijn van verschillende keuzes in de data cleaning op de uitkomsten van de data-analyse. Vanaf september 2019 gaan we onderzoeken welke datavaardigheden studenten nodig hebben om learning analytics-systemen effectief te gebruiken.
De analyse van data over het leren van studenten kan waardevol zijn. 'Learning analytics' gebruikt studentdata om het leerproces te verbeteren. Welke organisatorische vaardigheden hebben Nederlandse instellingen voor hoger onderwijs nodig om learning analytics succesvol in te zetten?Doel We onderzoeken welke organisatievaardigheden er nodig zijn om in het hoger onderwijs met 'learning analytics' te werken. Met learning analytics krijgen studenten, docenten en studiebegeleiders inzicht in het leerproces. Dit doen ze door data van studenten te analyseren. In de praktijk blijkt het lastig voor onderwijsinstellingen om hier over de hele breedte van de organisatie mee te gaan werken. We kijken in dit onderzoek welke vaardigheden er nodig zijn binnen een organisatie om 'learning analytics' slim in te zetten. Resultaten Dit onderzoek loopt. Tot nu toe hebben we drie wetenschappelijke artikelen gepubliceerd: A First Step Towards Learning Analytics: Implementing an Experimental Learning Analytics Tool Where is the learning in learning analytics? A systematic literature review to identify measures of affected learning From Dirty Data to Multiple Versions of Truth: How Different Choices in Data Cleaning Lead to Different Learning Analytics Outcomes Looptijd 01 december 2016 - 01 december 2020 Aanpak Het onderzoek bestaat uit literatuuronderzoek, een case study bij Nederlandse onderwijsinstellingen en een validatieproject. Dit leidt tot de ontwikkeling van een Learning Analytics Capability Model (LACM): een model dat beschrijft welke organisatorische vaardigheden nodig zijn om learning analytics in de praktijk toe te passen.