The development of the World Wide Web, the emergence of social media and Big Data have led to a rising amount of data. Infor¬mation and Communication Technol¬ogies (ICTs) affect the environment in various ways. Their energy consumption is growing exponentially, with and without the use of ‘green’ energy. Increasing envi¬ronmental aware¬ness has led to discussions on sustainable development. The data deluge makes it not only necessary to pay attention to the hard‑ and software di¬mensions of ICTs but also to the ‘value’ of the data stored. In this paper, we study the possibility to methodically reduce the amount of stored data and records in organizations based on the ‘value’ of informa¬tion, using the Green Archiving Model we have developed. Reducing the amount of data and records in organizations helps in allowing organizations to fight the data deluge and to realize the objectives of both Digital Archiving and Green IT. At the same time, methodi¬cally deleting data and records should reduce the con¬sumption of electricity for data storage. As a consequencs, the organizational cost for electricity use should be reduced. Our research showed that the model can be used to reduce [1] the amount of data (45 percent, using Archival Retention Levels and Retention Schedules) and [2] the electricity con¬sumption for data storage (resulting in a cost reduction of 35 percent). Our research indicates that the Green Ar¬chiving Model is a viable model to reduce the amount of stored data and records and to curb electricity use for storage in organi¬zations. This paper is the result of the first stage of a research project that is aimed at devel¬oping low power ICTs that will automa¬tically appraise, select, preserve or permanently delete data based on their ‘value’. Such an ICT will automatically reduce storage capacity and reduce electricity con¬sumption used for data storage. At the same time, data dispos¬al will reduce overload caused by storing the sa¬me data in different for¬mats, it will lower costs and it reduces the po¬tential for liability.
Abstract Despite the numerous business benefits of data science, the number of data science models in production is limited. Data science model deployment presents many challenges and many organisations have little model deployment knowledge. This research studied five model deployments in a Dutch government organisation. The study revealed that as a result of model deployment a data science subprocess is added into the target business process, the model itself can be adapted, model maintenance is incorporated in the model development process and a feedback loop is established between the target business process and the model development process. These model deployment effects and the related deployment challenges are different in strategic and operational target business processes. Based on these findings, guidelines are formulated which can form a basis for future principles how to successfully deploy data science models. Organisations can use these guidelines as suggestions to solve their own model deployment challenges.
Analyzing historical decision-related data can help support actual operational decision-making processes. Decision mining can be employed for such analysis. This paper proposes the Decision Discovery Framework (DDF) designed to develop, adapt, or select a decision discovery algorithm by outlining specific guidelines for input data usage, classifier handling, and decision model representation. This framework incorporates the use of Decision Model and Notation (DMN) for enhanced comprehensibility and normalization to simplify decision tables. The framework’s efficacy was tested by adapting the C4.5 algorithm to the DM45 algorithm. The proposed adaptations include (1) the utilization of a decision log, (2) ensure an unpruned decision tree, (3) the generation DMN, and (4) normalize decision table. Future research can focus on supporting on practitioners in modeling decisions, ensuring their decision-making is compliant, and suggesting improvements to the modeled decisions. Another future research direction is to explore the ability to process unstructured data as input for the discovery of decisions.
MULTIFILE