We present our ongoing work on upgrading the Amsterdam Public Library's book database search capabilities. So far, users have had to input the exact book title and/or author name without any typos or misspellings in order to retrieve any results. This is in sharp contrast with the manner in which users typically use the interface: they frequently search for books on a particular topic, input the names of the characters, or even ask fully-fledged questions. The aim of this project is therefore to enable smart search in natural language based on book content. The initial focus is on the Dutch language, with the possibility of including English and other languages later. In the first phase of the project, we built a proof-of-concept knowledge graph from a sample of the existing tabular database and enriched the data with named entities extracted from book summaries. Based on this first step, a user query like "Heeft u boeken over de Tweede Wereldoorlog in Amsterdam?" would yield all books that mention both WW2 and Amsterdam. We are currently working on augmenting the knowledge graph with embeddings, which will enable us to retrieve semantically similar results. The final step of the research involves integrating our knowledge graph with a pre-trained large language model.
LINK
Analyzing historical decision-related data can help support actual operational decision-making processes. Decision mining can be employed for such analysis. This paper proposes the Decision Discovery Framework (DDF) designed to develop, adapt, or select a decision discovery algorithm by outlining specific guidelines for input data usage, classifier handling, and decision model representation. This framework incorporates the use of Decision Model and Notation (DMN) for enhanced comprehensibility and normalization to simplify decision tables. The framework’s efficacy was tested by adapting the C4.5 algorithm to the DM45 algorithm. The proposed adaptations include (1) the utilization of a decision log, (2) ensure an unpruned decision tree, (3) the generation DMN, and (4) normalize decision table. Future research can focus on supporting on practitioners in modeling decisions, ensuring their decision-making is compliant, and suggesting improvements to the modeled decisions. Another future research direction is to explore the ability to process unstructured data as input for the discovery of decisions.
MULTIFILE