E-discovery projects typically start with an assessment of the collected electronic data in order to estimate the risk to prosecute or defend a legal case. This is not a review task but is appropriately called early case assessment, which is better known as exploratory search in the information retrieval community. This paper first describes text mining methodologies that can be used for enhancing exploratory search. Based on these ideas we present a semantic search dashboard that includes entities that are relevant to investigators such as who knew who, what, where and when. We describe how this dashboard can be powered by results from our ongoing research in the “Semantic Search for E-Discovery” project on topic detection and clustering, semantic enrichment of user profiles, email recipient recommendation, expert finding and identity extraction from digital forensic evidence.
MULTIFILE
Corporate reputation is becoming increasingly important for firms; social media platforms such as Twitter are used to convey their message. In this paper, corporate reputation will be assessed from a sustainability perspective. Using sentiment analysis, the top 100 brands of the Netherlands were scraped and analyzed. The companies were registered in the sustainable industry classification system (SICS) to perform the analysis on an industry level. A semantic search tool called Open Semantic Desktop Search was used to filter through the data to find keywords related to sustainability and corporate reputation. Findings show that companies that tweet more often about corporate reputation and sustainability receive overall a more positive sentiment from the public.
DOCUMENT
1e alinea column: De ontstellende hoeveelheid informatie en contactmogelijkheden op internet stelt ons voor de keuze wie we willen zijn en volgens welke waarden we willen leven. Waar Internet 1.0 nog vooral gezien kon worden als een grote database met Google als markt-hit, speelt in het semantic web sociale interactie een grote rol. In het semantic web kan alle data en dus bijvoorbeeld ook al uw berichtjes, profielgegevens, bestandjes en teksten en dat van anderen, nog gemakkelijker verspreid, gecombineerd, maar ook geanalyseerd en op maat worden gepresenteerd. Op iedere unieke vraag of zoekopdracht direct dus een uniek antwoord.
LINK
We present our ongoing work on upgrading the Amsterdam Public Library's book database search capabilities. So far, users have had to input the exact book title and/or author name without any typos or misspellings in order to retrieve any results. This is in sharp contrast with the manner in which users typically use the interface: they frequently search for books on a particular topic, input the names of the characters, or even ask fully-fledged questions. The aim of this project is therefore to enable smart search in natural language based on book content. The initial focus is on the Dutch language, with the possibility of including English and other languages later. In the first phase of the project, we built a proof-of-concept knowledge graph from a sample of the existing tabular database and enriched the data with named entities extracted from book summaries. Based on this first step, a user query like "Heeft u boeken over de Tweede Wereldoorlog in Amsterdam?" would yield all books that mention both WW2 and Amsterdam. We are currently working on augmenting the knowledge graph with embeddings, which will enable us to retrieve semantically similar results. The final step of the research involves integrating our knowledge graph with a pre-trained large language model.
DOCUMENT
Peer-reviewed artikel over semantische segmentatie van point clouds.
MULTIFILE
The research project In search of pedagogical sensitivity is executed from the research department of the knowledge circle renewing methods and didactics for teacher education and training of the Hogeschool Utrecht in the Netherlands under supervision of Hans Jansen (associated professor of the Hogeschool Utrecht - chair: renewing methods and didactics for teacher education and training) by Karel Mulderij, Renée van der Linde and Loes Houweling (all senior teachers and senior researchers of the Hogeschool Utrecht and members of the knowledge circle renewing methods and didactics for teacher education and training) with assistance of 25 students (teachers) studying in a three year Master course Ecological Pedagogy.
DOCUMENT
Gamma-band neuronal synchronization during sentence-level language comprehension has previously been linked with semantic unification. Here, we attempt to further narrow down the functional significance of gamma during language comprehension, by distinguishing between two aspects of semantic unification: successful integration of word meaning into the sentence context, and prediction of upcoming words. We computed eventrelated potentials (ERPs) and frequency band-specific electroencephalographic (EEG) power changes while participants read sentences that contained a critical word (CW) that was (1) both semantically congruent and predictable (high cloze, HC), (2) semantically congruent but unpredictable (low cloze, LC), or (3) semantically incongruent (and therefore also unpredictable; semantic violation, SV). The ERP analysis showed the expected parametric N400 modulation (HC < LC < SV). The time-frequency analysis showed qualitatively different results. In the gamma-frequency range, we observed a power increase in response to the CW in the HC condition, but not in the LC and the SV conditions. Additionally, in the theta frequency range we observed a power increase in the SV condition only. Our data provide evidence that gamma power increases are related to the predictability of an upcoming word based on the preceding sentence context, rather than to the integration of the incoming word's semantics into the preceding context. Further, our theta band data are compatible with the notion that theta band synchronization in sentence comprehension might be related to the detection of an error in the language input.
MULTIFILE
DOCUMENT
Differences in the oscillatory EEG dynamics of reading open class (OC) and closed class (CC) words have previously been found (Bastiaansen et al., 2005) and are thought to reflect differences in lexical-semantic content between these word classes. In particu-lar, the theta-band (4-7 Hz) seems to play a prominent role in lexical-semantic retrieval. We tested whether this theta effect is robust in an older population of subjects. Additionally, we examined how the context of a word can modulate the oscillatory dynamics underly-ing retrieval for the two different classes of words. Older participants (mean age 55) read words presented in either syntactically correct sentences or in a scrambled order ("scram-bled sentence") while their EEG was recorded. We performed time-frequency analysis to examine how power varied based on the context or class of the word. We observed larger power decreases in the alpha (8-12 Hz) band between 200-700 ms for the OC compared to CC words, but this was true only for the scrambled sentence context. We did not observe differences in theta power between these conditions. Context exerted an effect on the alpha and low beta (13-18 Hz) bands between 0 and 700 ms. These results suggest that the previously observed word class effects on theta power changes in a younger participant sample do not seem to be a robust effect in this older population. Though this is an indi-rect comparison between studies, it may suggest the existence of aging effects on word retrieval dynamics for different populations. Additionally, the interaction between word class and context suggests that word retrieval mechanisms interact with sentence-level comprehension mechanisms in the alpha-band.
MULTIFILE
1e alinea column: De grote beweging via ketenomkering naar customer self care en bottom-up self assembled teaming is zich snel aan het voltrekken. De klant neemt het initiatief en Tofflers prosumership wordt zichtbaar. Het aantal business voorbeelden wordt snel groter, al gaat het om je auto zelf samenstellen, onderdelen bestellen, 3D printing, zelfroosteren, civil journalism, klanten die restaurants recenseren, tracking &tracing van de post, medische zorg. Neem Qlinx als open architectuur in combinatie met bijvoorbeeld Twitter, dat laat goed zien wat dit kan gaan betekenen voor de dynamiek op de arbeidsmarkt. Wolfram-alpha toont de potentie van het semantic web. In bijvoorbeeld Share2Start - power of the open mind zien we de kracht van crowdfunding en het begin van ‘financials 2.0’. Deze sites laten goed zien welke richting het uitgaat.
LINK