The platform for open and practice-oriented research

product

Who is Involved?

E-discovery projects typically start with an assessment of the collected electronic data in order to estimate the risk to prosecute or defend a legal case. This is not a review task but is appropriately called early case assessment, which is better known as exploratory search in the information retrieval community. This paper first describes text mining methodologies that can be used for enhancing exploratory search. Based on these ideas we present a semantic search dashboard that includes entities that are relevant to investigators such as who knew who, what, where and when. We describe how this dashboard can be powered by results from our ongoing research in the “Semantic Search for E-Discovery” project on topic detection and clustering, semantic enrichment of user profiles, email recipient recommendation, expert finding and identity extraction from digital forensic evidence.

MULTIFILE

product

Corporate Reputation of Companies on Twitter Seen from a Sustainability Perspective

Corporate reputation is becoming increasingly important for firms; social media platforms such as Twitter are used to convey their message. In this paper, corporate reputation will be assessed from a sustainability perspective. Using sentiment analysis, the top 100 brands of the Netherlands were scraped and analyzed. The companies were registered in the sustainable industry classification system (SICS) to perform the analysis on an industry level. A semantic search tool called Open Semantic Desktop Search was used to filter through the data to find keywords related to sustainability and corporate reputation. Findings show that companies that tweet more often about corporate reputation and sustainability receive overall a more positive sentiment from the public.

DOCUMENT

Corporate Reputation of Companies on Twitter Seen from a Sustainability Perspective

product

Creëren of Reageren?

1e alinea column: De ontstellende hoeveelheid informatie en contactmogelijkheden op internet stelt ons voor de keuze wie we willen zijn en volgens welke waarden we willen leven. Waar Internet 1.0 nog vooral gezien kon worden als een grote database met Google als markt-hit, speelt in het semantic web sociale interactie een grote rol. In het semantic web kan alle data en dus bijvoorbeeld ook al uw berichtjes, profielgegevens, bestandjes en teksten en dat van anderen, nog gemakkelijker verspreid, gecombineerd, maar ook geanalyseerd en op maat worden gepresenteerd. Op iedere unieke vraag of zoekopdracht direct dus een uniek antwoord.

LINK

product

Improving Book Search with AI

We present our ongoing work on upgrading the Amsterdam Public Library's book database search capabilities. So far, users have had to input the exact book title and/or author name without any typos or misspellings in order to retrieve any results. This is in sharp contrast with the manner in which users typically use the interface: they frequently search for books on a particular topic, input the names of the characters, or even ask fully-fledged questions. The aim of this project is therefore to enable smart search in natural language based on book content. The initial focus is on the Dutch language, with the possibility of including English and other languages later. In the first phase of the project, we built a proof-of-concept knowledge graph from a sample of the existing tabular database and enriched the data with named entities extracted from book summaries. Based on this first step, a user query like "Heeft u boeken over de Tweede Wereldoorlog in Amsterdam?" would yield all books that mention both WW2 and Amsterdam. We are currently working on augmenting the knowledge graph with embeddings, which will enable us to retrieve semantically similar results. The final step of the research involves integrating our knowledge graph with a pre-trained large language model.

DOCUMENT

product

Get in or get lost, Social Business or no Business

1e alinea column: De grote beweging via ketenomkering naar customer self care en bottom-up self assembled teaming is zich snel aan het voltrekken. De klant neemt het initiatief en Tofflers prosumership wordt zichtbaar. Het aantal business voorbeelden wordt snel groter, al gaat het om je auto zelf samenstellen, onderdelen bestellen, 3D printing, zelfroosteren, civil journalism, klanten die restaurants recenseren, tracking &tracing van de post, medische zorg. Neem Qlinx als open architectuur in combinatie met bijvoorbeeld Twitter, dat laat goed zien wat dit kan gaan betekenen voor de dynamiek op de arbeidsmarkt. Wolfram-alpha toont de potentie van het semantic web. In bijvoorbeeld Share2Start - power of the open mind zien we de kracht van crowdfunding en het begin van ‘financials 2.0’. Deze sites laten goed zien welke richting het uitgaat.

LINK

product

Integration or predictability? A further specification of the functional role of gamma oscillations in language comprehension

Gamma-band neuronal synchronization during sentence-level language comprehension has previously been linked with semantic unification. Here, we attempt to further narrow down the functional significance of gamma during language comprehension, by distinguishing between two aspects of semantic unification: successful integration of word meaning into the sentence context, and prediction of upcoming words. We computed eventrelated potentials (ERPs) and frequency band-specific electroencephalographic (EEG) power changes while participants read sentences that contained a critical word (CW) that was (1) both semantically congruent and predictable (high cloze, HC), (2) semantically congruent but unpredictable (low cloze, LC), or (3) semantically incongruent (and therefore also unpredictable; semantic violation, SV). The ERP analysis showed the expected parametric N400 modulation (HC < LC < SV). The time-frequency analysis showed qualitatively different results. In the gamma-frequency range, we observed a power increase in response to the CW in the HC condition, but not in the LC and the SV conditions. Additionally, in the theta frequency range we observed a power increase in the SV condition only. Our data provide evidence that gamma power increases are related to the predictability of an upcoming word based on the preceding sentence context, rather than to the integration of the incoming word's semantics into the preceding context. Further, our theta band data are compatible with the notion that theta band synchronization in sentence comprehension might be related to the detection of an error in the language input.

MULTIFILE

product

Distributional Semantics of Tags

Preprint submitted to Information Processing & Management Tags are a convenient way to label resources on the web. An interesting question is whether one can determine the semantic meaning of tags in the absence of some predefined formal structure like a thesaurus. Many authors have used the usage data for tags to find their emergent semantics. Here, we argue that the semantics of tags can be captured by comparing the contexts in which tags appear. We give an approach to operationalizing this idea by defining what we call paradigmatic similarity: computing co-occurrence distributions of tags with tags in the same context, and comparing tags using information theoretic similarity measures of these distributions, mostly the Jensen-Shannon divergence. In experiments with three different tagged data collections we study its behavior and compare it to other distance measures. For some tasks, like terminology mapping or clustering, the paradigmatic similarity seems to give better results than similarity measures based on the co-occurrence of the documents or other resources that the tags are associated to. We argue that paradigmatic similarity, is superior to other distance measures, if agreement on topics (as opposed to style, register or language etc.), is the most important criterion, and the main differences between the tagged elements in the data set correspond to different topics

DOCUMENT

product

Word class and context affect alpha-band oscillatory dynamics in an older population.

Differences in the oscillatory EEG dynamics of reading open class (OC) and closed class (CC) words have previously been found (Bastiaansen et al., 2005) and are thought to reflect differences in lexical-semantic content between these word classes. In particu-lar, the theta-band (4-7 Hz) seems to play a prominent role in lexical-semantic retrieval. We tested whether this theta effect is robust in an older population of subjects. Additionally, we examined how the context of a word can modulate the oscillatory dynamics underly-ing retrieval for the two different classes of words. Older participants (mean age 55) read words presented in either syntactically correct sentences or in a scrambled order ("scram-bled sentence") while their EEG was recorded. We performed time-frequency analysis to examine how power varied based on the context or class of the word. We observed larger power decreases in the alpha (8-12 Hz) band between 200-700 ms for the OC compared to CC words, but this was true only for the scrambled sentence context. We did not observe differences in theta power between these conditions. Context exerted an effect on the alpha and low beta (13-18 Hz) bands between 0 and 700 ms. These results suggest that the previously observed word class effects on theta power changes in a younger participant sample do not seem to be a robust effect in this older population. Though this is an indi-rect comparison between studies, it may suggest the existence of aging effects on word retrieval dynamics for different populations. Additionally, the interaction between word class and context suggests that word retrieval mechanisms interact with sentence-level comprehension mechanisms in the alpha-band.

MULTIFILE

product

A company’s corporate reputation through the eyes of employees measured with sentiment analysis of online reviews

DOCUMENT

Search results

Products 146

Who is Involved?

Corporate Reputation of Companies on Twitter Seen from a Sustainability Perspective

Creëren of Reageren?

Improving Book Search with AI

Get in or get lost, Social Business or no Business

Integration or predictability? A further specification of the functional role of gamma oscillations in language comprehension

Distributional Semantics of Tags

Word class and context affect alpha-band oscillatory dynamics in an older population.

A company’s corporate reputation through the eyes of employees measured with sentiment analysis of online reviews

People 3

Jesse Aarden

Nádia Oliveira

Eric Blaauw

Projects 1

The Resonance of Vocalising: Performative imaginings of future ecological states of being through collective vocal and listening practices

Navigate to

Categories

Filters

Products 146

Who is Involved?

Corporate Reputation of Companies on Twitter Seen from a Sustainability Perspective

Creëren of Reageren?

Improving Book Search with AI

Get in or get lost, Social Business or no Business

Integration or predictability? A further specification of the functional role of gamma oscillations in language comprehension

Distributional Semantics of Tags

Word class and context affect alpha-band oscillatory dynamics in an older population.

A company’s corporate reputation through the eyes of employees measured with sentiment analysis of online reviews

People 3

Jesse Aarden

Nádia Oliveira

Eric Blaauw

Projects 1

The Resonance of Vocalising: Performative imaginings of future ecological states of being through collective vocal and listening practices