Preprint submitted to Information Processing & Management Tags are a convenient way to label resources on the web. An interesting question is whether one can determine the semantic meaning of tags in the absence of some predefined formal structure like a thesaurus. Many authors have used the usage data for tags to find their emergent semantics. Here, we argue that the semantics of tags can be captured by comparing the contexts in which tags appear. We give an approach to operationalizing this idea by defining what we call paradigmatic similarity: computing co-occurrence distributions of tags with tags in the same context, and comparing tags using information theoretic similarity measures of these distributions, mostly the Jensen-Shannon divergence. In experiments with three different tagged data collections we study its behavior and compare it to other distance measures. For some tasks, like terminology mapping or clustering, the paradigmatic similarity seems to give better results than similarity measures based on the co-occurrence of the documents or other resources that the tags are associated to. We argue that paradigmatic similarity, is superior to other distance measures, if agreement on topics (as opposed to style, register or language etc.), is the most important criterion, and the main differences between the tagged elements in the data set correspond to different topics
We review over 10 years of research at Elsevier and various Dutch academic institutions on establishing a new format for the scientific research article. Our work rests on two main theoretical principles: the concept of modular documents, consisting of content elements that can exist and be published independently and are linked by meaningful relations, and the use of semantic data standards allowing access to heterogeneous data. We discuss the application of these concepts in five different projects: a modular format for physics articles, an XML encyclopedia in pharmacology, a semantic data integration project, a modular format for computer science proceedings papers, and our current work on research articles in cell biology.
Gamma-band neuronal synchronization during sentence-level language comprehension has previously been linked with semantic unification. Here, we attempt to further narrow down the functional significance of gamma during language comprehension, by distinguishing between two aspects of semantic unification: successful integration of word meaning into the sentence context, and prediction of upcoming words. We computed eventrelated potentials (ERPs) and frequency band-specific electroencephalographic (EEG) power changes while participants read sentences that contained a critical word (CW) that was (1) both semantically congruent and predictable (high cloze, HC), (2) semantically congruent but unpredictable (low cloze, LC), or (3) semantically incongruent (and therefore also unpredictable; semantic violation, SV). The ERP analysis showed the expected parametric N400 modulation (HC < LC < SV). The time-frequency analysis showed qualitatively different results. In the gamma-frequency range, we observed a power increase in response to the CW in the HC condition, but not in the LC and the SV conditions. Additionally, in the theta frequency range we observed a power increase in the SV condition only. Our data provide evidence that gamma power increases are related to the predictability of an upcoming word based on the preceding sentence context, rather than to the integration of the incoming word's semantics into the preceding context. Further, our theta band data are compatible with the notion that theta band synchronization in sentence comprehension might be related to the detection of an error in the language input.
MULTIFILE
The primary objective of this research is to develop the new practical methodology of "Vocalising Resonance", as a development of existing methods. The approach intends to involve artists and citizens in a multi-sensory engagement with their surrounding environment, by collectively utilizing the 'physical', non-semantic voice as a tool for communication. This methodology's creation is rooted in the application of artistic research to well-formulated arguments within contemporary humanities studies. It also incorporates existing approaches for engaging with sound and voice in a multisensory and multidisciplinary manner. Musicologist, Nina Eidsheim argues that music research should not solely focus on the figure of sound, but should also delve into the materiality of sound as vibration (Eidsheim, Sensing Sound). Similarly, this research aims to explore the voice as an experience connected to the whole body and senses, rather than simply hearing. Several existing strategies offer starting points. The second objective of this research is to investigate how the proposed methodology of "Vocalising Resonance" may have a transformative impact on participants' consciousness, by increasing their exposure to diversity and facilitating their realisation that persons in the literal meaning of the word are “the speaking through - sonare + per- the masks that make up our social selves”(Scapp, A Question of Voice). Broadening the scope beyond traditional norms, the project paves the way for a more inclusive, sensorially rich understanding of performing arts, inviting a holistic approach that goes beyond typical categorisations, enhancing the immersive nature of performances. It prompts reflections on the significance of an audience to define a voice, the essence of a voice irrespective of language, and how the non-semantic use of voice serves as a powerful interdisciplinary communication tool within abstract performance contexts. This process seeks to offer a new perspective and transform our practices.