Managing large volumes of documents is a growing challenge for organisations. Whether the information systems are existing or new, on-premises or in the cloud, information managers ask what options exist to make information as accessible and findable as possible. Good findability, after all, saves many working hours and prevents incomplete files. Traditional instruments such as taxonomies, thesauri and authority lists still prove their value daily, and technical developments have naturally broadened the possibilities: automatic indexing and classification, ontologies and hyperlinking are valuable additions. In this book we cover important methods and techniques for making an organisation's information (documents) findable. The theory of information accessibility is treated from the ground up, and many examples illustrate techniques and instruments such as taxonomy, thesaurus, ontology, search engine and classification, including step-by-step plans for getting started with them yourself. Because SharePoint is a widely used platform for managing and sharing documents, we devote a separate chapter to how documents within SharePoint can be made as findable as possible. The chapters alternate with boxed texts that deal specifically with related topics such as XML, machine learning and card sorting. Anyone involved in practice with implementing an information system, or training to become an information professional, can draw on the extensive descriptions and guidance this work offers. Because students form part of the target audience, this work can be downloaded free of charge as an open textbook under a Creative Commons licence.
Original document: https://udocstore.nl/docs/9789492388001 Joyce van Aalten (lecturer at GO Opleidingen), Peter Becker (lecturer in information management at the Haagse Hogeschool and GO Opleidingen), Marjolein van der Linden (lecturer at the Hogeschool van Amsterdam, in the Media, Informatie en Communicatie programme and the Communicatie programme), Eric Sieverts (lecturer at VOGIN and GO Opleidingen)
DOCUMENT
In this article, we present CoPub 5.0, a publicly available text mining system that uses Medline abstracts to calculate robust statistics for keyword co-occurrences. CoPub was initially developed for the analysis of microarray data, but we broadened its scope by implementing new technology and new thesauri. In CoPub 5.0, we integrated existing CoPub technology with new features and provided a new advanced interface, which can be used to answer a variety of biological questions. CoPub 5.0 allows searching for keywords of interest and their relations to curated thesauri, and provides highlighting and sorting mechanisms that use its statistics to retrieve the most important abstracts in which the terms co-occur. It also provides a way to search for indirect relations between genes, drugs, pathways and diseases, following an ABC principle, in which A and C have no direct connection but are connected via shared B intermediates. With CoPub 5.0, it is possible to create, annotate and analyze networks using the layout and highlight options of Cytoscape Web, allowing for literature-based systems biology. Finally, the operations of the CoPub 5.0 Web service make it possible to integrate the CoPub technology into bioinformatics workflows. CoPub 5.0 can be accessed through the CoPub portal http://www.copub.org. © 2011 The Author(s).
DOCUMENT
This article is based on five years of longitudinal participatory action research on how former pre‐bachelor programme students with a refugee background experience finding their way into Dutch higher education and society. The four‐member research team and authors (two of whom were former refugees) found that refugee students face a significant barrier of “us‐versus‐them,” especially in an educational context. We explored how creative co‐creation contributed to rethinking difference and sameness in higher education by breaking through or transcending this divide. Creative co‐creation through play, storytelling, or constructing artefacts enables “alterity,” approaching the other from the other’s position. Movement and action help to shape the world around us: Connecting and shifting positions creates sameness while leaving space for difference. Creative co‐creation during our research process included making co‐creation artefacts and activities, thus involving outreach to broader audiences for engagement. In the research process, it became clear that successful participation matters to all students and provides more opportunities for all, not just refugee students. A new notion of “we” in Dutch higher education and society that does not perpetuate the divide between “us” and “them” requires a shared responsibility. Higher education needs the university authorities and the teachers to make room for student stories and should provide spaces for dialogue and community development.
LINK
A common strategy to assign keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for a word to be selected as a keyword is its relevance to the text. The tf.idf score of a term is a widely used relevance measure. While it is easy to compute and gives quite satisfactory results, this measure does not take (semantic) relations between words into account. In this paper we study some alternative relevance measures that do use relations between words. They are computed by defining co-occurrence distributions for words and comparing these distributions with the document and the corpus distribution. We then evaluate keyword extraction algorithms defined by selecting different relevance measures. For two corpora of abstracts with manually assigned keywords, we compare the manually assigned keywords with different automatically extracted ones. The results show that using word co-occurrence information can improve precision and recall over tf.idf.
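As an illustration of the tf.idf baseline mentioned in this abstract, the following is a minimal sketch of keyword selection by tf.idf score. It assumes whitespace tokenisation, raw term counts and an unsmoothed logarithmic idf; the function name `tf_idf_keywords` and the toy sentences are hypothetical and are not taken from the paper, which may use a different weighting variant.

```python
import math
from collections import Counter


def tf_idf_keywords(docs, doc_index, top_k=3):
    """Rank the terms of one document by tf.idf against a small corpus.

    Assumptions (not from the paper): lowercase whitespace tokens,
    tf = raw count, idf = log(N / document frequency).
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents does each term occur?
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Term frequency within the document we want keywords for.
    tf = Counter(tokenized[doc_index])
    scores = {
        term: count * math.log(n_docs / df[term])
        for term, count in tf.items()
    }
    # Highest score first; ties are broken alphabetically for a stable order.
    return sorted(scores, key=lambda t: (-scores[t], t))[:top_k]


if __name__ == "__main__":
    corpus = [
        "the cat sat on the mat",
        "the dog sat on the log",
        "the cat chased the dog",
    ]
    print(tf_idf_keywords(corpus, 0))
```

A term such as "the", which occurs in every document, receives a score of zero here, while a term unique to one document ranks highest. The score of each term is computed independently of all other terms, which is the limitation the abstract points to.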
DOCUMENT
The scientific literature represents a rich source for retrieval of knowledge on associations between biomedical concepts such as genes, diseases and cellular processes. A commonly used method to establish relationships between biomedical concepts from literature is co-occurrence. Apart from its use in knowledge retrieval, the co-occurrence method is also well suited to discovering new, hidden relationships between biomedical concepts following a simple ABC-principle, in which A and C have no direct relationship, but are connected via shared B-intermediates. In this paper we describe CoPub Discovery, a tool that mines the literature for new relationships between biomedical concepts. Statistical analysis using ROC curves showed that CoPub Discovery performed well over a wide range of settings and keyword thesauri. We subsequently used CoPub Discovery to search for new relationships between genes, drugs, pathways and diseases. Several of the newly found relationships were validated using independent literature sources. In addition, new predicted relationships between compounds and cell proliferation were validated and confirmed experimentally in an in vitro cell proliferation assay. The results show that CoPub Discovery is able to identify novel associations between genes, drugs, pathways and diseases that have a high probability of being biologically valid. This makes CoPub Discovery a useful tool to unravel the mechanisms behind disease, to find novel drug targets, or to find novel applications for existing drugs. © 2010 Frijters et al.
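The ABC-principle described in this abstract can be sketched as a search over a co-occurrence graph. The example below is only an illustration under simplifying assumptions: co-occurrences are given as unweighted pairs, the function name `abc_candidates` and the concept names are hypothetical, and the statistical scoring that CoPub Discovery applies to rank candidates is left out entirely.

```python
from collections import defaultdict


def abc_candidates(cooccurrences, a):
    """Return {C: set of shared B intermediates} for a start concept A.

    A concept C qualifies when it never co-occurs with A directly but
    co-occurs with at least one B that does co-occur with A.
    """
    # Build a symmetric neighbour map from the co-occurrence pairs.
    neighbours = defaultdict(set)
    for x, y in cooccurrences:
        neighbours[x].add(y)
        neighbours[y].add(x)

    direct = neighbours[a]  # all B concepts linked directly to A
    candidates = defaultdict(set)
    for b in direct:
        for c in neighbours[b]:
            # Skip A itself and anything already directly linked to A.
            if c != a and c not in direct:
                candidates[c].add(b)
    return dict(candidates)


if __name__ == "__main__":
    # Hypothetical toy data; the names are placeholders, not real findings.
    pairs = [
        ("gene_A", "process_B1"),
        ("gene_A", "process_B2"),
        ("gene_A", "drug_D"),
        ("process_B1", "disease_C"),
        ("process_B2", "disease_C"),
        ("drug_D", "process_B1"),
    ]
    print(abc_candidates(pairs, "gene_A"))
```

In this toy data, `disease_C` is reported with both intermediates, while `drug_D` is excluded because it already co-occurs with `gene_A`. A practical system would also need to weight each A-B and B-C link, for example by a co-occurrence statistic, before ranking the candidates.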
DOCUMENT
Discussion of the competencies that the 'Digital Librarian' should possess in the areas of management and organisation, managing information sources, managing information services, and applying ICT tools. The chapter ends with 10 requirements that one might set for the 'ideal digital librarian'. [Peter Becker and Jos van Helvoort]
DOCUMENT
The SHB asked Futureconsult to start a guidance process for developing a vision of the future of the libraries of universities of applied sciences (Hogeschool Bibliotheken). What does the future of librarianship in higher professional education look like? Which forms of collaboration fit with it? Which ICT applications will make their entrance? The SHB wanted to carry out a foresight study around these and other developments in order to develop a robust strategy for the future. To gain a better view of the future, Futureconsult developed four scenarios in an interactive process with the SHB, as a starting point for creating vision and strategy. This final report presents the results of the scenario project. The introduction is followed by an explanation of the scenario method applied in this project. The four future scenarios then follow. For each scenario, the report gives the reactions from the first discussion of the scenarios at the closing meeting on 16 June. The appendix contains a number of interviews conducted for the project, the results of a survey among students, and the records of the interactive workshop and of a debate between SHB board members and students of the Interstedelijk Studenten Overleg (ISO).
DOCUMENT
Preprint submitted to Information Processing & Management. Tags are a convenient way to label resources on the web. An interesting question is whether one can determine the semantic meaning of tags in the absence of some predefined formal structure like a thesaurus. Many authors have used the usage data for tags to find their emergent semantics. Here, we argue that the semantics of tags can be captured by comparing the contexts in which tags appear. We give an approach to operationalizing this idea by defining what we call paradigmatic similarity: computing co-occurrence distributions of tags with tags in the same context, and comparing tags using information-theoretic similarity measures of these distributions, mostly the Jensen-Shannon divergence. In experiments with three different tagged data collections we study its behavior and compare it to other distance measures. For some tasks, like terminology mapping or clustering, the paradigmatic similarity seems to give better results than similarity measures based on the co-occurrence of the documents or other resources that the tags are associated with. We argue that paradigmatic similarity is superior to other distance measures if agreement on topics (as opposed to style, register, language, etc.) is the most important criterion and the main differences between the tagged elements in the data set correspond to different topics.
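To make the idea of comparing tags through their co-occurrence distributions concrete, the sketch below computes, for each tag, the distribution of other tags it appears with and then compares two such distributions with the Jensen-Shannon divergence. It is a simplified illustration, not the paper's exact definition of paradigmatic similarity: the function names, the treatment of each tagged item as one context and the toy tag sets are all assumptions.

```python
import math
from collections import Counter


def cooccurrence_distribution(tag, tagged_items):
    """Relative frequencies of the tags that appear together with `tag`.

    Assumption: each element of `tagged_items` is the set of tags on one
    resource, and that set is treated as the tag's context.
    """
    counts = Counter()
    for tags in tagged_items:
        if tag in tags:
            counts.update(t for t in tags if t != tag)
    total = sum(counts.values())
    return {t: c / total for t, c in counts.items()} if total else {}


def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence (log base 2) of two discrete distributions.

    Returns 0.0 for identical distributions and 1.0 for distributions
    with no outcomes in common.
    """
    def kl(x, m):
        # Kullback-Leibler divergence of x from the mixture m.
        return sum(px * math.log2(px / m[k]) for k, px in x.items() if px > 0)

    keys = set(p) | set(q)
    m = {k: 0.5 * (p.get(k, 0.0) + q.get(k, 0.0)) for k in keys}
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


if __name__ == "__main__":
    # Hypothetical tagged resources.
    items = [
        {"python", "code", "web"},
        {"java", "code", "web"},
        {"cooking", "recipe"},
    ]
    py = cooccurrence_distribution("python", items)
    jv = cooccurrence_distribution("java", items)
    ck = cooccurrence_distribution("cooking", items)
    print(jensen_shannon_divergence(py, jv))
    print(jensen_shannon_divergence(py, ck))
```

In the toy data, "python" and "java" never occur on the same resource, yet their divergence is low because they share the same neighbouring tags. This is the intuition behind comparing contexts rather than direct co-occurrence.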
DOCUMENT
The Hague University of Applied Sciences has high ambitions in the field of internationalisation. Two out of four priorities in the institutional policy touch on this theme: global citizenship and internationalisation. In order to ensure that the curriculum of the new degree programme HBO ICT meets these priorities, it is interesting to know which international competencies the ICT sector requires. The main research question in this report is: Which international competencies does the ICT sector demand of ICT graduates and how can these be embedded in the curriculum of the new HBO ICT degree programme? The relevance of the question is shown by the fact that 25% of the respondents, ICT graduates, indicated that they actually work abroad for longer or shorter periods. In this research an online survey was held among alumni (n = 315) of the precursors of the HBO ICT degree programme in order to find out which international competencies are important. Interviews with the same target group deepened this information. An online survey among graduation supervisors (n = 202) examined to what extent the graduates master the required skills by the end of their training. This combined information provides the input to develop the new curriculum of the HBO ICT degree programme and its specialisations. The results show that English, and especially English listening and reading skills, are considered to be very important. Our alumni master these skills highly satisfactorily. It was specifically mentioned, however, that alumni must overcome a certain reluctance to speak. Intercultural, personal and social competencies are considered very important. To master these competencies, students should learn by experiencing. This can be done by working together in international teams, but also in national teams as long as they are supervised explicitly on intercultural, personal and social competencies.
As far as the international academic and professional competencies are concerned, internationally accepted professional knowledge in particular is considered important. In these categories the HBO ICT graduates score satisfactorily (a score of 6 or 6.5 out of 10). Depending on the ambitions of the programme, some improvements could be made here. In general, the ICT sector is quite satisfied with the extent to which our students possess the international competencies it considers relevant. However, there are suggestions for improvement, and some of them have already been included in the internationalisation toolkit as part of the development of the HBO ICT curriculum.
DOCUMENT