A common strategy to assign keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for a word to be selected as keyword is its relevance for the text. The tf.idf score of a term is a widely used relevance measure. While easy to compute and giving quite satisfactory results, this measure does not take (semantic) relations between words into account. In this paper we study some alternative relevance measures that do use relations between words. They are computed by defining co-occurrence distributions for words and comparing these distributions with the document and the corpus distribution. We then evaluate keyword extraction algorithms defined by selecting different relevance measures. For two corpora of abstracts with manually assigned keywords, we compare manually extracted keywords with different automatically extracted ones. The results show that using word co-occurrence information can improve precision and recall over tf.idf.
DOCUMENT
The research described in this paper provides insights into tools and methods which are used by professional information workers to keep and to manage their personal information. A literature study was carried out on 23 scholar papers and articles, retrieved from the ACM Digital Library and Library and Information Science Abstracts (LISA). The research questions were: - How do information workers keep and manage their information sources? - What aims do they have when building personal information collections? - What problems do they experience with the use and management of their personal collections? The main conclusion from the literature is that professional information workers use different tools and approaches for personal information management, depending on their personal style, the types of information in their collections and the devices which they use for retrieval. The main problem that they experience is that of information fragmentation over different collections and different devices. These findings can provide input for improvement of information literacy curricula in Higher Education. It has been remarked that scholar research and literature on Personal Information Management do not pay a lot of attention to the keeping and management of (bibliographic) data from external documentation. How people process the information from those sources and how this stimulates their personal learning, is completely overlooked. [The original publication is available at www.elpub.net]
DOCUMENT
A common strategy to assign keywords to documents is to select the most appropriate words from the document text. One of the most important criteria for a word to be selected as keyword is its relevance for the text. The tf.idf score of a term is a widely used relevance measure. While easy to compute and giving quite satisfactory results, this measure does not take (semantic) relations between words into account.
DOCUMENT
This study explores how journalists in highspeed newsrooms gather information, how gathering activities are temporally structured and how reliability manifests itself in information-gathering activities.
DOCUMENT
The objective of this paper is a reflective discussion on the validity of the construct Information Literacy in the perspective of changing information and communication technologies. The research question that will be answered is: what is the impact of technological developments on the relevance of the Information Literacy concept? Technological developments that will be discussed are: - content integration (federated search engines) - amateur publishing (user generated content) - use of social networks to find information - personalisation and push technology - loss of context / fragmentation of information. Research methods: desk research and critical analysis of the results that were found. The analysis of the influence of the discussed technologies on the Information Literacy concept is represented by arrow diagrams. Findings: The Information Literacy concept refers to a set of sub skills varying from retrieval skills to critical use of scholar information. Changing technologies reduce the significance of the more instrumental sub skills of the Information Literacy concept. On the other hand, higher order cognitive skills (for instance critical evaluation of resources and analysis of content) become more and more important for students and professionals who try to solve their information problems. The paper concludes with a description of the facets of the Information Literacy concept that need extra attention in the education of the knowledge workers of the future. [De hier gepubliceerde versie is het 'accepted paper' van het origineel dat is gepubliceerd op www.springerlink.com . De officiële publicatie kan worden gedownload op http://www.springerlink.com/content/n32j3um878720h40/abstract/]
DOCUMENT
The central thesis of this book is that access to information represents a vital aspect of contemporary society, encompassing participation, accountability, governance, transparency, the production of products, and the delivery of services. This view is widely shared, with commentators and scholars agreeing that access to information is a key factor in maintaining societal and economic stability. However, having access to information does not guarantee its accessibility. Assuming that information is (cognitively) interpretable is incorrect, as many practical examples illustrate. In the first chapter, this book offers insights into the challenge of access to information in a digitalized world. The concepts of access and accessibility are addressed, elucidating their meanings and delineating the ways in which they are influenced by the exponential growth of information. It examines how information technology introduces a novel access paradox. The second chapter examines the challenges to access to and accessibility of information in a digitalized, hybrid world where code may be law, where there is an inescapable loss of privacy, where doing business opens and restricts access, where literacy is a necessity to survive ‘digital divides,’ and where environmental concerns may have an adverse effect on high expectations. The third chapter presents a review of theoretical approaches to access and accessibility from seven different research perspectives: information access disparity, information seeking, information retrieval, information quality, information security, information management, and archives management. Six approaches to information access and accessibility are identified: [1] social, economic, and political participation; [2] ‘smart’ and evolving technology; [3] power and control; [4] sense-making; [5] knowledge representations, and [6] information survival. The fourth chapter addresses the bottlenecks and requirements for information access and accessibility, culminating in a checklist for organizations to assess these requirements within their own business processes. In the fifth chapter, some perspectives on artificial intelligence and the future of information access are presented. The sixth chapter represents an attempt to draw conclusions and to bring this book to a close.
DOCUMENT
From the article: "Abstract Maintenance processes of Dutch housing associations are often still organized in a traditional manner. Contracts are based on lowest price instead of ‘best quality for lowest price’ considering users’ demands. Dutch housing associations acknowledge the need to improve their maintenance processes in order to lower maintenance cost, but are not sure how. In this research, this problem is addressed by investigating different supply chain partnering principles and the role of information management. The main question is “How can the organisation of maintenance processes of Dutch housing associations, in different supply chain partnering principles and the related information management, be improved?” The answer is sought through case study research."
DOCUMENT
Background: Low-educated patients are disadvantaged in using questionnaires within the health care setting because most health-related questionnaires do not take the educational background of patients into account. The Dutch Talking Touch Screen Questionnaire (DTTSQ) was developed in an attempt to meet the needs of low-educated patients by using plain language and adding communication technology to an existing paper-based questionnaire. For physical therapists to use the DTTSQ as part of their intake procedure, it needs to generate accurate information from all of their patients, independent of educational level. Objective: The aim of this study was to get a first impression of the information that is generated by the DTTSQ. To achieve this goal, response processes of physical therapy patients with diverse levels of education were analyzed. Methods: The qualitative Three-Step Test-Interview method was used to collect observational data on actual response behavior of 24 physical therapy patients with diverse levels of education. The interviews included both think-aloud and retrospective probing techniques. Results: Of the 24 respondents, 20 encountered one or more problems during their response process. The use of plain language and information and communication technology (ICT) appeared to have a positive effect on the comprehensibility of the DTTSQ. However, it also had some negative effects on the interpretation, retrieval, judgment, and response selection within the response processes of the participants in this study. No educational group in this research population stood out from the rest in the kind or number of problems that arose. All respondents recognized themselves in the outcomes of the questionnaire. Conclusions: The use of plain language and ICT within the DTTSQ had both positive and negative effects on the response processes of its target population. The results of this study emphasize the importance of earlier recommendations to accompany any adaption of any questionnaire to a new mode of delivery by demonstrating the difference and equivalence between the two different modes and to scientifically evaluate the applicability of the newly developed mode of the questionnaire in its intended setting. This is especially important in a digital era in which the use of plain language within health care is increasingly being advocated.
LINK
The presentation of management information on screens and paper is aimed at the initiation of control actions in order to bring about predefinied goals. The terms and concepts used in this control information can be interptreted in different ways. It is of vital importance that adequate definitions for these terms and concepts are provided, because of the area of tension betrween those that control and those being controlled. The creation of a common conceptual framework and the maintenance of concepts and definitions can be supported by the construction of an organization-specific lexicon and the use of modern IT tools.
DOCUMENT
Although most authors on Information Literacy do not really differ in their definitions of the information literacy concept, phenomenographic research makes clear that in the context of education at least two different conceptions can be distinguished: an “Information Problem Solving” conception and a “Personal Knowledge Base” conception [1]. The conception of “Information Problem Solving” has been elaborated on in various models by many researchers but the operationalization of the “Personal Knowledge Base conception” has, until now, been ignored in LIS research. Based on educational literature a model for the content of a “Personal Knowledge Base” will be proposed. Two kinds of internalized knowledge are distinguished: the body of knowledge of the discipline and metacognitive knowledge. Both of these elements display sub content. This conception of information literacy as a “Personal Knowledge Base” is consistent with the idea that “learning to learn” is one of the main goals of Higher Education. Copyright / opmerkingen: De hier gepubliceerde versie is het 'accepted paper' van het origineel dat is gepubliceerd op www.springerlink.com . De officiële publicatie kan worden gedownload op http://link.springer.com/chapter/10.1007/978-3-319-14136-7_4
DOCUMENT