

An edition of Intelligent Document Retrieval (2005)
Exploiting Markup Structure (The Information Retrieval Series)
By Udo Kruschwitz
Publish Date
December 1, 2005
Publisher
Springer
Language
English
Pages
214
Description:
Collections of digital documents, such as Web sites and intranets, can now be found in almost every institution, university, and company. But searching them for information can still be painful: searches often return either a large number of matches or no suitable matches at all. Such collections vary widely in size and in how much structure they carry. What they have in common is that they typically have some structure and that they cover a limited range of topics. The second point sets them apart from the Web in general. The type of search system that we propose in this book can suggest ways of refining or relaxing a query to assist the user in the search process. To suggest sensible query modifications, the system needs to know what the documents are about, which requires explicit knowledge about the document collection encoded in some electronic form. Such knowledge is typically not available, so we construct it automatically.
Subjects: Internet searching, Text processing (Computer science), Document markup languages, Computational linguistics, Information storage and retrieval systems, Documentation, Computer science, Computer system performance, Information systems, Computer Science, general, System Performance and Evaluation, Information Storage and Retrieval, Information Systems Applications (incl. Internet), Document Preparation and Text Processing