M. Nozue, FULL-TEXT DATABASE RETRIEVAL USING PARAGRAPHS - IN THE CASE OF JAPANESE TECHNICAL DOCUMENT DATABASE, Library and Information Science, (31), 1993, pp. 79-131
In these days the online full-text databases are increasing, but these
full-text databases are difficult to retrieve, because recall is high
er than bibliographic databases, and precision is so lower. There are
cases where we don't always read whole paper, but use one or a few par
t of an article. So this paper presents an approach to retrieve the re
levant parts of a document by using paragraphs of individual documents
. Sample documents are 49 articles in Japanese about information retri
eval and natural language processing studies. The retrieval technique
used in this retrieve experiment is the vector space model. As a resul
t, the higher precision and recall were shown by using the words in ch
apter titles or section headings to retrieve the relevant paragraphs.