BIB-VERSION:: CS-TR-v2.0 ID:: UCB//S2K-93-22 ENTRY:: February 25, 1994 TITLE:: Cases as Structured Indexes for Full-Length Documents DATE:: AUTHOR:: Hearst, Marti A. PAGES:: 6 ABSTRACT:: Two long, full-length texts are not likely to discuss all, or almost all, of the same subtropics or sub- points. Even if the documents contain many of the same terms the ways the terms are grouped to form subtopical disucssions sill might be quite different. A solution is to create a description of a document which lists all of its subtopical discussions as well as its main topics. An index that indicates this structure is an abstract representation of the document and we can think of this index as a case in the Case-Based Reasoning (CBR) sense. This paper proposes the use of cases to represent the high-level structure of full-length documents for the purpose of information retrieval. The cases are to be used both for assessing document similarity and for helping the user construct viable queries. The case can be transformed in various ways in order to make it more similar to the descriptions of other documents; these tranformations include generalizing, substituting, and emphasizing subtropic descrip- tions. An advantage of this approach is that the cases that represent the document are automatically generable. RETRIEVAL:: postscript (in all.ps) END:: UCB//S2K-93-22