专利内容由知识产权出版社提供
专利名称:SELF-ORGANIZED CONCEPT SEARCH AND
DATA STORAGE METHOD
发明人:George Witwer,Ravi Kumar Kondadadi申请号:US11275554申请日:20060113
公开号:US20060167930A1公开日:20060727
专利附图:
摘要:A document search and retrieval system and method stores documents ingroups based on content. The documents are self-organized into a hierarchy ofconceptual clusters, and branches of the hierarchy are stored separately in distinct
physical stores, each having an index. In response to a query, the system finds theconcepts (clusters) that best match the search criteria and returns the documents fromthose content categories. The indexing, clustering, and searching are performed usingdocument themes and/or summaries. Themes are automatically developed by stemmingand scoring phrases from the sentences in each document, and clustering the sentencescontaining the highest-scoring stems. A set of phrases (themes) is taken from eachcluster. Document summaries are taken from text segments for each cluster ofsentences within a document, then strung together to create a summary.
申请人:George Witwer,Ravi Kumar Kondadadi
地址:Bluffton IN US,Indianapolis IN US
国籍:US,US
更多信息请下载全文后查看