Information data retrieval, where the data is organized in terms, documents and document corpora

  • US 20050149494A1
  • Filed: 01/13/2003
  • Published: 07/07/2005
  • Est. Priority Date: 01/16/2002
  • Status: Active Grant
First Claim

1. A method of processing digitized textual information, the information being organized in terms, documents and document corpora, where each document contains at least one term and each document corpus contains at least one document, the method comprising:

  • generating a concept vector for each document in a document corpus, wherein the concept vector conceptually classifies the contents of the document in a relatively compact format;

  • generating, for each term in the document corpus, a term-to-concept vector describing a relationship between the term and each of the concept vectors, wherein the term-to-concept vectors are generated on the basis of the concept vectors;

  • receiving the term-to-concept vectors for the document corpus and, on the basis thereof, generating a term-term matrix describing a term-to-term relationship between the terms in the document corpus; and

  • processing the term-term matrix into processed textual information.
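
The data flow recited in the claim (document concept vectors, then term-to-concept vectors, then a term-term matrix) can be illustrated with a minimal Python sketch. The claim does not specify how any of these objects are computed, so every concrete choice here is an assumption made for illustration only: the hashed bag-of-words concept vector, the dimension `K`, the helper names (`bucket`, `concept_vector`, `term_to_concept_vectors`, `term_term_matrix`), and the use of cosine similarity are all hypothetical and are not taken from the patent. The final step of the claim, processing the matrix into processed textual information, is not shown.

```python
import math
import zlib
from collections import Counter

K = 8  # hypothetical number of concept dimensions (not from the patent)


def bucket(term, k=K):
    # Deterministic hash of a term into one of k concept dimensions (assumed scheme).
    return zlib.crc32(term.encode("utf-8")) % k


def concept_vector(document, k=K):
    # Compact, fixed-length summary of a document: hashed term counts, L2-normalized.
    vec = [0.0] * k
    for term, count in Counter(document.split()).items():
        vec[bucket(term, k)] += count
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def term_to_concept_vectors(corpus, k=K):
    # For each term, one entry per document concept vector: the weight that the
    # concept vector carries in the term's own dimension (assumed relationship).
    concepts = [concept_vector(doc, k) for doc in corpus]
    terms = sorted({t for doc in corpus for t in doc.split()})
    return terms, {t: [c[bucket(t, k)] for c in concepts] for t in terms}


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def term_term_matrix(corpus, k=K):
    # Term-to-term relationship derived only from the term-to-concept vectors.
    terms, t2c = term_to_concept_vectors(corpus, k)
    return terms, [[cosine(t2c[a], t2c[b]) for b in terms] for a in terms]


corpus = ["cats chase mice", "dogs chase cats", "stocks rise today"]
terms, matrix = term_term_matrix(corpus)
```

In this sketch, `matrix[i][j]` is a similarity score between `terms[i]` and `terms[j]`, so terms that occur in related documents tend to score higher than terms that never share a document. Hash collisions between terms can inflate scores for small `K`; a real system would choose the concept representation far more carefully.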
