×

Process and system for retrieval of documents using context-relevant semantic profiles

  • US 6,189,002 B1
  • Filed: 12/08/1999
  • Issued: 02/13/2001
  • Est. Priority Date: 12/14/1998
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for extracting a semantic profile from a text corpus, the method comprising the steps of:

  • parsing the text corpus into paragraphs and words;

    processing the text corpus to remove stop words and inflectional morphemes;

    arranging the vocabulary of the parsed and processed text corpus in an order;

    creating a reference set of vectors from the text corpus by transforming each paragraph of the parsed and processed text corpus into a corresponding vector of K elements arranged in the order, K being the size of the vocabulary, each element of the vector corresponding to said each paragraph determined by application of a first predetermined function to the number of occurrences within said each paragraph of the word corresponding to said each element;

    presenting the reference set of vectors to a neural network to train the neural network in the word relationships of the text corpus; and

    storing activation patterns of the hidden units of the trained neural network for use as the semantic profile of the text corpus.

View all claims
  • 21 Assignments
Timeline View
Assignment View
    ×
    ×