×

Word association method and apparatus

  • US 7,711,547 B2
  • Filed: 10/29/2002
  • Issued: 05/04/2010
  • Est. Priority Date: 03/16/2001
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for associating words and word strings in a language comprising:

  • providing a collection of documents, wherein said collection includes at least one document;

    receiving from a user a word or word string query to be analyzed;

    searching, by a processor, said collection of documents for the query to be analyzed and returning documents containing the query to be analyzed;

    determining a user-defined amount of words or word strings or both to the left of said query to be analyzed in said returned documents based on their frequency and creating a Left Signature List comprising each of said words and word strings to the left of said query to be analyzed in said returned documents;

    searching said collection of documents for the words and word strings on the Left Signature List and returning documents containing said words or word strings on the Left Signature List;

    determining a user-defined amount of words or word strings or both to the right of each of said words and word strings comprising said Left Signature List and creating a Left Anchor List comprising each of said words and word strings to the right of each of said words and word strings on the Left Signature List based on their frequency in a collection of documents;

    determining a user-defined number of words or word strings or both to the right of said query to be analyzed in said returned documents and creating a Right Signature List comprising each of said words and word strings to the right of said query to be analyzed in said returned documents based on their frequency;

    searching said collection of documents for each of said words and word strings on the Right Signature List and returning documents containing said words and word strings on the Right Signature List;

    determining a user-defined number of words or word strings or both to the left of each of said words and word strings comprising said Right Signature List and creating a Right Anchor List comprising each of said words and word strings to the left of each of said words and word strings on the Right Signature List based on their frequency; and

    ranking results based on the number of different Anchor Lists on which the result appears.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×