SOURCE EXPANSION FOR INFORMATION RETRIEVAL AND INFORMATION EXTRACTION

  • US 20120078895A1
  • Filed: 09/24/2010
  • Published: 03/29/2012
  • Est. Priority Date: 09/24/2010
  • Status: Active Grant
First Claim

1. A method for automatically expanding existing data content that is included in a corpus comprising:

    automatically generating search queries to search for content related to existing data, the queries being generated based on existing data content;

    automatically retrieving content from one or more data repositories;

    automatically extracting units of text from the retrieved content;

    automatically determining a relevance of the extracted units of text and their relatedness to the existing data; and

    automatically selecting new sources of content and including them in the corpus based on the determined relevance.
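
The five steps recited in claim 1 can be illustrated with a minimal sketch. Nothing below is taken from the patent's specification: the function names, the stopword list, the in-memory list standing in for the "data repositories", the frequency-based query generation, the Jaccard overlap used as the relevance measure, and the 0.2 threshold are all assumptions chosen for illustration. The sketch also selects individual text units rather than whole sources, which is a simplification.

```python
import re
from collections import Counter

# Hypothetical stopword list, for illustration only.
STOPWORDS = {"a", "an", "the", "of", "and", "in", "is", "to", "for",
             "on", "by", "with", "that", "are", "it", "as"}


def tokenize(text):
    """Lowercase alphanumeric tokens with stopwords removed."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower())
            if t not in STOPWORDS]


def generate_queries(seed_text, max_terms=3):
    """Step 1: build one query from the most frequent seed terms (assumed heuristic)."""
    counts = Counter(tokenize(seed_text))
    return [[term for term, _ in counts.most_common(max_terms)]]


def retrieve(queries, repository):
    """Step 2: return repository documents that share a term with any query."""
    hits = []
    for doc in repository:
        doc_terms = set(tokenize(doc))
        if any(doc_terms & set(query) for query in queries):
            hits.append(doc)
    return hits


def extract_units(document):
    """Step 3: split retrieved content into sentence-like units of text."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]


def relevance(unit, seed_text):
    """Step 4: Jaccard term overlap between a unit and the existing content (assumed measure)."""
    a, b = set(tokenize(unit)), set(tokenize(seed_text))
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def expand(seed_text, repository, threshold=0.2):
    """Step 5: keep the units whose relevance meets the threshold."""
    selected = []
    for doc in retrieve(generate_queries(seed_text), repository):
        for unit in extract_units(doc):
            if unit not in selected and relevance(unit, seed_text) >= threshold:
                selected.append(unit)
    return selected


if __name__ == "__main__":
    seed = "Solar panels convert sunlight into electricity."
    repository = [
        "Solar panels convert sunlight using photovoltaic cells. Bananas are yellow.",
        "Cats sleep most of the day.",
    ]
    for unit in expand(seed, repository):
        print(unit)
```

In this toy run only the sentence about photovoltaic cells would be added to the corpus; the unrelated sentence in the same retrieved document is dropped at the relevance step, and the second document is never retrieved. A real system would presumably query external search engines and use a trained relevance model, neither of which is modeled here.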
