System and method for extraction of factoids from textual repositories

  • US 8,706,730 B2
  • Filed: 12/29/2005
  • Issued: 04/22/2014
  • Est. Priority Date: 12/29/2005
  • Status: Expired due to Fees
First Claim

1. A method of extracting factoids associated with a given factoid category of a plurality of categories from text repositories, said method comprising the steps of:

  • training a classifier to recognise factoids relevant to said given factoid category;

  • collecting, by a processor within a computer, documents or document summaries relevant to said given factoid category from the text repositories and storing the documents or document summaries in an entity store;

  • extracting sentences having a predetermined association to said given factoid category from said documents or said document summaries; and

  • classifying, in a noisy environment, said sentences using said classifier to extract snippets containing phrases relevant to said given factoid category, said extracted snippets being said factoid associated with said given factoid category, and storing the snippets in a snippet store.
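
The four steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not name a classifier, so a small naive Bayes model with add-one smoothing stands in for it here. The names `NaiveBayes`, `split_sentences`, `extract_factoids`, and `cue_words`, the example category (company acquisitions), and all sample sentences are hypothetical. The "predetermined association" is approximated by a cue-word match, whole sentences are treated as snippets, and the "noisy environment" of the claim is not modeled.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class NaiveBayes:
    """Two-class naive Bayes with add-one smoothing (an assumed stand-in classifier)."""

    def __init__(self):
        self.word_counts = {True: Counter(), False: Counter()}
        self.label_counts = Counter()
        self.vocab = set()

    def train(self, labeled_sentences):
        # Step 1 of the claim: learn from sentences labeled relevant / not relevant.
        for sentence, label in labeled_sentences:
            tokens = tokenize(sentence)
            self.word_counts[label].update(tokens)
            self.label_counts[label] += 1
            self.vocab.update(tokens)

    def log_score(self, sentence, label):
        total = sum(self.word_counts[label].values())
        score = math.log(self.label_counts[label] / sum(self.label_counts.values()))
        for token in tokenize(sentence):
            count = self.word_counts[label][token]
            score += math.log((count + 1) / (total + len(self.vocab)))
        return score

    def is_relevant(self, sentence):
        return self.log_score(sentence, True) > self.log_score(sentence, False)


def split_sentences(document):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]


def extract_factoids(documents, cue_words, classifier):
    # Step 2: keep the collected documents in an "entity store" (a plain list here).
    entity_store = list(documents)
    # Step 3: keep sentences that mention at least one cue word for the category.
    candidates = [
        sentence
        for document in entity_store
        for sentence in split_sentences(document)
        if cue_words & set(tokenize(sentence))
    ]
    # Step 4: classify the candidates; accepted sentences form the "snippet store".
    snippet_store = [s for s in candidates if classifier.is_relevant(s)]
    return snippet_store


# Hypothetical training data for an "acquisitions" factoid category.
classifier = NaiveBayes()
classifier.train([
    ("Acme acquired Beta Corp for ten million dollars.", True),
    ("Gamma bought Delta Inc in a cash deal.", True),
    ("The office picnic was held on Friday.", False),
    ("Acme acquired a reputation for slow support.", False),
])

documents = [
    "Acme was founded in 1990. Acme acquired Zeta Corp in a cash deal. "
    "Staff acquired a taste for Friday picnic lunches."
]
snippets = extract_factoids(documents, {"acquired", "bought"}, classifier)
print(snippets)
```

In this sketch the cue-word filter passes two sentences containing "acquired", and the classifier then separates the acquisition statement from the incidental use of the word. A real system would need far more training data and a more careful definition of a snippet.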
