×

System and method for building diverse language models

  • US 9,727,557 B2
  • Filed: 07/18/2016
  • Issued: 08/08/2017
  • Est. Priority Date: 03/08/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • establishing a crawling schedule configured to identify, according to a pattern of links, a likelihood of web pages to have information capable of filling vocabulary gaps, and wherein a website visitation policy comprises the crawling schedule according to perplexity of the web pages with respect to a language model;

    crawling, via a processor, the web-pages based on the crawling schedule, to yield new vocabulary words; and

    generating a new language model according to the language model and the new vocabulary words.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×