×

Generating speech recognition grammars from a large corpus of data

  • US 7,567,902 B2
  • Filed: 09/18/2002
  • Issued: 07/28/2009
  • Est. Priority Date: 09/18/2002
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of generating an expandable speech recognition grammar for use with a speech recognition engine comprising:

  • parsing a corpus of data using a processor to generate an annotated corpus of data identifying grammatical structures and grammatical parts of speech within the corpus of data, wherein the corpus of data comprises a plurality of well formed sentences, and wherein the parsing comprises providing for each identified grammatical structure and grammatical part of speech a tag labeling each identified grammatical structure and grammatical part of speech accordingly;

    using the processor to compare the identified grammatical structures and the identified grammatical parts of speech within the annotated corpus of data with grammar generation rules to designate particular ones of the identified grammatical structures and the identified grammatical parts of speech to include within a speech recognition grammar to be generated, wherein the grammar generation rules further designate, independently of a context of the corpus of data and any words already included in the expandable speech grammar, particular grammatical parts of speech and grammatical structures to be included within the expandable speech recognition grammar to be generated; and

    using the processor to include within the expandable speech recognition grammar one or more words associated with the grammatical structures within the annotated corpus of data which have been identified in said parsing step and designated in said comparing step by the grammar generation rules, exclusive of words already included in the expandable speech recognition grammar, wherein the grammar is generated without use of counter-examples associated with the corpus of data.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×