×

Creating a language model for a language processing system

  • US 7,031,908 B1
  • Filed: 06/01/2000
  • Issued: 04/18/2006
  • Est. Priority Date: 06/01/2000
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for creating a task dependent unified language model for a selected application from a task independent corpus, the task dependent unified language model being for use in a language processing system and having embedded context-free grammar non-terminal tokens in a N-gram model, the method comprising:

  • obtaining a plurality of context-free grammars comprising non-terminal tokens representing semantic or syntactic concepts, each of the context-free grammars having words present in the task independent corpus to form the semantic or syntactic concepts;

    parsing the task independent corpus with the plurality of context-free grammars to identify word occurrences of each of the semantic or syntactic concepts;

    replacing each of the identified word occurrences with corresponding non-terminal tokens;

    building a N-gram model having the non-terminal tokens; and

    obtaining a second plurality of context-free grammars comprising at least some of the same non-terminals representing the same semantic or syntactic concepts, each of the context-free grammars of the second plurality being more appropriate for use in the selected application.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×