Speech recognition of partial proper names by natural language processing
First Claim
1. A method of speech recognition for partial proper names comprising:
receiving, with a processor, a corpus including a plurality of complex proper names;
performing, with the processor, natural language processing to generate a syntactic structure corresponding to a plurality of words in one of the plurality of complex proper names in the corpus;
generating, with the processor, a plurality of candidate partial names corresponding to the one complex proper name using a machine learning process with reference to the syntactic structure corresponding to the one complex proper name and the plurality of words in the one complex proper name, wherein the one complex proper name is divided into categories of syntactic units and each of the plurality of candidate partial names comprises a subset of words from the plurality of words contained within the one complex proper name and the subset of words classified into a specific syntactic category;
selecting, with the processor, only a portion of the plurality of candidate partial names based on at least one syntactic structural identifier of at least one phrase in each candidate partial name to provide a modified list of candidate partial names;
producing, with the processor, a phonetic transcription of each partial name in the modified list of candidate partial names;
incorporating, with the processor, the modified list of candidate partial names based on the phonetic transcription into a model for a speech recognizer to recognize partial names in a speech recognition process; and
storing, with the processor, the model for the speech recognizer in a memory;
receiving, with the processor, speech input from a user comprising a candidate partial name from the modified candidate partial names;
performing, with the processor, speech recognition, using the model, on the received speech; and
identifying, with the processor, the one complex proper name based on the recognized speech and performing an action on a user device based on the identified one complex proper name.
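The generating, selecting, and identifying steps of the claim can be illustrated with a minimal Python sketch. It is not the claimed implementation: the claim recites a machine learning process for candidate generation, whereas this sketch swaps in exhaustive enumeration of order-preserving subsets of syntactic units. The category tags (`DT`, `NP`, `JJ`, `NN`), the hand-written parse, the example name, and the rule "keep candidates containing an `NP` unit" are all assumptions made for illustration.

```python
from itertools import combinations


def candidate_partial_names(units):
    """Yield (text, categories) for every non-empty, order-preserving subset of units."""
    for size in range(1, len(units) + 1):
        for subset in combinations(units, size):
            text = " ".join(word for _, words in subset for word in words)
            categories = tuple(category for category, _ in subset)
            yield text, categories


def select_candidates(candidates, required_category="NP"):
    """Keep only candidates containing a unit of the required category (assumed rule)."""
    return [text for text, categories in candidates if required_category in categories]


def build_index(parsed_corpus):
    """Map each selected partial name (lowercased) to the full names it can refer to."""
    index = {}
    for full_name, units in parsed_corpus.items():
        for partial in select_candidates(candidate_partial_names(units)):
            index.setdefault(partial.lower(), set()).add(full_name)
    return index


# Hypothetical, hand-written parse of one complex proper name into
# (syntactic category, words) units; a real system would obtain this from a parser.
parsed_corpus = {
    "The Golden Dragon Chinese Restaurant": [
        ("DT", ["The"]),
        ("NP", ["Golden", "Dragon"]),
        ("JJ", ["Chinese"]),
        ("NN", ["Restaurant"]),
    ],
}

index = build_index(parsed_corpus)
print(sorted(index["golden dragon"]))
```

In this sketch a lookup of the recognized text in `index` plays the role of identifying the complex proper name. A partial name such as "Chinese Restaurant" is dropped by the selection step because it contains no `NP` unit.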
Abstract
A method for speech recognition of partial proper names is described that includes natural language processing (NLP), partial name candidate generation, speech recognition, and post-processing. Natural language processing techniques, including shallow and deep parsing, are applied to long proper names to identify syntactic units (for example, noun phrases). The syntactic units form the basis for generating a candidate list of partial names for each original full name. A partial name is derived from the original name by omitting some words, changing the word order, or even substituting words. After the candidate partial names are generated, their phonetic transcriptions are incorporated into a model for a speech recognizer so that the partial names can be recognized in a speech recognition system.
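The three ways a partial name can differ from the full name (omission, reordering, substitution) can be sketched as follows. This is only an illustration under stated assumptions: it operates on individual words rather than on parsed syntactic units, and the function name `partial_name_variants`, the example name, and the synonym table are hypothetical.

```python
from itertools import permutations


def partial_name_variants(name, synonyms):
    """Return word-level variants of a name, grouped by the operation that produced them."""
    words = name.split()

    # Omission: drop exactly one word, keeping the others in order.
    omitted = [" ".join(words[:i] + words[i + 1:]) for i in range(len(words))]

    # Reordering: every permutation except the original order.
    reordered = [" ".join(p) for p in permutations(words) if list(p) != words]

    # Substitution: replace one word with an alternative from the synonym table.
    substituted = []
    for i, word in enumerate(words):
        for alternative in synonyms.get(word.lower(), []):
            substituted.append(" ".join(words[:i] + [alternative] + words[i + 1:]))

    return {"omitted": omitted, "reordered": reordered, "substituted": substituted}


# Hypothetical synonym table; a real system would need a curated or learned resource.
variants = partial_name_variants("Golden Dragon Restaurant", {"restaurant": ["Cafe"]})
for operation, names in variants.items():
    print(operation, names)
```

Unrestricted permutation grows factorially with name length, which is one reason a practical system would restrict reordering to whole syntactic units and then prune the candidate list.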
16 Claims
Dependent claims of claim 1: 2, 3, 4, 5, 6, 7, 8
9. A speech recognition system comprising:
a memory configured to store:
a corpus including a plurality of complex proper names; and
a model for a speech recognizer; and
a processor operatively connected to the memory, the processor being configured to:
perform natural language processing to generate syntactic structure corresponding to a plurality of words in one of the plurality of complex proper names in the corpus;
generate a plurality of candidate partial names corresponding to the one complex proper name using a machine learning process with reference to the syntactic structure corresponding to the one complex proper name and the plurality of words in the one complex proper name, wherein the one complex proper name is divided into categories of syntactic units and each of the plurality of candidate partial names comprises a subset of words from the plurality of words contained within the one complex proper name and the subset of words classified into a specific syntactic category;
select only a portion of the plurality of candidate partial names based on at least one syntactic structural identifier of at least one phrase in each candidate partial name to provide a modified list of candidate partial names;
incorporate the modified list of candidate partial names based on the phonetic transcription into the model for the speech recognizer to recognize partial names in a speech recognition process;
store the model for the speech recognizer in the memory;
receive speech input from a user comprising a candidate partial name from the modified candidate partial names;
perform speech recognition, using the model, on the received speech; and
identify the one complex proper name based on the recognized speech and performing an action on a user device based on the identified one complex proper name.
Dependent claims of claim 9: 10, 11, 12, 13, 14, 15, 16
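How a stored model might tie phonetic transcriptions of partial names back to a full name can be sketched with a toy in-memory structure. Everything here is an assumption made for illustration: the class `PartialNameModel`, the approximate ARPAbet-style pronunciations in `LEXICON`, and the exact-match lookup that stands in for an actual speech recognizer, which would score acoustic input rather than compare phone sequences.

```python
from dataclasses import dataclass, field

# Hypothetical toy pronunciation lexicon with approximate ARPAbet-style phones.
LEXICON = {
    "golden": ["G", "OW", "L", "D", "AH", "N"],
    "dragon": ["D", "R", "AE", "G", "AH", "N"],
    "restaurant": ["R", "EH", "S", "T", "R", "AA", "N", "T"],
}


def transcribe(name):
    """Concatenate lexicon pronunciations for each word; raises KeyError on unknown words."""
    phones = []
    for word in name.lower().split():
        phones.extend(LEXICON[word])
    return tuple(phones)


@dataclass
class PartialNameModel:
    """Maps the phone sequence of each partial name to its complex proper name."""
    entries: dict = field(default_factory=dict)

    def add(self, partial_name, full_name):
        self.entries[transcribe(partial_name)] = full_name

    def identify(self, phones):
        """Return the full name for an exactly matching phone sequence, else None."""
        return self.entries.get(tuple(phones))


model = PartialNameModel()
full_name = "The Golden Dragon Chinese Restaurant"
for partial in ["Golden Dragon", "Golden Dragon Restaurant"]:
    model.add(partial, full_name)

# Stand-in for recognizer output: the phones of what the user said.
heard = transcribe("golden dragon")
print(model.identify(heard))
```

Keying the model on phone sequences rather than spelling reflects the claim's wording that the candidate list is incorporated "based on the phonetic transcription". The exact-match dictionary is a simplification chosen to keep the sketch short.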
Specification