System and method of improving speech recognition using context
First Claim
1. A system, comprising:
a processor;
a single microphone configured to both record user speech and to record ambient sounds; and
a speech recognition module configured to:
identify that the ambient sounds are of a particular type by comparing the ambient sounds to stored waveforms;
select a dictionary based on the identified particular type of ambient sounds;
identify, as contextual information, terms related to the identified particular type of ambient sounds based on identification of the identified particular type of ambient sounds, the terms being generated as contextual information;
alter, in response to identification of the terms related to the identified particular type of ambient sounds, the dictionary such that the dictionary includes the terms related to the identified particular type of ambient sounds;
assign, in the dictionary, score values to the terms related to the identified particular type of ambient sounds based on identifying that the terms are related to the identified particular type of ambient sounds; and
analyze the user speech by comparing each potential output word or phoneme in the user speech to waveforms stored for the dictionary to attempt to match the potential output word or phoneme to a waveform corresponding to a particular word or phoneme in the dictionary, an analysis varying based on the assigned scores to the terms identified as contextual information.
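The ambient-sound steps recited in this claim (identify the type, select a dictionary, add related terms, assign score values) can be illustrated with a minimal Python sketch. The claim does not say how waveforms are compared or what the scores look like, so everything concrete here is an assumption made for illustration: the cosine-similarity comparison, the names `classify_ambient` and `build_context_dictionary`, the sample waveforms, the term lists, and the value of `CONTEXT_SCORE` are all hypothetical.

```python
import math

def normalized_correlation(a, b):
    """Cosine similarity between two sample sequences (assumed comparison metric)."""
    n = min(len(a), len(b))
    a, b = a[:n], b[:n]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def classify_ambient(ambient, stored_waveforms):
    """Return the stored ambient type whose waveform best matches the samples."""
    return max(
        stored_waveforms,
        key=lambda kind: normalized_correlation(ambient, stored_waveforms[kind]),
    )

# Hypothetical stored waveforms, base dictionaries, and related terms.
STORED_AMBIENT = {
    "train_station": [0.0, 0.9, 0.0, -0.9, 0.0, 0.9, 0.0, -0.9],
    "restaurant": [0.5, 0.4, 0.5, 0.4, 0.5, 0.4, 0.5, 0.4],
}
DICTIONARIES = {
    "train_station": {"ticket": 1.0, "time": 1.0},
    "restaurant": {"menu": 1.0, "table": 1.0},
}
RELATED_TERMS = {
    "train_station": ["platform", "departure"],
    "restaurant": ["reservation", "waiter"],
}
CONTEXT_SCORE = 2.0  # assumed score value for context-related terms

def build_context_dictionary(ambient):
    """Identify the ambient type, select its dictionary, add and score related terms."""
    ambient_type = classify_ambient(ambient, STORED_AMBIENT)
    dictionary = dict(DICTIONARIES[ambient_type])  # select a dictionary (copied)
    for term in RELATED_TERMS[ambient_type]:       # alter it to include the terms
        dictionary[term] = CONTEXT_SCORE           # assign score values
    return ambient_type, dictionary

ambient_type, dictionary = build_context_dictionary(
    [0.1, 0.8, -0.1, -0.7, 0.0, 1.0, 0.1, -0.8]
)
```

In this sketch the base dictionaries are left unmodified and a per-request copy is altered, which is a design choice of the example rather than something the claim requires.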
Abstract
A system and method are provided for improving speech recognition accuracy. Contextual information about user speech may be received, and then speech recognition analysis can be performed on the user speech using the contextual information. This allows the system and method to improve accuracy when performing tasks like searching and navigating using speech recognition.
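As a rough illustration of how contextual information might change a recognition result, the Python sketch below weights each candidate word's acoustic match by a context score. The abstract does not state how the two are combined, so the multiplicative weighting, the function name `pick_word`, and the example scores are assumptions, not the patented method.

```python
def pick_word(acoustic_scores, context_scores, default_score=1.0):
    """Pick the candidate with the best acoustic score weighted by its context score.

    acoustic_scores: candidate word -> waveform match quality (higher is better)
    context_scores:  word -> score value assigned from contextual information
    """
    def combined(word):
        return acoustic_scores[word] * context_scores.get(word, default_score)
    return max(acoustic_scores, key=combined)

# Hypothetical acoustic match scores for one spoken word.
acoustic = {"drain": 0.52, "train": 0.48, "rain": 0.40}

without_context = pick_word(acoustic, {})
with_context = pick_word(acoustic, {"train": 2.0})
```

With these illustrative numbers, the acoustically closest candidate wins when no context is available, while a boosted context term can overtake it once contextual scores are applied.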
15 Claims
6. A method comprising:
recording sounds using a single microphone;
identifying, using one or more processors, potential output words and phonemes as well as ambient sounds in the sounds recorded by the single microphone;
identifying that the ambient sounds are of a particular type by comparing the ambient sounds to stored waveforms;
selecting a dictionary based on the identified particular type of ambient sounds;
identifying, as contextual information, terms related to the identified particular type of ambient sounds based on identification of the identified particular type of ambient sounds, the terms being generated as contextual information;
assigning, in the dictionary, score values to the terms related to the identified particular type of ambient sounds based on identifying that the terms are related to the identified particular type of ambient sounds; and
analyzing user speech by comparing each potential output word or phoneme in the user speech to waveforms stored for the dictionary to attempt to match the potential output word or phoneme to a waveform corresponding to a particular word or phoneme in the dictionary, the analyzing varying based on the assigned scores to the terms identified as contextual information.
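The method claim has speech and ambient sounds identified within one single-microphone recording but does not say how they are told apart. One simple possibility, shown here purely as an assumption, is to split the recording into frames and label each by its short-time energy. The function name `split_speech_and_ambient`, the frame size, and the threshold are hypothetical, and a real ambient sound could well be louder than speech, so this is only a sketch of the idea.

```python
def split_speech_and_ambient(samples, frame_size=4, energy_threshold=0.1):
    """Label fixed-size frames as speech or ambient by mean squared amplitude.

    Assumption: frames at or above the threshold are treated as candidate
    speech, and quieter frames are treated as ambient sound.
    """
    speech, ambient = [], []
    for start in range(0, len(samples), frame_size):
        frame = samples[start:start + frame_size]
        energy = sum(s * s for s in frame) / len(frame)
        if energy >= energy_threshold:
            speech.append(frame)
        else:
            ambient.append(frame)
    return speech, ambient

# Hypothetical recording: quiet background, a louder burst, quiet background.
recording = [0.01, -0.02, 0.01, 0.0, 0.6, -0.7, 0.5, -0.6, 0.02, 0.01, -0.01, 0.0]
speech_frames, ambient_frames = split_speech_and_ambient(recording)
```

The ambient frames would then feed the waveform comparison that identifies the ambient type, and the speech frames would feed the dictionary-based analysis.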
13. A non-transitory machine-readable storage medium comprising a set of instructions which, when executed by a processor, causes execution of operations comprising:
recording sounds using a single microphone;
identifying potential output words and phonemes as well as ambient sounds in the sounds recorded by the single microphone;
identifying that the ambient sounds are of a particular type by comparing the ambient sounds to stored waveforms;
selecting a dictionary based on the identified particular type of ambient sounds;
identifying, as contextual information, terms related to the identified particular type of ambient sounds based on identification of the identified particular type of ambient sounds, the terms being generated as contextual information;
assigning, in the dictionary, score values to the terms related to the identified particular type of ambient sounds based on identifying that the terms are related to the identified particular type of ambient sounds; and
analyzing the user speech by comparing each potential output word or phoneme in the user speech to waveforms stored for the dictionary to attempt to match the potential output word or phoneme to a waveform corresponding to a particular word or phoneme in the dictionary, the analyzing varying based on the assigned scores to the terms identified as contextual information.
Specification