Language model selection for speech-to-text conversion
Abstract
Methods, computer program products and systems are described for converting speech to text. Sound information is received at a computer server system from an electronic device, where the sound information is from a user of the electronic device. A context identifier indicates a context within which the user provided the sound information. The context identifier is used to select, from among multiple language models, a language model appropriate for the context. Speech in the sound information is converted to text using the selected language model. The text is provided for use by the electronic device.
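The flow summarized in the abstract (receive sound information plus a context identifier, select a language model, convert, return text) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the claimed implementation: the names `LanguageModel`, `ModelRegistry`, `select` and `transcribe` are hypothetical, the context identifier is assumed to be a URL-like string, the models are toy unigram models, and the list of candidate transcriptions stands in for the output of an acoustic decoder that is not shown.

```python
import math
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class LanguageModel:
    """Hypothetical unigram language model mapping word -> probability."""
    name: str
    word_probs: Dict[str, float]
    floor: float = 1e-6  # assumed probability for words the model has not seen

    def log_score(self, text: str) -> float:
        # Sum of log-probabilities of the words in the candidate text.
        return sum(math.log(self.word_probs.get(w, self.floor))
                   for w in text.lower().split())


@dataclass
class ModelRegistry:
    """Hypothetical lookup table from context identifier to language model."""
    models: Dict[str, LanguageModel] = field(default_factory=dict)
    default: Optional[LanguageModel] = None

    def select(self, context_id: str) -> Optional[LanguageModel]:
        # Fall back to the default model when the context is unknown.
        return self.models.get(context_id, self.default)


def transcribe(candidates: List[str], context_id: str,
               registry: ModelRegistry) -> str:
    """Return the candidate that the context-selected model scores highest."""
    model = registry.select(context_id)
    if model is None:
        raise ValueError("no language model available for this context")
    return max(candidates, key=model.log_score)


# Two toy models for two assumed contexts.
cooking = LanguageModel("cooking", {"flour": 0.05, "recipe": 0.05})
gardening = LanguageModel("gardening", {"flower": 0.05, "recipe": 0.001})
registry = ModelRegistry(
    models={"recipes.example/search": cooking,
            "garden.example/search": gardening},
    default=cooking,
)

# The same ambiguous audio yields different text in different contexts.
candidates = ["flour recipe", "flower recipe"]
cooking_text = transcribe(candidates, "recipes.example/search", registry)
gardening_text = transcribe(candidates, "garden.example/search", registry)
```

The point of the sketch is that the acoustically ambiguous pair "flour"/"flower" is resolved by the context identifier alone: the same candidates produce different text depending on which model the identifier selects.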
233 Citations
21 Claims
1. A method of converting speech to text, comprising:
generating a language model by analyzing textual content for a first web page to determine a topic of the first web page, determining other pages determined to be directed to the same topic of the first web page, and analyzing content of queries previously submitted to the other pages determined to be directed to the same topic of the first web page, wherein the queries previously submitted to the other pages include queries submitted to respective search capabilities of at least some of the other pages;
receiving, at a computer server system and from an electronic device, sound information from a user of the electronic device, and a context identifier that indicates a context within which the user provided the sound information;
using the context identifier to select the generated language model from among a plurality of language models;
converting speech in the sound information to text using the selected language model; and
providing the text for use by the electronic device.
Dependent claims: 2, 3, 4.
5. A method of converting speech to text, comprising:
generating a plurality of language models by analyzing textual content for a web page to determine a topic of the web page, determining other pages determined to be directed to the same topic of the web page, and analyzing textual content of queries previously submitted by a plurality of users to the other pages determined to be directed to the same topic of the web page, wherein the queries previously submitted by the plurality of users to the other pages include queries submitted to respective search capabilities of at least some of the other pages;
receiving, at a computer server system and from an electronic device, sound information spoken by a user of the electronic device, and a context identifier of the web page, wherein the web page was being presented by the electronic device when the sound information was spoken by the user;
selecting, using the context identifier and from among the plurality of language models, a language model appropriate for the context identifier;
converting speech in the sound information to text using the selected language model; and
providing the text for use by the electronic device.
Dependent claims: 6, 7.
8. A system comprising:
a data processing apparatus; and
storage coupled to the data processing apparatus storing code that when executed by the data processing apparatus causes the data processing apparatus to perform operations comprising:
generating a language model by analyzing textual content for a first web page to determine a topic of the first web page, determining other pages determined to be directed to the same topic of the first web page, and analyzing content of queries previously submitted to the other pages determined to be directed to the same topic of the first web page, wherein the queries previously submitted to the other pages include queries submitted to respective search capabilities of at least some of the other pages;
receiving, at a computer server system and from an electronic device, sound information from a user of the electronic device, and a context identifier that indicates a context within which the user provided the sound information;
using the context identifier to select the generated language model from among a plurality of language models;
converting speech in the sound information to text using the selected language model; and
providing the text for use by the electronic device.
Dependent claims: 9, 10, 11, 12, 13, 14.
15. A computer-readable storage device encoded with a computer program product, the computer program product including instructions that, when executed, cause data processing apparatus to perform operations comprising:
generating a language model by analyzing textual content for a first web page to determine a topic of the first web page, determining other pages determined to be directed to the same topic of the first web page, and analyzing content of queries previously submitted to the other pages determined to be directed to the same topic of the first web page, wherein the queries previously submitted to the other pages include queries submitted to respective search capabilities of at least some of the other pages;
receiving, at a computer server system and from an electronic device, sound information from a user of the electronic device, and a context identifier that indicates a context within which the user provided the sound information;
using the context identifier to select the generated language model from among a plurality of language models;
converting speech in the sound information to text using the selected language model; and
providing the text for use by the electronic device.
Dependent claims: 16, 17, 18, 19, 20, 21.
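The model-generation step shared by the independent claims (determine the topic of a first page, find other pages on the same topic, then analyze queries previously submitted to those other pages) might be sketched as below. Everything here is an assumption made for illustration: the claims do not say how a topic is determined or what form the language model takes, so the sketch uses the most frequent non-stopword token as a crude topic label and an add-one-smoothed unigram model. The names `page_topic`, `build_topic_model`, `pages` and `query_log` are hypothetical.

```python
from collections import Counter

# Small illustrative stopword list (an assumption, not from the source).
STOPWORDS = {"a", "an", "and", "the", "of", "to", "for", "in", "with", "how"}


def tokenize(text):
    """Lowercase, split on whitespace, keep purely alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha()]


def page_topic(text):
    """Crude topic guess: the most frequent non-stopword token, or None."""
    counts = Counter(w for w in tokenize(text) if w not in STOPWORDS)
    return counts.most_common(1)[0][0] if counts else None


def build_topic_model(first_url, pages, query_log):
    """Build a unigram model from queries sent to same-topic pages.

    pages:     dict of url -> textual content of the page
    query_log: dict of url -> queries previously submitted to that page
    Returns (topic, same_topic_urls, prob), where prob(word) is an
    add-one-smoothed unigram probability.
    """
    topic = page_topic(pages[first_url])
    # Other pages directed to the same topic as the first page.
    same_topic = [url for url, text in pages.items()
                  if url != first_url and page_topic(text) == topic]

    counts = Counter()
    for url in same_topic:
        for query in query_log.get(url, []):
            counts.update(tokenize(query))

    total = sum(counts.values())
    vocab = len(counts)

    def prob(word):
        # Add-one smoothing, with one extra slot reserved for unseen words.
        return (counts[word] + 1) / (total + vocab + 1)

    return topic, same_topic, prob


pages = {
    "a.example/bread": "bread recipes and bread baking tips",
    "b.example/loaf": "sourdough bread starter bread flour",
    "c.example/roses": "roses pruning roses soil",
}
query_log = {
    "b.example/loaf": ["whole wheat flour", "flour substitutes"],
    "c.example/roses": ["rose fertilizer"],
}
topic, same_topic, prob = build_topic_model("a.example/bread", pages, query_log)
```

In this toy data the first page and `b.example/loaf` share the topic label "bread", so only queries submitted to `b.example/loaf` feed the model, and a word such as "flour" ends up more probable than "fertilizer", which appears only in queries to the off-topic page.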
Specification