Distributed internet based speech recognition system with natural language support

US 7,203,646 B2
Filed: 05/22/2006
Issued: 04/10/2007
Est. Priority Date: 11/12/1999
Status: Expired due to Term

2 Assignments

0 Petitions

Abstract

A speech-enabled internet based computing system includes a configurable speech recognition engine used for interacting with content on a web accessible page. The speech recognition engine is distributed across a client and server architecture, and is adaptive so that speech processing operations can be allocated as needed between the two. This allows for support for client devices having differing computing capabilities. Natural language operations can also be supported as desired. A user can thus interact with a web page and select items of interest using speech as a mode of input. Dynamic grammars can assist in the recognition operations to improve speed and comprehension.
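
The allocation idea in the abstract can be illustrated with a minimal sketch. It is not taken from the patent: the stage names, the relative costs, and the `client_budget` parameter are all hypothetical, chosen only to show how a leading portion of a speech pipeline might run on the client while the remainder is deferred to the server.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical pipeline stages in processing order, with made-up relative costs.
STAGES = [
    ("capture", 1),
    ("feature_extraction", 3),
    ("acoustic_decoding", 10),
    ("language_decoding", 8),
]


@dataclass
class Allocation:
    client: List[str]
    server: List[str]


def allocate(client_budget: int) -> Allocation:
    """Keep leading stages on the client while its budget lasts.

    Once one stage is handed to the server, every later stage follows it,
    so the client always runs a contiguous prefix of the pipeline.
    """
    client: List[str] = []
    server: List[str] = []
    remaining = client_budget
    for name, cost in STAGES:
        if not server and cost <= remaining:
            client.append(name)
            remaining -= cost
        else:
            server.append(name)
    return Allocation(client, server)
```

Under these assumed costs, a low-powered device with a budget of 4 would keep only capture and feature extraction locally, while a device with a large budget would run every stage itself.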

464 Citations

10 Claims

1. A system for enabling a browser program to interact with a website using speech utterances, the system comprising:

a speech recognition engine configured to generate a recognized speech query from an utterance;

said speech recognition engine being further configurable such that speech processing operations can be distributed between a client device and a server device as required to achieve real-time recognition of a speech query; and

a natural language engine configured to determine a meaning of said recognized speech query and provide a first response thereto;

a web page routine for presenting one or more web pages to the browser program, wherein data content for said one or more web pages is controlled by said recognized speech query and/or said first response of said natural language engine;

wherein said recognized speech queries can be presented to both said natural language engine as well as to a text based query database for identifying a meaning of said recognized speech query, such that a second response can be provided by said database for at least some recognized speech queries prior to said first response of said natural language engine.

2. The system of claim 1, wherein said speech query is recognized by forming a concatenation of words and/or phrases derived from said speech query and using said concatenation as a search query for a database.

3. The system of claim 1 wherein said speech recognition engine is also configured to dynamically change a speech recognition grammar based on input provided by a user to selections available within said web page.

4. The system of claim 1 wherein multiple speech grammars are available and selectable within the web page, and such that speech input provided by the user for an item within the web page using a first grammar dynamically controls which one of a plurality of second grammars is loaded for speech recognition of subsequent speech input by the user.

5. The system of claim 1 further including an electronic conversational agent adapted to interact with a user and mimic behavior of a human agent through a native language interactive real-time dialog session with the user.

6. The system of claim 5, wherein said electronic conversational agent is configured to articulate suggestions to the user for appropriate speech queries.

7. The system of claim 5, wherein said electronic conversational agent is adapted to have configurable perception parameters which are adjusted and tailored to said content pertaining to said list of items.

8. The system of claim 5, wherein said server device causes said interactive character agent to respond in real-time whenever the user provides selected speech input.

9. The system of claim 5, wherein the server device transfers speech related data for the web page using a hypertext transfer protocol (HTTP) and using a format which includes a predetermined NULL character.

10. The system of claim 1, wherein the user can speak a help command while interacting with any web page maintained by the server device to cause an interactive character agent to appear.
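
The final clause of claim 1, in which a text based query database may answer before the natural language engine does, can be sketched roughly as follows. This is an illustrative assumption rather than the claimed implementation: `respond`, `normalize`, the dictionary-backed `text_db`, and the `nl_engine` callable are hypothetical names, and a real system would likely run the two paths concurrently instead of in sequence.

```python
from typing import Callable, Dict, Iterator, Tuple


def normalize(query: str) -> str:
    """Lower-case the recognized text and collapse runs of whitespace."""
    return " ".join(query.lower().split())


def respond(
    query: str,
    text_db: Dict[str, str],
    nl_engine: Callable[[str], str],
) -> Iterator[Tuple[str, str]]:
    """Yield (source, response) pairs for one recognized speech query.

    The query is presented to both paths. A hit in the text database is
    yielded first, before the (assumed slower) natural language engine
    produces its own response.
    """
    hit = text_db.get(normalize(query))
    if hit is not None:
        yield ("database", hit)
    yield ("natural_language", nl_engine(query))
```

Because `respond` is a generator, a caller can display the database answer as soon as it is yielded and only then wait for the natural language result. Queries with no database entry simply produce the single natural language response.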

Current Assignee: Nuance Communications, Inc. (Microsoft Corporation)
Original Assignee: Phoenix Solutions Incorporated (CDC Corporation)
Inventors: Bennett, Ian M.
Primary Examiner(s): Lerner, Martin
Application Number: US11/419,736
Publication Number: US 20060200353A1
Time in Patent Office: 323 Days
Field of Search: 704/251, 704/252, 704/255, 704/257, 704/270, 704/270.1, 704/275, 707/3, 707/4, 707/5
US Class Current: 704/257

CPC Class Codes
G06F 16/951   Indexing; Web crawling tech...
G06F 40/289   Phrasal analysis, e.g. fini...
G10L 15/19   Grammatical context, e.g. d...
G10L 15/30   Distributed recognition, e....
Y10S 707/99933   Query processing, i.e. sear...
Y10S 707/99935   Query augmenting and refini...
