Web triggered word set boosting for speech interfaces to the world wide web

US 5,819,220 A
Filed: 09/30/1996
Issued: 10/06/1998
Est. Priority Date: 09/30/1996
Status: Expired due to Term

First Claim

Patent Images

1. A method for user speech actuation of access to information stored in a computer system, the method comprising:

storing the information in a memory of the computer system;

displaying a selected subset of the information via the computer on a display viewable by the user;

forming a web triggered word set from a portion of the information stored in the computer, including a plurality of individual words contained in the displayed subset;

receiving a speech input comprising one or more spoken words from the user;

performing a speech recognition process based on the speech input from the user to determine statistically a set of probable words based on the speech input of the user, including;

processing the input speech in accordance with a set of language/acoustic model and speech recognition search parameters which produce probability scores for at least individual words;

modifying the language/acoustic model and speech recognition search parameters dynamically using the web triggered word set to boost the probability score of at least individual ones of the probable words; and

updating the display to display a new subset of the information in accordance with the set of probable words determined from the speech recognition search as modified using the web-triggered word set.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A computer system for user speech actuation of access to stored information, the system including a central processing unit, a memory and a user input/output interface including a microphone for input of user speech utterances and audible sound signal processing circuitry, and a file system for accessing and storing information in the memory of the computer. A speech recognition processor operating on the computer system recognizes words based on the input speech utterances of the user in accordance with a set of language/acoustic model and speech recognition search parameters. Software running on the CPU scans a document accessed by a web browser to form a web triggered word set from a selected subset of information in the document. The language/acoustic model and speech recognition search parameters are modified dynamically using the web triggered word set, and used by the speech recognition processor for generating a word string for input to the browser to initiate a change in the information accessed.

345 Citations

12 Claims

1. A method for user speech actuation of access to information stored in a computer system, the method comprising:
- storing the information in a memory of the computer system;
  
  displaying a selected subset of the information via the computer on a display viewable by the user;
  
  forming a web triggered word set from a portion of the information stored in the computer, including a plurality of individual words contained in the displayed subset;
  
  receiving a speech input comprising one or more spoken words from the user;
  
  performing a speech recognition process based on the speech input from the user to determine statistically a set of probable words based on the speech input of the user, including;
  
  processing the input speech in accordance with a set of language/acoustic model and speech recognition search parameters which produce probability scores for at least individual words;
  
  modifying the language/acoustic model and speech recognition search parameters dynamically using the web triggered word set to boost the probability score of at least individual ones of the probable words; and
  
  updating the display to display a new subset of the information in accordance with the set of probable words determined from the speech recognition search as modified using the web-triggered word set.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. A method according to claim 1 in which the computer system includes a local computer operable by the user, a remote computer at which the information is stored and a network link for coupling the local and remote computers, the displaying step including accessing the information from the local computer via a web browser.
  - 3. A method according to claim 1 in which the information is stored in a hypertext markup language (HTML) source document and the step of forming a web triggered word set includes forming a set of words that includes a source word set including individual words extracted from the (HTML) source document.
  - 4. A method according to claim 1 in which the step of forming a web triggered word set includes storing in a short term cache one or more previously-formed web triggered word sets for inclusion in a current web triggered word set.
  - 5. A method according to claim 1 in which the step of forming a web triggered word set includes modifying the web triggered word set responsive to updating to display a new subset of the information.
  - 6. A method according to claim 1 in which the step of forming a web triggered word set includes forming a set of words that includes words selected from at least one of a displayed subset of the information and a basic word set of command and function words chosen a priori.
  - 7. A method according to claim 1 in which the step of performing a speech recognition process which includes the processing and modifying steps includes estimating a fit between an acoustic observation X of the speech input word sequence and a possible word sequence W according to a Bayes-type evaluation functionF(S_LM (W|H), S_AC (X|W,H)), whereS_LM is a language model score,S_Ac is an acoustic model score, andH is the web triggered word set.
  - 8. A method according to claim 1 in which the step of performing a speech recognition process which includes the processing and modifying steps includes estimating a fit between an acoustic observation X of the speech input word sequence and a possible word sequence W according to an altered Bayes-like scoring functionPr(X|W)×
    - Pr(W)^Omega(W,H)using a special set of Omega(W,H) values for words W belonging to a predicted/web-triggered set H, so as to improve the scores of such words.

9. A computer system for user speech actuation of access to stored information, the system comprising:
- a computer including a central processing unit, a memory and a user input/output interface including a microphone for input of user speech utterances and audible sound signal processing circuitry;
  
  means for accessing and storing information in the memory of the computer system;
  
  a speech recognition processor operating on the computer system for recognizing words including individual words based on the input speech utterances of the user in accordance with a set of language/acoustic model and speech recognition search parameters which produce a probability score for at least the individual words;
  
  means for forming a web triggered word set from a selected subset of information in the document, the web triggered word set including a plurality of individual words contained in the displayed subset;
  
  means for modifying the language/acoustic model and speech recognition search parameters dynamically using the web triggered word set to boost the probability score of at least the individual words; and
  
  means responsive to the speech recognition processor for generating a word string based on the probability score as boosted by the modifying means for input to the accessing and storing means to initiate a change in the information accessed.
- View Dependent Claims (10, 11, 12)
- - 10. A system according to claim 9 including:
    - means for displaying a first portion of the selected subset of the information via the computer on a display viewable by the user, so that the user can formulate speech utterances based on the displayed portion of the information; and
      
      means for updating the display to show a second portion of the selected subset of the information in accordance with the word string determined from the speech recognition search.
  - 11. A system according to claim 9 in which the information stored in the memory comprises a first document and the accessing and storing means includes means for directing the system to access a different document responsive to the word string.
  - 12. A method according to claim 9 in which the computer system includes a local computer operable by the user, a remote computer at which the information is stored and a network link for coupling the local and remote computers, the displaying means including means for accessing the information from the local computer via a web browser.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Hewlett-Packard Development Company, L.P. (HP Inc.)
Original Assignee
Hewlett-Packard Company (HP Inc.)
Inventors
Sarukkai, Sekhar, Sarukkai, Ramesh
Primary Examiner(s)
Hudspeth, David R.
Assistant Examiner(s)
ABEBE, DANIEL DEMELASH

Application Number

US08/722,691
Time in Patent Office

736 Days
Field of Search

395/762, 704/270, 704/240, 704/243
US Class Current

704/270.1
CPC Class Codes

G10L 15/19   Grammatical context, e.g. d...

G10L 2015/0631   Creating reference template...

G10L 2015/228   of application context

H04M 3/493   Interactive information ser...

H04M 3/4938   comprising a voice browser ...

H04M 7/006   Networks other than PSTN/IS...

Web triggered word set boosting for speech interfaces to the world wide web

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

345 Citations

12 Claims

Specification

Solutions

Use Cases

Quick Links

Web triggered word set boosting for speech interfaces to the world wide web

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

345 Citations

12 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links