Terminal device, server device and speech recognition method

US 20030050783A1
Filed: 09/12/2002
Published: 03/13/2003
Est. Priority Date: 09/13/2001
Status: Abandoned Application

First Claim

Patent Images

1. A terminal device, comprising:

a transmitting means for transmitting a voice produced by a user and environmental noises to a server device;

a receiving means for receiving from the server device an acoustic model adapted to the voice of the user and the environmental noises;

a first storage means for storing the acoustic model received by the receiving means; and

a speech recognition means for conducting speech recognition using the acoustic model stored in the first storage means.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Voice of a user having noises added thereto (noise-added voice) is input by a terminal device and transmitted to a server device. A plurality of acoustic models are stored in advance in a data storage section of the server device. An adapted-model selecting section of the server device selects an acoustic model which is the best adapted to the noise-added voice received by a receiving section from the acoustic models stored in the data storage section. A transmitting section transmits the selected adapted model to the terminal device. A receiving section of the terminal device receives the adapted model from the server device. The received adapted model is stored in a memory. A speech recognition section conducts speech recognition using the adapted model stored in the memory.

104 Citations

View as Search Results

23 Claims

1. A terminal device, comprising:
- a transmitting means for transmitting a voice produced by a user and environmental noises to a server device;
  
  a receiving means for receiving from the server device an acoustic model adapted to the voice of the user and the environmental noises;
  
  a first storage means for storing the acoustic model received by the receiving means; and
  
  a speech recognition means for conducting speech recognition using the acoustic model stored in the first storage means.
- View Dependent Claims (2, 3, 4, 5, 6)
- - 2. The terminal device according to claim 1, wherein the receiving means further receives an acoustic model which will be used by the user in future from the server device.
  - 3. The terminal device according to claim 1, further comprising:
    - a determining means for comparing similarity between the voice of the user having the environmental noises added thereto and an acoustic model which has already been stored in the first storage means with a predetermined threshold value, wherein if the similarity is smaller than the threshold value, the transmitting means transmits the voice of the user and the environmental noises to the server device.
  - 4. The terminal device according to claim 3, wherein if the similarity is smaller than the threshold value, the determining means prompts the user to determine whether an acoustic model is to be obtained or not, and if the user determines that an acoustic model is to be obtained, the transmitting means transmits the voice of the user and the environmental noises to the server device.
  - 5. The terminal device according to claim 1, further comprising:
    - a second storage means for storing a voice produced by a user, wherein if environmental noises are obtained, the transmitting means transmits the environmental noises and the voice of the user stored in the second storage means to the server device.
  - 6. The terminal device according to claim 1, wherein the terminal device prompts the user to select a desired environment from various environments, and plays back a characteristic sound of the selected environment.

7. A terminal device, comprising:
- a transmitting means for transmitting a voice produced by a user and environmental noises to a server device;
  
  a receiving means for receiving from the server device acoustic-model producing data for producing an acoustic model adapted to the voice of the user and the environmental noises;
  
  a first storage means for storing the acoustic-model producing data received by the receiving means;
  
  a producing means for producing the acoustic model adapted to the voice of the user and the environmental noises by using the acoustic-model producing data stored in the first storage means; and
  
  a speech recognition means for conducting speech recognition using the acoustic model produced by the producing means.
- View Dependent Claims (8, 9)
- - 8. The terminal device according to claim 7, wherein the receiving means further receives acoustic-model producing data which will be used by the user in future from the server device.
  - 9. The terminal device according to claim 7, wherein the terminal device prompts the user to select a desired environment from various environments, and plays back a characteristic sound of the selected environment.

10. A server device, comprising:
- a storage means for storing a plurality of acoustic models each adapted to a corresponding speaker and a corresponding environment;
  
  a receiving means for receiving from a terminal device a voice produced by a user and environmental noises;
  
  a selecting means for selecting from the storage means an acoustic model which is adapted to the voice of the user and the environmental noises received by the receiving means; and
  
  a transmitting means for transmitting the acoustic model selected by the selecting means to the terminal device.
- View Dependent Claims (11, 12, 13)
- - 11. The server device according to claim 10, wherein the selecting means selects an acoustic model which will be used by a user of the terminal device in future from the storage means.
  - 12. The server device according to claim 10, wherein each of the plurality of acoustic models stored in the storage means is adapted also to a tone of voice of a corresponding speaker.
  - 13. The server device according to claim 10, wherein each of the plurality of acoustic models stored in the storage means is adapted also to characteristics of an inputting means for obtaining a voice produced by a speaker in order to produce the acoustic model.

14. A server device, comprising:
- a storage means for storing a plurality of acoustic models each adapted to a corresponding speaker and a corresponding environment;
  
  a receiving means for receiving from a terminal device a voice produced by a user and environmental noises;
  
  a producing means for producing an acoustic model adapted to the voice of the user and the environmental noises, based on the voice of the user and the environmental noises received by the receiving means and the plurality of acoustic models stored in the storage means; and
  
  a transmitting means for transmitting the acoustic model produced by the producing means to the terminal device.
- View Dependent Claims (15, 16, 17)
- - 15. The server device according to claim 14, wherein the producing means produces an acoustic model which will be used by a user of the terminal device in future.
  - 16. The server device according to claim 14, wherein each of the plurality of acoustic models stored in the storage means is adapted also to a tone of voice of a corresponding speaker.
  - 17. The server device according to claim 14, wherein each of the plurality of acoustic models stored in the storage means is adapted also to characteristics of an inputting means for obtaining a voice produced by a speaker in order to produce the acoustic model.

18. A server device, comprising:
- a storage means for storing a plurality of acoustic models each adapted to a corresponding speaker and a corresponding environment;
  
  a receiving means for receiving from a terminal device a voice produced by a user and environmental noises;
  
  a selecting means for selecting from the storage means acoustic-model producing data for producing an acoustic model which is adapted to the voice of the user and the environmental noises received by the receiving means; and
  
  a transmitting means for transmitting the acoustic-model producing data selected by the selecting means to the terminal device.
- View Dependent Claims (19, 20, 21)
- - 19. The server device according to claim 18, wherein the selecting means selects acoustic-model producing data which will be used by a user of the terminal device in future from the storage means.
  - 20. The server device according to claim 18, wherein each of the plurality of acoustic models stored in the storage means is adapted also to a tone of voice of a corresponding speaker.
  - 21. The server device according to claim 18, wherein each of the plurality of acoustic models stored in the storage means is adapted also to characteristics of an inputting means for obtaining a voice produced by a speaker in order to produce the acoustic model.

22. A speech recognition method, comprising the steps of:
- preparing a plurality of acoustic models each adapted to a corresponding speaker, a corresponding environment, and a corresponding tone of voice;
  
  obtaining an acoustic model adapted to a voice produced by a user and environmental noises, based on the voice of the user, the environmental noises and the plurality of acoustic models; and
  
  conducting speech recognition using the obtained acoustic model.
- View Dependent Claims (23)
- - 23. The speech recognition method according to claim 22, wherein each of the plurality of acoustic models is adapted also to characteristics of an inputting means for obtaining a voice produced by a speaker in order to produce the acoustic model.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Yoshizawa, Shinichi

Application Number

US10/241,873
Publication Number

US 20030050783A1
Time in Patent Office

Days
Field of Search
US Class Current

704/270.1
CPC Class Codes

G10L 15/065 Adaptation

G10L 15/30 Distributed recognition, e....

Terminal device, server device and speech recognition method

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

104 Citations

23 Claims

Specification

Solutions

Use Cases

Quick Links

Terminal device, server device and speech recognition method

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

104 Citations

23 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links