System and method for natural language processing

US 10,395,647 B2
Filed: 10/26/2017
Issued: 08/27/2019
Est. Priority Date: 10/26/2017
Status: Active Grant

First Claim

Patent Images

1. A system for improving accuracy of natural language processing, the system comprising:

a natural language input device;

a plurality of speech recognition engines for automatic speech recognition functions only, the plurality of speech recognition engines being connected to the input device, the plurality of speech recognition engines receive an input from the input device and presents a speech recognition result as part of a set of speech recognition results;

a data fusion model to receive the set of speech recognition results and to identify a correct result from the set of speech recognition results, the correct result being identified as a result in the set of speech recognition results that has the highest probability of being a correct result from the plurality of speech recognition engines;

the data fusion model to receive all of the speech recognition results and to determine a correct result from speech recognition results, the determined correct result being selected from a result in the set of speech recognition results that has a low probability of being a correct result and is manually determined to be a normal expression of the input from the input device;

a semantic understanding model, separate and distinct from the plurality of speech recognition engines, to process the identified correct result and the determined correct result;

a corpora created from the processed correct result;

a corpus arranged from a plurality of the corpora; and

the data fusion model being updated by the corpus.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and method for improving accuracy of natural language processing using a plurality of speech recognition engines, a data fusion model to identify a correct result from the plurality of speech recognition engines and a semantic understanding model, separate and distinct from the speech recognition model, to process the correct results. A corpus is developed using the correct results and the corpus is used to train the data fusion model and the semantic understanding model.

Citations

6 Claims

1. A system for improving accuracy of natural language processing, the system comprising:
- a natural language input device;
  
  a plurality of speech recognition engines for automatic speech recognition functions only, the plurality of speech recognition engines being connected to the input device, the plurality of speech recognition engines receive an input from the input device and presents a speech recognition result as part of a set of speech recognition results;
  
  a data fusion model to receive the set of speech recognition results and to identify a correct result from the set of speech recognition results, the correct result being identified as a result in the set of speech recognition results that has the highest probability of being a correct result from the plurality of speech recognition engines;
  
  the data fusion model to receive all of the speech recognition results and to determine a correct result from speech recognition results, the determined correct result being selected from a result in the set of speech recognition results that has a low probability of being a correct result and is manually determined to be a normal expression of the input from the input device;
  
  a semantic understanding model, separate and distinct from the plurality of speech recognition engines, to process the identified correct result and the determined correct result;
  
  a corpora created from the processed correct result;
  
  a corpus arranged from a plurality of the corpora; and
  
  the data fusion model being updated by the corpus.
- View Dependent Claims (2)
- - 2. The system as claimed in claim 1 wherein the data fusion model to identify a correct result from the set of speech recognition results further comprises the correct result being identified as each of the results in the set of speech recognition results being the same.

3. A method for natural language processing in a system having a natural language input device, a plurality of speech recognition engines, a data fusion model and a semantic understanding model, the method carried out in a processor having computer executable instructions for performing the steps of:
- receiving, at the natural language input device, an input sentence;
  
  processing the input sentence at the plurality of speech recognition engines, each of the plurality of speech recognition engines producing a result that is part of a set of results for all of the speech recognition engines;
  
  recording all of the results from the plurality of speech recognition engines to develop a corpora;
  
  applying the data fusion model to identify a correct result from the set of results, the correct result being identified as a result in the set of speech recognition results that has the highest probability of being a correct result;
  
  applying the data fusion model to determine a correct result from all of the results, the correct result being determined from one or more results from the set of results for the input sentence that has a low probability of being a correct result, determining manually that the input sentence is a normal expression, and adding the input sentence to the developed corpora;
  
  processing the identified correct result and the determined correct result in the semantic understanding model;
  
  collecting the processed correct results in a corpus; and
  
  updating the data fusion model using the corpus.
- View Dependent Claims (4)
- - 4. The method as claimed in claim 3 wherein the step of applying the data fusion model to identify a correct result further comprises the correct result being identified by each of the results in the set of speech recognition results being the same.

5. A non-transitory computer readable storage medium comprising a program, which, when executed by one or more processors, performs an operation comprising:
- processing an input sentence received by an input device using a plurality of speech recognition engines;
  
  recording all of the results from the plurality of speech recognition engines to develop a corpora;
  
  producing a set of results that includes all results for each speech recognition engine in the plurality of speech recognition engines;
  
  applying a data fusion model to the set of results to identify a correct result from the set of results;
  
  applying the data fusion model to all of the results to determine a correct results from all of the results;
  
  processing the identified correct result in the semantic understanding model, the identified correct result being identified as a result in the set of speech recognition results that has the highest probability of being a correct result;
  
  processing the determined correct result in the semantic understanding model, the determined correct result being determined from a result in the set of speech recognition results that has a low probability of being a correct result, determining manually that the input sentence is a normal expression, and adding the input sentence to the developed corpora; and
  
  updating the data fusion model using the processed correct results.
- View Dependent Claims (6)
- - 6. The computer readable medium as claimed in claim 5 wherein the program performs an operation of applying a data fusion model to the set of results to identify a correct result from the set of results further comprises the correct result being identified when all of the results in the set of results are the same.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Harman International Industries Incorporated (Samsung Electronics Co. Ltd.)
Original Assignee
Harman International Industries Incorporated (Samsung Electronics Co. Ltd.)
Inventors
Qi, Lianjun, Ma, Jianjun
Primary Examiner(s)
Sharma, Neeraj

Application Number

US15/794,114
Publication Number

US 20190130895A1
Time in Patent Office

670 Days
Field of Search

None
US Class Current
CPC Class Codes

G06F 40/30   Semantic analysis

G10L 15/00   Speech recognition G10L17/0...

G10L 15/063   Training

G10L 15/1815   Semantic context, e.g. disa...

G10L 15/197   Probabilistic grammars, e.g...

G10L 15/22   Procedures used during a sp...

G10L 15/26   Speech to text systems G10L...

G10L 15/32   Multiple recognisers used i...

System and method for natural language processing

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

6 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for natural language processing

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

6 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links