Word clustering for input data
First Claim
Patent Images
1. A computer implementable clustering tool comprising:
- a clustering component (244) configured to receive input data (240) indicative of a plurality of utterances from a speech recognition component (224) and generate word clusters indicative of words co-occurring in utterances in the input data (240).
3 Assignments
0 Petitions
Accused Products
Abstract
A clustering tool to generate word clusters. In embodiments described, the clustering tool includes a clustering component that generates word clusters for words or word combinations in input data. In illustrated embodiments, the word clusters are used to modify or update a grammar for a closed vocabulary speech recognition application.
-
Citations
20 Claims
-
1. A computer implementable clustering tool comprising:
a clustering component (244) configured to receive input data (240) indicative of a plurality of utterances from a speech recognition component (224) and generate word clusters indicative of words co-occurring in utterances in the input data (240). - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
10. A computer implementable clustering tool comprising:
-
a word clustering component (244) configured to receive input data (240) and to generate at least one of word occurrence vectors or word co-occurrence vectors; and
a vector dot product component (294) configured to compute a vector dot product to obtain word clusters (246) in the input data (240) from the word occurrence vectors or the word co-occurrence vectors. - View Dependent Claims (11, 12)
-
-
13. A method comprising the steps of:
-
providing input data (240);
generating word occurrence vectors for words in the input data (240); and
computing a vector dot product between the word occurrence vectors to generate word clusters being indicative of words that co-occur in clusters in the input data (240). - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification