ASSIGNING AN INDEXING WEIGHT TO A SEARCH TERM
First Claim
1. A method for assigning an indexing weight to a search term in a document, the document in a collection of documents, the method comprising:
- calculating a text-based indexing weight for the search term in the document;
- calculating a pronunciation prominence for the search term; and
- assigning an indexing weight to the search term in the document, the indexing weight based, at least in part, on a mathematical combination of the calculated text-based indexing weight and the calculated pronunciation prominence.
Abstract
Disclosed is an indexing weight assigned to a potential search term in a document, the indexing weight being based on both textual and acoustic aspects of the term. In one embodiment, a traditional text-based weight is assigned to a potential search term. This weight can be TF-IDF (“term frequency-inverse document frequency”), TF-DV (“term frequency-discrimination value”), or any other text-based weight. A pronunciation prominence weight is then calculated for the same term. The text-based weight and the pronunciation prominence weight are mathematically combined into the final indexing weight for that term. When a speech-based search string is entered, the combined indexing weight is used to determine the importance of each search term in each document. Several possibilities for calculating the pronunciation prominence are contemplated. In some embodiments, for pairs of terms in a document, an inter-term pronunciation distance is calculated based on inter-phoneme distances.
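The combination described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say which mathematical combination is used, how inter-phoneme distances are defined, or how inter-term distances become a prominence value. The sketch assumes a product as the combination, a unit cost for any phoneme mismatch, an edit distance over phoneme sequences as the inter-term pronunciation distance, and the mean length-normalized distance to the other terms of the document as the prominence. The function names and the `lexicon` mapping (term to phoneme list) are hypothetical.

```python
import math
from collections import Counter


def tf_idf(term, doc, collection):
    """Text-based weight: term frequency times a smoothed inverse document frequency."""
    tf = Counter(doc)[term] / len(doc)
    df = sum(1 for d in collection if term in d)
    idf = math.log((1 + len(collection)) / (1 + df)) + 1.0
    return tf * idf


def phoneme_distance(a, b):
    """Assumed inter-phoneme distance: 0 for identical phonemes, 1 otherwise."""
    return 0.0 if a == b else 1.0


def pronunciation_distance(p, q):
    """Inter-term distance: edit distance over two phoneme sequences,
    with the substitution cost taken from phoneme_distance (an assumption)."""
    prev = [float(j) for j in range(len(q) + 1)]
    for i, a in enumerate(p, start=1):
        cur = [float(i)]
        for j, b in enumerate(q, start=1):
            cur.append(min(prev[j] + 1.0,                          # delete a
                           cur[j - 1] + 1.0,                       # insert b
                           prev[j - 1] + phoneme_distance(a, b)))  # substitute
        prev = cur
    return prev[-1]


def pronunciation_prominence(term, doc, lexicon):
    """Assumed prominence: mean length-normalized pronunciation distance
    from the term to every other distinct term in the document."""
    others = [t for t in set(doc) if t != term]
    if not others:
        return 1.0
    total = 0.0
    for other in others:
        d = pronunciation_distance(lexicon[term], lexicon[other])
        total += d / max(len(lexicon[term]), len(lexicon[other]))
    return total / len(others)


def indexing_weight(term, doc, collection, lexicon):
    """Final weight: product of the two weights (the product is an assumption)."""
    return tf_idf(term, doc, collection) * pronunciation_prominence(term, doc, lexicon)


# Hypothetical lexicon with ARPAbet-style phoneme symbols.
lexicon = {
    "cat": ["K", "AE", "T"],
    "bat": ["B", "AE", "T"],
    "dog": ["D", "AO", "G"],
}
doc = ["cat", "bat", "dog"]
collection = [doc]

w_cat = indexing_weight("cat", doc, collection, lexicon)
w_dog = indexing_weight("dog", doc, collection, lexicon)
```

Under these assumptions, "dog" receives a higher indexing weight than "cat" even though both have the same TF-IDF, because "cat" is easily confused acoustically with "bat" while "dog" is not.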
22 Claims
1. A method for assigning an indexing weight to a search term in a document, the document in a collection of documents, the method comprising:
- calculating a text-based indexing weight for the search term in the document;
- calculating a pronunciation prominence for the search term; and
- assigning an indexing weight to the search term in the document, the indexing weight based, at least in part, on a mathematical combination of the calculated text-based indexing weight and the calculated pronunciation prominence.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A voice-to-text-search indexing server comprising:
- a memory configured for storing an indexing weight assigned to a search term in a document, the document in a collection of documents; and
- a processor operatively coupled to the memory and configured for calculating a text-based indexing weight for the search term in the document, for calculating a pronunciation prominence for the search term, and for assigning an indexing weight to the search term in the document, the indexing weight based, at least in part, on a mathematical combination of the calculated text-based indexing weight and the calculated pronunciation prominence.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
Specification