Representation of orthography in a continuous vector space
First Claim
Patent Images
1. A computerized method comprising:
- mapping words in a dictionary and a set of letter patterns that are characteristic of the words into an orthographic space to create orthographic anchors, each orthographic anchor corresponding to a word in the dictionary, wherein the mapping comprises;
creating a matrix of feature vectors from the letter patterns and the words;
decomposing the matrix into row vectors and column vectors; and
selecting vectors from one of the row vectors and column vectors as the orthographic anchors.
2 Assignments
0 Petitions
Accused Products
Abstract
An orthographic anchor for each word in a dictionary is created in an orthographic space by mapping the words and a set of letter patterns characteristic of the words into the orthographic space. In one aspect the orthographic anchors are row or column vectors resulting from a decomposition of a matrix of feature vectors created by the mapping. In another aspect, a pronunciation for an input word is modeled based on a set of candidate phoneme strings that have pronunciations close to the input word in the orthographic space.
-
Citations
36 Claims
-
1. A computerized method comprising:
mapping words in a dictionary and a set of letter patterns that are characteristic of the words into an orthographic space to create orthographic anchors, each orthographic anchor corresponding to a word in the dictionary, wherein the mapping comprises; creating a matrix of feature vectors from the letter patterns and the words; decomposing the matrix into row vectors and column vectors; and selecting vectors from one of the row vectors and column vectors as the orthographic anchors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
10. A machine-readable medium having instructions to cause a machine to perform a method comprising:
mapping words in a dictionary and a set of letter patterns that are characteristic of the words into an orthographic space to create orthographic anchors, each orthographic anchor corresponding to a word in the dictionary, wherein the mapping comprises; creating a matrix of feature vectors from the letter patterns and the words; decomposing the matrix into row vectors and column vectors; and selecting vectors from one of the row vectors and column vectors as the orthographic anchors. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
19. A system comprising:
-
a processing unit coupled to a memory through a bus; and an orthographic anchor process executed from the memory by the processing unit to cause the processing unit to map words in a dictionary and a set of letter patterns that are characteristic of the words into an orthographic space to create orthographic anchors, each orthographic anchor corresponding to a word in the dictionary, wherein the orthographic anchor process further causes the processing unit, when mapping words, to create a matrix of feature vectors from the letter patterns and the words, decompose the matrix into row vectors and column vectors, and select vectors from one of the row vectors and column vectors as the orthographic anchors. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. An apparatus comprising:
-
means for receiving a dictionary; and means for mapping words in the dictionary and a set of letter patterns that are characteristic of the words into an orthographic space to create orthographic anchors, each orthographic anchor corresponding to a word in the dictionary, wherein the means for mapping comprises; means for creating a matrix of feature vectors from the letter patterns and the words; and means for decomposing the matrix into row vectors and column vectors, vectors from one of the row vectors and the column vectors being selected as the orthographic anchors. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36)
-
Specification