NAMED ENTITY TRANSLITERATION USING CORPORATE CORPRA
First Claim
1. A method of mining multilingual named entity transliteration comprising:
- Reviewing a document in a first language;
Reviewing an additional document in a second language;
Determining if the additional document is sufficiently similar to the document;
If the additional document is sufficiently similar to the document;
Selecting a named entity in the document;
Searching for a sufficiently similar named entity comprising comparing the named entity to a word in the additional document;
If a sufficiently similar word to the named entity is located, storing the named entity and the similar word as named entity transliterations.
2 Assignments
0 Petitions
Accused Products
Abstract
A document in a first language and an additional document in a second language may be reviewed. It may be determined if the additional document is sufficiently similar to the document. If the additional document is determined sufficiently similar to the document, a named entity in the document may be selected. The method may search for a similar named entity by comparing the named entity to a word in the additional document and determining if the named entity and word are sufficiently similar. If a similar word to the named entity is located, the named entity and the similar named entities may be stored as name entity transliterations.
32 Citations
20 Claims
-
1. A method of mining multilingual named entity transliteration comprising:
-
Reviewing a document in a first language; Reviewing an additional document in a second language; Determining if the additional document is sufficiently similar to the document; If the additional document is sufficiently similar to the document; Selecting a named entity in the document; Searching for a sufficiently similar named entity comprising comparing the named entity to a word in the additional document; If a sufficiently similar word to the named entity is located, storing the named entity and the similar word as named entity transliterations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer storage medium comprising computer executable instructions for mining multilingual named entity transliteration, the computer executable instructions comprising instructions for:
-
Reviewing a document in a first language; Reviewing an additional document in a second language; Determining if the additional document is sufficiently similar to the document by calculating a cross language similarity score; If the additional document is sufficiently similar to the document; Selecting a named entity in the document; Searching for a sufficiently similar named entity comprising comparing the named entity to a word in the additional document; If a sufficiently similar word to the named entity is located, storing the named entity and the similar word as named entity transliterations. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A computer system comprising a processor for executing computer executable instructions for mining multilingual named entity transliteration, a memory in communication with the processor and an input output circuit, the computer executable instructions comprising instructions for:
-
Reviewing a document in a first language; Reviewing an additional document in a second language; Determining if the additional document is sufficiently similar to the document by calculating a cross language similarity score wherein the cross language similarity score is calculated using a Kullback-Leibler divergence; If the additional document is sufficiently similar to the document; Selecting a named entity in the document; Searching for a sufficiently similar named entity comprising comparing the named entity to a word in the additional document; If a sufficiently similar word to the named entity is located, storing the named entity and the similar word as named entity transliterations. - View Dependent Claims (19, 20)
-
Specification