Method and system for generating named entities
First Claim
1. A method of generating named entities, the method comprising:
- extracting, by a named entity generation engine, a plurality of named entities in a primary language from a plurality of digital content in the primary language;
transliterating, by the named entity generation engine, each of the plurality of named entities in the primary language to a set of possible named entities in a secondary language;
determining, by the named entity generation engine, a correct named entity in the secondary language from among the set of possible named entities in the secondary language; and
generating, by the named entity generation engine, using the correct named entity in the secondary language, a named entity in a subsequent secondary language corresponding to the correct named entity in the secondary language, wherein the generating comprises;
obtaining, from a multi-lingual character level tree model included in the named entity, generation engine, an index equivalent to an index of the correct named entity in the secondary language, andobtaining best sequence of symbols in a plurality of subsequent secondary languages including the subsequent secondary language;
wherein the plurality of named entities in the primary language are named entities in the subsequent secondary language, and wherein the subsequent secondary language is related to the secondary language.
1 Assignment
0 Petitions
Accused Products
Abstract
This disclosure relates generally to natural language processing, and more particularly to system and method for generating named entities. In one embodiment, a method is provided for generating named entities. The method includes extracting a plurality of named entities in a primary language from a plurality of digital content in the primary language, transliterating each of the plurality of named entities in the primary language to a set of possible named entities in a secondary language, determining a correct named entity in the secondary language from among the set of possible named entities in the secondary language, and generating a named entity in a subsequent secondary language corresponding to the correct named entity in the secondary language. It should be noted that the plurality of named entities in the primary language are named entities in the subsequent secondary language, and the subsequent secondary language is related to the secondary language.
19 Citations
21 Claims
-
1. A method of generating named entities, the method comprising:
-
extracting, by a named entity generation engine, a plurality of named entities in a primary language from a plurality of digital content in the primary language; transliterating, by the named entity generation engine, each of the plurality of named entities in the primary language to a set of possible named entities in a secondary language; determining, by the named entity generation engine, a correct named entity in the secondary language from among the set of possible named entities in the secondary language; and generating, by the named entity generation engine, using the correct named entity in the secondary language, a named entity in a subsequent secondary language corresponding to the correct named entity in the secondary language, wherein the generating comprises; obtaining, from a multi-lingual character level tree model included in the named entity, generation engine, an index equivalent to an index of the correct named entity in the secondary language, and obtaining best sequence of symbols in a plurality of subsequent secondary languages including the subsequent secondary language; wherein the plurality of named entities in the primary language are named entities in the subsequent secondary language, and wherein the subsequent secondary language is related to the secondary language. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system for generating named entities, the system comprising:
-
at least one processor; and a computer-readable medium storing instructions that, when executed by the at least one processor, cause the at least one processor to perform operations comprising; extracting a plurality of named entities in a primary language from a plurality of digital content in the primary language; transliterating each of the plurality of named entities in the primary language to a set of possible named entities in a secondary language; determining a correct named entity in the secondary language from among the set of possible named entities in the secondary language; and generating, using the correct named entity in the secondary language, a named entity in a subsequent secondary language corresponding to the correct named entity in the secondary language, wherein the generating comprises; obtaining, from a multi-lingual character level tree model included in the named entity generation engine, an index equivalent to an index of the correct named entity in the secondary language, and obtaining a best sequence of symbols in a plurality of subsequent secondary languages including the subsequent secondary language; wherein the plurality of named entities in the primary language are named entities in the subsequent secondary language, and wherein the subsequent secondary language is related to the secondary language. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
21. A non-transitory computer-readable medium storing computer-executable instructions for:
-
extracting a plurality of named entities in a primary language from a plurality of digital content in the primary language; transliterating each of the plurality of named entities in the primary language to a set of possible named entities in a secondary language; determining a correct named entity in the secondary language from among the set of possible named entities in the secondary language; and generating, using the correct named entity in the secondary language, a named entity in a subsequent secondary language corresponding to the correct named entity in the secondary language, wherein the generating comprises; obtaining, from a multi-lingual character level tree model included in the named entity generation engine, an index equivalent to an index of the correct named entity in the secondary language, and obtaining a best sequence of symbols in a plurality of subsequent secondary languages including the subsequent secondary language; wherein the plurality of named entities in the primary language are named entities in the subsequent secondary language, and wherein the subsequent secondary language is related to the secondary language.
-
Specification