Method and apparatus for updating dictionary
First Claim
1. A method for updating a dictionary in a dictionary updating system with an unregistered word not registered in the dictionary, the method comprising:
- extracting a document of interest of a user in each predetermined extraction period from a predetermined server connected to the dictionary updating system through a network, and extracting candidate unregistered words existing in the extracted document according to predetermined unregistered word extraction rules;
extracting unregistered words among the candidate unregistered words and extracting candidate semantic classes of the unregistered word based on information on appearance frequencies of the candidate unregistered words retrieved from the document;
verifying the unregistered word according to a predetermined unregistered word verification method and determining the semantic class of the verified unregistered word with usage examples of the unregistered word obtained through a searching unit; and
updating the dictionary updating system with the verified unregistered word and the semantic class of the verified unregistered word.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for updating a dictionary, by which documents of interest of a user are extracted through a network and a dictionary is updated with individual names and coined words extracted from the web documents, are provided. The dictionary updating method includes: extracting a document of interest of a user in each predetermined extraction period from a predetermined server connected to a dictionary updating system through a network, and extracting candidate unregistered words existing in the extracted document according to predetermined unregistered word extraction rules; based on information on appearance frequencies of the candidate unregistered words retrieved from the document, extracting unregistered words among the candidate unregistered words and extracting candidate semantic classes of the unregistered word; with usage examples of the unregistered word obtained through a searching unit, according to a predetermined unregistered word verification method, verifying the unregistered word and determining the semantic class of the verified unregistered word; and updating the dictionary updating system with the verified unregistered word and the semantic class of the verified unregistered word.
60 Citations
35 Claims
-
1. A method for updating a dictionary in a dictionary updating system with an unregistered word not registered in the dictionary, the method comprising:
-
extracting a document of interest of a user in each predetermined extraction period from a predetermined server connected to the dictionary updating system through a network, and extracting candidate unregistered words existing in the extracted document according to predetermined unregistered word extraction rules;
extracting unregistered words among the candidate unregistered words and extracting candidate semantic classes of the unregistered word based on information on appearance frequencies of the candidate unregistered words retrieved from the document;
verifying the unregistered word according to a predetermined unregistered word verification method and determining the semantic class of the verified unregistered word with usage examples of the unregistered word obtained through a searching unit; and
updating the dictionary updating system with the verified unregistered word and the semantic class of the verified unregistered word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method for updating a dictionary of a voice recognition system providing a service corresponding to a voice command of a user, with an unregistered word not registered in the dictionary, the method comprising:
-
extracting a document of interest of a user in each predetermined extraction period from a predetermined server connected to the voice recognition system, and retrieving candidate unregistered words existing in the extracted document according to predetermined unregistered word extraction rules;
extracting unregistered words among the candidate unregistered words and extracting candidate semantic classes of the unregistered word based on information on appearance frequencies of the candidate unregistered words retrieved from the document; and
verifying the unregistered word according to a predetermined unregistered word verification method and determining the semantic class of the verified unregistered word with usage examples of the unregistered word obtained through a searching unit;
updating the voice recognition system with the unregistered word and the semantic class of the unregistered word. - View Dependent Claims (19, 20)
-
-
21. An apparatus for updating a dictionary comprising:
-
a document extraction unit to access a server through a network and extracting a document of interest of a user in each predetermined extraction period;
an unregistered word extraction unit to extract candidate unregistered words existing in the extracted document according to predetermined unregistered word extraction rules, and based on appearance frequency information of the candidate unregistered words in the document, to extract unregistered words among the candidate unregistered words;
an unregistered word verification unit to verify the unregistered words with usage examples of the unregistered words extracted through the server, and to determine the semantic classes of the verified unregistered words;
a first memory unit to store the unregistered words and the semantic classes of the unregistered words; and
a registration unit to register the unregistered words and the semantic classes of the unregistered words in a predetermined location of the memory unit. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. An apparatus for updating a registered word comprising:
-
a first memory unit and a second memory unit to store registered words;
a document extraction unit accessing a server through a network and extracting a document of a field of interest of a user in each predetermined extraction period;
a registered word extraction unit to extract a registered word stored in the first memory unit in the document extracted during the extraction period; and
a control unit to re-register in the second memory unit, a registered word stored in the first memory unit, based on at least one of the user'"'"'s usage frequency of the registered word, the appearance frequency and the changed appearance frequency of the registered word in the document. - View Dependent Claims (34)
-
-
35. An apparatus for updating a dictionary of a voice recognition system providing a service corresponding to a voice command of a user, with an unregistered word not registered in the dictionary, the apparatus comprising:
-
a document extraction unit accessing a server through a network and extracting a document of interest of a user in each predetermined extraction period;
an unregistered word extraction unit extracting candidate unregistered words existing in the extracted document according to predetermined unregistered word extraction rules, and based on appearance frequency information of the candidate unregistered words in the document, extracting unregistered words among the candidate unregistered words;
an unregistered word verification unit verifying the unregistered words with usage examples of the unregistered words extracted through the server, and allocating semantic information of the verified unregistered words;
a memory unit storing the unregistered words and the semantic information of the unregistered words;
a voice recognition control unit controlling a voice recognition model and a natural language processing model in order to reflect an unregistered word stored in the memory unit; and
a registration unit registering the unregistered words and the semantic information of the unregistered words in a predetermined location of the memory unit.
-
Specification