System for distinguishing names of organizations in Asian writing systems
First Claim
1. A system for distinguishing a name of an organization in Chinese text, the system comprising:
- a receiving part that receives Chinese text in electronic format;
an identification part that identifies terms in the received Chinese text that belong to one group of terms that are frequently used to form a first term of a name of an organization, and another group of terms that are frequently used to form a last term of a name of an organization in Chinese;
a comparing part that compares a location in the Chinese text of the identified term that belongs to the other group to the identified term that belongs to the one group, and if predetermined conditions are met, determines that the identified term that belongs to the other group forms a name of an organization with the identified term that belongs to the one group; and
an output part that outputs information of at least one of the identified term and the name of the organization.
1 Assignment
0 Petitions
Accused Products
Abstract
A system for distinguishing names of organizations in Chinese text, which includes a computer. The computer has at least an input, an output, a processor, and a memory and storage arrangement. Data is accessible by the processor that includes at least two groups of terms that frequently respectively form the first and last terms of the names of organizations in Chinese. The system includes software, which when performed by the computer causes computer processing including identifying terms in Chinese text that has been input to the computer for terms corresponding to those in the groups in the data; comparing the location in the Chinese text of each identified term from one of the groups to identified terms from the other group, and if predefined conditions are met, determining that the identified term from one group forms the name of an organization with an identified term from the other group.
11 Citations
21 Claims
-
1. A system for distinguishing a name of an organization in Chinese text, the system comprising:
-
a receiving part that receives Chinese text in electronic format; an identification part that identifies terms in the received Chinese text that belong to one group of terms that are frequently used to form a first term of a name of an organization, and another group of terms that are frequently used to form a last term of a name of an organization in Chinese; a comparing part that compares a location in the Chinese text of the identified term that belongs to the other group to the identified term that belongs to the one group, and if predetermined conditions are met, determines that the identified term that belongs to the other group forms a name of an organization with the identified term that belongs to the one group; and an output part that outputs information of at least one of the identified term and the name of the organization. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A process for distinguishing a name of an organization appearing in a phrase or sentence in Chinese, the process comprising:
-
(a) establishing data including at least one group of terms comprising indicators that tend to indicate that a term immediately following an indicator is a first term of a name and of an organization, and another group of terms comprising terms frequently used to form a last term of a name of an organization in Chinese; (b) identifying in the Chinese phrase or sentence terms corresponding to the data; (c) comparing a location in the Chinese text of the identified term that belongs to the other group to the identified term that belongs to the one group, and if predefined conditions are met, determining that the identified term that belongs to the other group forms the name of an organization together with text immediately following, but not including, the identified term that belongs to the one group; (d) outputting information of at least one of the identified term and the name of the organization. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A storage medium readable by a computer, the storage medium storing a program of instructions executable by the computer to perform a function for distinguishing a name of an organization in a phrase or sentence in Chinese input to the computer, the function comprising:
-
receiving Chinese text in electronic format; identifying terms in the received Chinese text that belong to one group of terms that are frequently used to form a first term of a name of an organization, and another group of terms that are frequently used to form a last term of a name of an organization in Chinese; comparing a location in the Chinese text of the identified term that belongs to the other group to the identified term that belongs to the one group; if predetermined conditions are met, determining that the identified term that belongs to the other group forms a name of an organization with the identified term that belongs to the one group; and outputting information of at least one of the identified term and the name of the organization. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A method for distinguishing a name of an organization in a phrase or sentence in Chinese input to a computer, the method comprising:
-
receiving Chinese text in electronic format; identifying terms in the received Chinese text that belong to one group of terms that are frequently used to form a first term of a name of an organization, and another group of terms that are frequently used to form a last term of a name of an organization in Chinese; if predefined conditions are met, determining that the identified term that belongs to the other group forms a name of an organization with the identified term that belongs to the one group; and outputting information of at least one of the identification term and the name of the organization.
-
Specification