Disambiguating organization names
First Claim
1. A computer-implemented method of disambiguating organization names, the method comprising:
- selecting multiple names of organizations; and
using the computer to;
extract a first name identified in an electronic content item, wherein multiple different organizations have the first name;
for each of the multiple organizations, assemble a corresponding feature vector comprising;
a set of name features reflecting use of the first name in the electronic content item; and
a set of affiliation features reflecting one or more affiliations of the organization also identified in the electronic content item;
execute a model to identify, among the multiple organizations, only one organization to which the electronic content item refers; and
make the electronic content item available to a user interested in content items that reference the one organization.
2 Assignments
0 Petitions
Accused Products
Abstract
A system, method, and apparatus are provided for disambiguating organization names. Selected names that are shared among multiple organizations may or may not be categorized or characterized (e.g., by industry, by size, by reach). As content items are received (e.g., news stories, magazine articles, social media content), occurrences of the selected names are identified. Each item that includes at least one name is processed to determine which of the multiple entities that have the name (if any) is the organization referenced or mentioned in the item. The same model may be applied to disambiguate all names or, depending on the name'"'"'s categorization, different models or procedures may be applied to disambiguate the name.
24 Citations
20 Claims
-
1. A computer-implemented method of disambiguating organization names, the method comprising:
-
selecting multiple names of organizations; and using the computer to; extract a first name identified in an electronic content item, wherein multiple different organizations have the first name; for each of the multiple organizations, assemble a corresponding feature vector comprising; a set of name features reflecting use of the first name in the electronic content item; and a set of affiliation features reflecting one or more affiliations of the organization also identified in the electronic content item; execute a model to identify, among the multiple organizations, only one organization to which the electronic content item refers; and make the electronic content item available to a user interested in content items that reference the one organization. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An apparatus for disambiguating organization names, the apparatus comprising:
-
one or more processors; and logic comprising instructions that, when executed by the one or more processors, cause the apparatus to; select multiple names of organizations; extract a first name identified in an electronic content item, wherein multiple different organizations have the first name; for each of the multiple organizations, assemble a corresponding feature vector comprising; a set of name features reflecting use of the first name in the electronic content item; and a set of affiliation features reflecting one or more affiliations of the organization also identified in the electronic content item; execute a model to identify, among the multiple organizations, only one organization to which the electronic content item refers; and make the electronic content item available to a user interested in content items that reference the one organization. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A system for disambiguating organization names, comprising:
-
a physical storage device storing multiple names of organizations; one or more processors; logic comprising instructions that, when executed by the one or more processors, cause the system to; select multiple names of organizations; extract a first name identified in an electronic content item, wherein multiple different organizations have the first name; for each of the multiple organizations, assemble a corresponding feature vector comprising; a set of name features reflecting use of the first name in the electronic content item; and a set of affiliation features reflecting one or more affiliations of the organization also identified in the electronic content item; execute a model to identify, among the multiple organizations, only one organization to which the electronic content item refers; and make the electronic content item available to a user interested in content items that reference the one organization. - View Dependent Claims (20)
-
Specification