Automated document analysis comprising company name recognition
First Claim
1. A method for performing, by at least one processing device, automated document analysis of a document comprising a body of text, the method comprising:
- identifying, by at least two company name recognition components implemented by the at least one processing device and based at least in part on a company identifier list, at least one company name occurrence in the body of text, wherein each of the at least two company name recognition components implements a company name recognition technique different from techniques implemented by others of the at least two company name recognition components;
updating, by the at least one processing device, the identified companies name list based on the at least one company name occurrence to provide an updated company identifier list; and
subsequent to updating the company identifier list, identifying, by the at least two company name recognition components and based on the updated company identifier list, at least one additional company name occurrence in the body of text.
6 Assignments
0 Petitions
Accused Products
Abstract
At least two processing device-implemented company name recognition components, operating upon a body of text in a document, identify at least one company name occurrence in the body of text based at least in part on a company identifier list. The company name recognition techniques implemented by each of the at least two company name recognition components are different from each other. The at least one company name occurrence is used to update the company identifier list. The updated company identifier list is then used by the at least two company name recognition components to identify at least one additional name occurrence in the same body of text. This process of repeatedly identifying occurrences of company names in the body of text and updating the company identifier list is performed until such time that no further company name occurrences are identified in the body of text.
-
Citations
33 Claims
-
1. A method for performing, by at least one processing device, automated document analysis of a document comprising a body of text, the method comprising:
-
identifying, by at least two company name recognition components implemented by the at least one processing device and based at least in part on a company identifier list, at least one company name occurrence in the body of text, wherein each of the at least two company name recognition components implements a company name recognition technique different from techniques implemented by others of the at least two company name recognition components; updating, by the at least one processing device, the identified companies name list based on the at least one company name occurrence to provide an updated company identifier list; and subsequent to updating the company identifier list, identifying, by the at least two company name recognition components and based on the updated company identifier list, at least one additional company name occurrence in the body of text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. An apparatus comprising
at least one processing device; - and
memory operatively connected to the at least one processing device, the memory comprising executable instructions that when executed by the at least one processing device cause the at least one processing device to; identify, by at least two company name recognition techniques and based at least in part on a company identifier list, at least one company name occurrence in a body of text, wherein each of the at least two company name recognition techniques are different from each other; update the identified companies name list based on the at least one company name occurrence to provide an updated company identifier list; and subsequent to updating the company identifier list, identify, by the at least two company name recognition techniques and based on the updated company identifier list, at least one additional company name occurrence in the body of text. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
- and
-
23. A non-transitory computer readable medium comprising executable instructions that when executed by at least one processing device cause the at least one processing device to perform automated document analysis of a document comprising a body of text in which the at least one processing device is caused to:
-
identify, by at least two company name recognition techniques and based at least in part on a company identifier list, at least one company name occurrence in the body of text, wherein each of the at least two company name recognition techniques are different from each other; update the identified companies name list based on the at least one company name occurrence to provide an updated company identifier list; and subsequent to updating the company identifier list, identify, by the at least two company name recognition techniques and based on the updated company identifier list, at least one additional company name occurrence in the body of text. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
Specification