Automated document analysis comprising company name recognition
Abstract
At least two processing device-implemented company name recognition components, operating upon a body of text in a document, identify at least one company name occurrence in the body of text based at least in part on a company identifier list. The company name recognition techniques implemented by each of the at least two company name recognition components are different from each other. The at least one company name occurrence is used to update the company identifier list. The updated company identifier list is then used by the at least two company name recognition components to identify at least one additional name occurrence in the same body of text. This process of repeatedly identifying occurrences of company names in the body of text and updating the company identifier list is performed until such time that no further company name occurrences are identified in the body of text.
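The loop described in the abstract can be sketched as a fixed-point iteration. This is a minimal illustration, not the patented implementation: the two recognizers (`find_short_forms`, `find_long_forms`), the `SUFFIXES` tuple, and the function `recognize_until_stable` are hypothetical names chosen for this sketch, and the abstract does not say which recognition techniques are actually used. The sketch stops once a pass over the text adds nothing new to the list.

```python
import re

# Hypothetical set of corporate suffixes; the source does not specify one.
SUFFIXES = ("Inc.", "Corp.", "LLC", "Ltd.")
_SUFFIX_RE = "|".join(re.escape(s) for s in SUFFIXES)


def find_short_forms(text, names):
    """Recognizer 1 (assumed): a listed name with its corporate suffix dropped."""
    found = set()
    for name in names:
        for suffix in SUFFIXES:
            if name.endswith(" " + suffix):
                stem = name[: -len(suffix) - 1]
                # Whole-token match so the stem is not found inside a longer word.
                if re.search(r"(?<!\w)" + re.escape(stem) + r"(?!\w)", text):
                    found.add(stem)
    return found


def find_long_forms(text, names):
    """Recognizer 2 (assumed): a listed name extended by capitalized words and a suffix."""
    found = set()
    for name in names:
        pattern = (
            r"(?<!\w)" + re.escape(name)
            + r"(?:\s+[A-Z]\w*)*?\s+(?:" + _SUFFIX_RE + r")"
        )
        found.update(m.group(0) for m in re.finditer(pattern, text))
    return found


def recognize_until_stable(text, seed_names):
    """Run both recognizers repeatedly, growing the list until nothing new appears."""
    names = set(seed_names)
    while True:
        found = find_short_forms(text, names) | find_long_forms(text, names)
        new = found - names
        if not new:
            return names
        names |= new  # the updated list feeds the next pass over the same text
```

With the seed list `{"Acme Widgets Inc."}` and a text that also mentions "Acme Widgets" and "Acme Widgets Holdings LLC", the first pass adds the short form "Acme Widgets", the second pass uses it to find "Acme Widgets Holdings LLC", and the third pass adds "Acme Widgets Holdings". A fourth pass finds nothing new, so the loop ends.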
20 Claims
1. A method for performing, by at least one processing device, automated document analysis of a document comprising a body of text, the method comprising:
identifying at least one company name occurrence in the body of text based on matching portions of the body of text with company names in a company identifier list;
updating, by the at least one processing device, the company identifier list, wherein updating the company identifier list comprises adding the at least one company name occurrence to the company identifier list;
subsequent to updating the company identifier list, identifying at least one additional company name occurrence in the body of text;
comparing the at least one additional company name occurrence against an excluded company name list; and
subsequent to the comparison, omitting the at least one additional company name occurrence from the company identifier list.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system comprising:
at least one processing device; and
memory operatively connected to the at least one processing device, the memory comprising executable instructions that when executed by the at least one processing device cause the at least one processing device to:
identify at least one company name occurrence in a body of text based on matching portions of the body of text with company names in a company identifier list;
update the company identifier list, wherein updating the company identifier list comprises adding the at least one company name occurrence to the company identifier list;
subsequent to updating the company identifier list, identify at least one additional company name occurrence in the body of text;
compare the at least one additional company name occurrence against an excluded company name list; and
subsequent to the comparison, omit the at least one additional company name occurrence from the company identifier list.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A non-transitory computer readable medium comprising executable instructions that when executed by at least one processing device cause the at least one processing device to perform automated document analysis of a document comprising a body of text in which the at least one processing device is caused to:
identify at least one company name occurrence in the body of text based on matching portions of the body of text with company names in a company identifier list;
update the company identifier list, wherein updating the company identifier list comprises adding the at least one company name occurrence to the company identifier list;
subsequent to updating the company identifier list, identify at least one additional company name occurrence in the body of text;
compare the at least one additional company name occurrence against an excluded company name list; and
subsequent to the comparison, omit the at least one additional company name occurrence from the company identifier list.
Dependent claims: 18, 19, 20
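The steps shared by the independent claims (match, update, identify additional occurrences, compare against an excluded list, omit) can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the claims do not say how additional occurrences are identified, so the sketch assumes they are suffix-stripped short forms of names already matched. The names `analyze`, `occurs`, `strip_suffix`, and `SUFFIXES` are hypothetical.

```python
import re

# Hypothetical set of corporate suffixes; not taken from the claims.
SUFFIXES = ("Inc.", "Corp.", "LLC", "Ltd.")


def occurs(name, text):
    """True if `name` appears in `text` as a whole token sequence."""
    return re.search(r"(?<!\w)" + re.escape(name) + r"(?!\w)", text) is not None


def strip_suffix(name):
    """Return `name` without a trailing corporate suffix, or None if it has none."""
    for suffix in SUFFIXES:
        if name.endswith(" " + suffix):
            return name[: -len(suffix) - 1]
    return None


def analyze(text, identifier_list, excluded_names):
    # Identify occurrences by matching the text against listed names.
    occurrences = {n for n in identifier_list if occurs(n, text)}

    # Update the list with the occurrences. With exact matching this adds
    # nothing new, but it mirrors the order of the claimed steps.
    names = set(identifier_list) | occurrences

    # Identify additional occurrences (assumption: short forms of matched names).
    additional = set()
    for name in occurrences:
        stem = strip_suffix(name)
        if stem and stem not in names and occurs(stem, text):
            additional.add(stem)

    # Compare against the excluded list and omit any hits from the identifier list.
    omitted = additional & set(excluded_names)
    names |= additional - omitted
    return names, omitted
```

For example, with the list `{"Target Corp.", "Globex Corp."}`, the excluded list `{"Target"}`, and a text that mentions both companies, the short form "Globex" joins the identifier list while "Target" is found but kept out of it.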
Specification