Process of extracting people's full names and titles from electronically stored text sources
First Claim
Patent Images
1. A system for extracting data from electronically sources comprising:
- a processing system using a plurality of component parts working in conjunction producing extraction results.
0 Assignments
0 Petitions
Accused Products
Abstract
The invention is a process by which peoples names are extracted from electronically stored text. Electronically stored text constitutes any data stream that includes the standard ASCII characters. Examples of data streams are word processor, spreadsheet, or HTML files. The invention can find peoples names stored anywhere within the text of a website or other electronic data repository. A web site can be scanned and names of people listed on the website can be retrieved and stored into a user'"'"'s database. When a name is identified within a stream of electronic text, additional information such as the person'"'"'s job title can also be extracted.
9 Citations
20 Claims
-
1. A system for extracting data from electronically sources comprising:
- a processing system using a plurality of component parts working in conjunction producing extraction results.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
20. A system for extracting data from electronically sources comprising:
- a processing system using a plurality of component parts working in conjunction producing extraction results, said conjunction parts including a plurality of databases, a plurality of algorithms and a plurality of user interface elements, where said databases includes an additional words database, a titles database a famous people database, and a historic figure database;
said algorithms includes an extraction algorithm, a substring scoring algorithm and a final name scoring algorithm; and
said user interface elements include a substring score threshold increments user interface element, a substring score decrements user interface element, and a substring score special cases user interface element.
- a processing system using a plurality of component parts working in conjunction producing extraction results, said conjunction parts including a plurality of databases, a plurality of algorithms and a plurality of user interface elements, where said databases includes an additional words database, a titles database a famous people database, and a historic figure database;
Specification