Systems, methods, interfaces and software for automated collection and integration of entity data into online databases and professional directories
Abstract
An information-retrieval system includes a server that receives queries for documents from client devices and means for outputting results of queries to the client devices, with the results provided in association with one or more interactive control features that are selectable to invoke display of information regarding entities, such as professionals, referenced in the results.
38 Claims
1. A computer-based system comprising software code executable by at least one processor, an authority database set of records, and memory, the system comprising:
a parser code set when executed adapted to identify one or more lexical elements determined to be indicators of entity data contained within a document;
an extraction code set when executed adapted to extract an entity reference record from the document based at least in part on the lexical elements;
a comparison code set when executed adapted to compare the extracted reference record against the authority database set of records;
a first match code set when executed adapted to determine whether the extracted entity reference record matches any entity record in the authority database set of records;
a harvest code set when executed adapted to harvest a set of harvested entity records from at least one source other than the authority database set of records;
a second match code set when executed adapted to determine whether the entity reference record matches any records contained in the set of harvested entity records; and
a merge code set when executed adapted to merge the entity reference record with at least a portion of data from at least one record from the set of harvested entity records determined in executing the second match code set and to add the merged entity reference record to the authority set of records.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
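As an illustration only, the parser, extraction, and first match elements of claim 1 might be sketched as follows. The claim does not say what the lexical elements are or how a match is decided; the title-based regular expression, the `EntityReference` type, and the case-insensitive name comparison are all assumptions made for this example.

```python
import re
from dataclasses import dataclass

# Hypothetical indicator lexicon: a title that signals a following person name.
TITLE_PATTERN = re.compile(
    r"\b(Judge|Attorney|Dr\.)\s+([A-Z][a-z]+(?:\s[A-Z][a-z]+)*)"
)


@dataclass(frozen=True)
class EntityReference:
    """Assumed shape of an extracted entity reference record."""
    role: str
    name: str


def extract_references(document: str) -> list[EntityReference]:
    """Scan the text for title indicators and build one record per hit."""
    return [
        EntityReference(role=m.group(1).rstrip("."), name=m.group(2))
        for m in TITLE_PATTERN.finditer(document)
    ]


def first_match(ref: EntityReference, authority: list[dict]):
    """Return the authority record whose name equals the reference name,
    ignoring case, or None when no authority record matches."""
    key = ref.name.casefold()
    for record in authority:
        if record["name"].casefold() == key:
            return record
    return None
```

A reference for which `first_match` returns `None` is the case in which the harvest, second match, and merge elements would come into play.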
15. A computer-based system comprising software code executable by at least one processor, an authority database set of records, and memory, the system comprising:
means for parsing content of a document to identify one or more lexical elements determined to be indicators of entity data;
means for extracting a first set of entity reference records from the document based at least in part on the lexical elements;
means for comparing the first set of extracted reference records against the authority database set of records;
means for communicatively connecting with an external database set of external entity records;
means for determining whether one or more of the first set of entity reference records match one or more records from the set of external entity records;
means for merging entity reference records from the first set of entity records with at least a portion of data from matching records from the set of external entity records and generating a set of merged entity reference records; and
means for updating the authority database set of records with the set of merged entity reference records.
Dependent Claims (16, 17, 18, 19, 20, 21)
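The set-oriented matching, merging, and updating steps of claim 15 could look roughly like the sketch below. It is not drawn from the specification: the external database is stood in for by an in-memory list, records are plain dictionaries keyed by a `name` field, and the rule that reference fields override external fields on conflict is an assumption chosen for the example.

```python
def merge_with_external(references, external, authority):
    """Merge each reference with its matching external record and write
    the merged records into `authority` (a dict keyed by folded name).

    Returns the list of merged records that were written.
    """
    # Index the external records once so each lookup is a dict access.
    external_by_name = {rec["name"].casefold(): rec for rec in external}

    merged = []
    for ref in references:
        match = external_by_name.get(ref["name"].casefold())
        if match is None:
            continue  # no external match, so nothing to merge for this one
        # Assumed conflict rule: external data fills gaps, reference wins.
        merged.append({**match, **ref})

    # Update the authority set with the whole merged set.
    for record in merged:
        authority[record["name"].casefold()] = record
    return merged
```

Building the merged set first and updating the authority set afterwards mirrors the claim's separation between generating the set of merged records and updating the authority database with it.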
22. A method comprising:
parsing content of a document to identify one or more lexical elements determined to be indicators of entity data;
extracting an entity reference record from the document based at least in part on the lexical elements;
comparing the extracted reference record against a predefined authority set of records;
in response to failing to match the extracted entity reference record to any entity record in the authority set of records, harvesting a set of harvested entity records from sources other than the authority set of records and attempting to match the entity reference record to the set of harvested entity records;
matching the entity reference record to at least one record from the set of harvested entity records;
merging the entity reference record with at least a portion of data from the at least one record from the set of harvested entity records; and
adding the merged entity reference record to the authority set of records.
Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
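Unlike claim 1, the method of claim 22 makes harvesting conditional on a failed match against the authority set. A minimal sketch of that control flow is given below, assuming dictionary records with a `name` field, exact case-insensitive name matching, and a caller-supplied `harvest` function; none of these details come from the claim itself.

```python
from typing import Callable, Iterable, Optional

Record = dict  # assumed record shape: a dict with at least a "name" key


def find_match(ref: Record, records: Iterable[Record]) -> Optional[Record]:
    """Return the first record with the same name, ignoring case."""
    key = ref["name"].casefold()
    return next((r for r in records if r["name"].casefold() == key), None)


def integrate(
    ref: Record,
    authority: list,
    harvest: Callable[[], list],
) -> Optional[Record]:
    """Return the record added to `authority`, or None if nothing was added."""
    if find_match(ref, authority) is not None:
        return None  # already in the authority set, so no harvesting

    harvested = harvest()  # runs only after the authority match fails
    hit = find_match(ref, harvested)
    if hit is None:
        return None  # no harvested record to merge with

    merged = {**hit, **ref}  # assumed rule: reference fields win on conflict
    authority.append(merged)
    return merged
```

Passing `harvest` as a callable keeps the potentially expensive collection step from running for entities the authority set already contains.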
Specification