SYSTEMS AND METHODS FOR EXTRACTING PEDIGREE AND FAMILY RELATIONSHIP INFORMATION FROM DOCUMENTS
Abstract
A computer-implemented method for extracting information about individuals from a family history document includes applying optical character recognition (OCR) to a digital image of a family history document to create an OCR copy, identifying a person's name in the digital image, extracting name data and related information from the OCR copy representing the name, identifying a family relationship indicator corresponding to the identified person's name in the digital image, confirming accuracy of the extracted name data, and publishing the extracted name data and related information in a searchable format.
20 Claims
1. A computer-implemented method for extracting personal information from a family history document, comprising:
- applying optical character recognition (OCR) to a digital image of a family history document to create an OCR copy;
- identifying a person's name in the digital image;
- extracting name data from the OCR copy representing the name;
- confirming accuracy of the extracted name data;
- publishing the extracted name data in a searchable format.
Dependent claims: 2, 3, 6, 7, 8, 9
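For illustration only, the steps of claim 1 after the OCR step can be sketched as a small pipeline. This is a minimal sketch under stated assumptions, not the patented implementation: the OCR copy is assumed to be plain text already, and `NAME_PATTERN`, `extract_names`, `confirm_names`, and `publish` are hypothetical names. The "searchable format" is stood in for by a simple token index, and confirmation is modeled as a reviewer callback.

```python
import re
from typing import Callable, Dict, List, Set

# Assumption: two or more capitalized words in a row are treated as a name.
NAME_PATTERN = re.compile(r"\b[A-Z][a-z]+(?: [A-Z][a-z]+)+\b")


def extract_names(ocr_text: str) -> List[str]:
    """Return candidate names found in the OCR copy, in document order."""
    return NAME_PATTERN.findall(ocr_text)


def confirm_names(names: List[str], is_accurate: Callable[[str], bool]) -> List[str]:
    """Keep only names that the reviewer callback (human or automated) accepts."""
    return [name for name in names if is_accurate(name)]


def publish(names: List[str]) -> Dict[str, Set[str]]:
    """Build a lowercase token -> names index as a stand-in for a searchable format."""
    index: Dict[str, Set[str]] = {}
    for name in names:
        for token in name.lower().split():
            index.setdefault(token, set()).add(name)
    return index


ocr_copy = "John Smith married Mary Jones in 1850."
candidates = extract_names(ocr_copy)
confirmed = confirm_names(candidates, lambda name: True)  # accept everything here
index = publish(confirmed)
print(sorted(index["smith"]))
```

A real system would replace the regular expression with a stronger name recognizer and the callback with a review step; the sketch only shows how the claimed steps feed into one another.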
4. The method of claim 3, further comprising automatically associating the at least one of a birth date, a death date, and a marriage date with the person's name.
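As a hedged illustration of the automatic association described in claim 4, the sketch below attaches dates found on the same OCR line to a person's name. The abbreviations `b.`, `d.`, and `m.` and the four-digit year format are assumptions made for the example, and `associate_dates` is a hypothetical helper; the source does not specify how dates are marked.

```python
import re
from typing import Dict

# Assumed markers; real family history documents vary widely.
DATE_MARKERS = {"b.": "birth", "d.": "death", "m.": "marriage"}
DATE_PATTERN = re.compile(r"\b(b\.|d\.|m\.)\s*(\d{4})")


def associate_dates(name: str, ocr_line: str) -> Dict[str, str]:
    """Return a record linking the name to any dated events on the same line."""
    record = {"name": name}
    for marker, year in DATE_PATTERN.findall(ocr_line):
        record[DATE_MARKERS[marker]] = year
    return record


record = associate_dates("John Smith", "John Smith, b. 1820, m. 1845, d. 1890")
print(record)
```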
10. A computing device configured to extract personal information from a family history document, comprising:
- a processor;
- memory in electronic communication with the processor;
- an image processing module configured to:
  - digitize a family history document to create a digital image, the digital image including information about individuals;
  - conduct optical character recognition (OCR) on the digital image and create an OCR copy;
- a pedigree module configured to:
  - create an extracted content file by correcting OCR errors, aggregating information about the individuals from the digital image, and identifying family relationships between individuals included in the digital image;
  - provide the extracted content file in a searchable format.
Dependent claims: 11, 12, 13, 14, 15, 16
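The pedigree module of claim 10 corrects OCR errors and aggregates information about individuals. The following is a rough sketch of those two operations, assuming that the OCR errors of interest are digits misread for letters inside words and that facts arrive as `(name, field, value)` tuples. `OCR_FIXES`, `correct_word`, `correct_text`, and `aggregate` are hypothetical names chosen for the example.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Assumed confusions: a digit misread in place of a similar-looking letter.
OCR_FIXES = str.maketrans({"1": "l", "0": "o", "5": "s"})


def correct_word(word: str) -> str:
    """Fix digit-for-letter confusions only in words that are mostly letters."""
    letters = sum(ch.isalpha() for ch in word)
    if letters > len(word) / 2:
        return word.translate(OCR_FIXES)
    return word  # leave years and other numeric tokens untouched


def correct_text(text: str) -> str:
    """Apply word-level correction across a whitespace-separated string."""
    return " ".join(correct_word(word) for word in text.split())


def aggregate(facts: List[Tuple[str, str, str]]) -> Dict[str, Dict[str, str]]:
    """Merge facts gathered across the document into one record per person."""
    people: Dict[str, Dict[str, str]] = defaultdict(dict)
    for name, field, value in facts:
        people[correct_text(name)][field] = value
    return dict(people)


facts = [
    ("Wi11iam Smith", "birth", "1850"),
    ("William Smith", "father", "John Smith"),
]
print(aggregate(facts))
```

Correcting the name before using it as a key is what lets the two mentions collapse into a single record; a production system would need a far more careful matching step than exact string equality.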
17. A computer-program product for extracting personal information from a family history document, the computer-program product comprising a computer-readable medium having instructions thereon, the instructions comprising:
- code programmed to identify a person's name in a digital image of a family history document;
- code programmed to extract data for the person's name from an optical character recognition (OCR) copy of the digital image, the data including at least a family relationship indicator and at least one of a birth date, a death date, and a marriage date;
- code programmed to publish the extracted data in a searchable format.
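Claim 17 requires the extracted data to include a family relationship indicator. One way such an indicator might be detected in OCR text is sketched below, assuming phrases of the form "X, son of Y". The phrase list, the name pattern, and `find_relationships` are all assumptions for illustration; the claim itself does not define what an indicator looks like.

```python
import re
from typing import List, Tuple

# Assumption: a name is two or more capitalized words in a row.
NAME = r"[A-Z][a-z]+(?: [A-Z][a-z]+)+"
# Assumed indicator phrases; a real document may use many other forms.
RELATION = re.compile(rf"({NAME}), (son|daughter|wife|husband) of ({NAME})")


def find_relationships(ocr_text: str) -> List[Tuple[str, str, str]]:
    """Return (person, indicator, relative) triples found in the OCR copy."""
    return [
        (match.group(1), match.group(2), match.group(3))
        for match in RELATION.finditer(ocr_text)
    ]


text = "William Smith, son of John Smith. Ann Smith, daughter of John Smith."
print(find_relationships(text))
```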
18. A computer-implemented method for extracting personal information from a digital family history document, comprising:
- displaying an image of the digital family history document;
- loading an optical character recognition (OCR) copy of the digital family history document;
- manually extracting data from the image, the data including at least a name for an individual identified in the image;
- automatically extracting at least some data from the image that is mapped to the OCR copy;
- providing the extracted data in a searchable format.
Dependent claims: 19, 20
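Claim 18 refers to data in the image that is "mapped to the OCR copy". A plausible reading, offered here only as an assumption, is that each OCR word carries the pixel bounding box it came from, so that a region selected on the displayed image can be resolved to text automatically. The `OcrWord` type and `words_in_region` function below are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class OcrWord:
    """One recognized word plus its bounding box in image pixel coordinates."""
    text: str
    left: int
    top: int
    right: int
    bottom: int


def words_in_region(words: List[OcrWord], left: int, top: int,
                    right: int, bottom: int) -> str:
    """Return the OCR text whose word centers fall inside the selected rectangle."""
    picked = []
    for word in words:
        center_x = (word.left + word.right) / 2
        center_y = (word.top + word.bottom) / 2
        if left <= center_x <= right and top <= center_y <= bottom:
            picked.append(word)
    # Restore reading order: top to bottom, then left to right.
    picked.sort(key=lambda word: (word.top, word.left))
    return " ".join(word.text for word in picked)


words = [
    OcrWord("John", 10, 10, 50, 30),
    OcrWord("Smith", 60, 10, 110, 30),
    OcrWord("1850", 200, 10, 240, 30),
]
print(words_in_region(words, 0, 0, 120, 40))
```

Under this reading, the manual step is the user drawing the rectangle around a name, and the automatic step is the lookup that turns the rectangle into text.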
Specification