Policy based population of genealogical archive data
First Claim
1. A system implemented in hardware and comprising a computer infrastructure operable to:
create an electronic archive for a user based on a family tree;
discover Internet-based data associated with at least one member of the family tree comprising:
crawling through a plurality of nodes and analyzing at least one site identified in at least one of the plurality of nodes;
inspecting the at least one site for at least one new link that matches a name of a member of the family tree and links to another site at which the user has an account; and
adding the at least one new link to the at least one of the plurality of nodes upon receiving permission from the user;
determine a probability that the Internet-based data is related to the user;
determine if the probability exceeds a predefined threshold value;
identify the Internet-based data within the at least one site for which the probability exceeds the predefined threshold value and which complies with content policies and has not been collected in the past;
add the identified Internet-based data to the at least one of the plurality of nodes and mark the identified Internet-based data as new and un-reviewed data;
present the new and un-reviewed data to the user for permission or denial to add the new and un-reviewed data to the archive; and
add the Internet-based data from the at least one site to the archive upon receiving permission from the user.
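The filtering and review steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the names `Candidate`, `Node`, `select_candidates`, `mark_new`, and `review_with_user` are hypothetical, the probability is assumed to be supplied by some earlier scoring step, and the content-policy check and the user prompt are passed in as plain callables.

```python
from dataclasses import dataclass, field


@dataclass
class Candidate:
    """One piece of Internet-based data found on a site (hypothetical type)."""
    url: str
    text: str
    probability: float  # assumed to come from an earlier scoring step


@dataclass
class Node:
    """A crawl node holding a site and the data attached to it (hypothetical type)."""
    site: str
    items: list = field(default_factory=list)


def select_candidates(candidates, threshold, is_policy_compliant, seen_urls):
    """Keep data whose probability exceeds the threshold, that complies
    with content policies, and that has not been collected in the past."""
    return [
        c for c in candidates
        if c.probability > threshold
        and is_policy_compliant(c)
        and c.url not in seen_urls
    ]


def mark_new(node, selected, seen_urls):
    """Add the selected data to the node, marked as new and un-reviewed."""
    for c in selected:
        node.items.append({"candidate": c, "status": "new-unreviewed"})
        seen_urls.add(c.url)


def review_with_user(node, archive, ask_user):
    """Present un-reviewed data to the user; archive only what is permitted."""
    for entry in node.items:
        if entry["status"] != "new-unreviewed":
            continue
        if ask_user(entry["candidate"]):
            archive.append(entry["candidate"])
            entry["status"] = "approved"
        else:
            entry["status"] = "denied"
```

In this sketch nothing reaches the archive without an explicit `True` from `ask_user`, which mirrors the claim's requirement that data is added only upon receiving permission from the user.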
Abstract
An approach for managing a family tree archive is provided. The approach includes creating an electronic archive based on a family tree. The approach also includes automatically discovering Internet-based data associated with at least one member of the family tree. The approach additionally includes adding the Internet-based data to the archive. The approach further includes storing the archive at a storage device.
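As a rough illustration of the abstract's create, add, and store steps, the sketch below builds an archive keyed by family-tree member and writes it to a file. Everything here is an assumption made for illustration: the abstract does not specify a data model or storage format, so the function names, the list-of-names family tree, and the JSON file are all hypothetical.

```python
import json
from pathlib import Path


def create_archive(family_tree):
    """Create an empty archive with one entry per family-tree member.

    `family_tree` is assumed to be an iterable of member names.
    """
    return {member: [] for member in family_tree}


def add_data(archive, member, item):
    """Attach one discovered item to a member who is already in the archive."""
    if member not in archive:
        raise KeyError(f"{member!r} is not in the family tree")
    archive[member].append(item)


def store_archive(archive, path):
    """Persist the archive as JSON (the storage format is an assumption)."""
    Path(path).write_text(json.dumps(archive, indent=2), encoding="utf-8")


def load_archive(path):
    """Read a previously stored archive back from disk."""
    return json.loads(Path(path).read_text(encoding="utf-8"))
```

A caller would typically run `create_archive` once, call `add_data` for each item the user has approved, and then call `store_archive` to write the result to the storage device.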
11 Claims
Specification