Automated data flow tracking
First Claim
1. A method comprising:
receiving a query comprising one or more search terms;
identifying, using the one or more search terms, a set of responsive text-based data items from a plurality of text-based data items, wherein the plurality of text-based data items were generated by a plurality of data sources, and the set of responsive text-based data items comprises a plurality of the text-based data items that are responsive to the query;
identifying one or more related text-based data items in the set of responsive text-based data items, wherein each text-based data item comprises a plurality of text-based data pieces, and the identifying the one or more related text-based data items comprises parsing a first text-based data item of the plurality of text-based data items into at least two first text-based data pieces, parsing a second text-based data item of the plurality of text-based data items into at least two second text-based data pieces, generating a fingerprint for each of the first text-based data pieces and for each of the second text-based data pieces, and comparing the fingerprints corresponding to the first text-based data pieces with the fingerprints corresponding to the second text-based data pieces to determine whether any of the first text-based data pieces are identical to any of the second text-based data pieces;
responding to the query with information identifying the set of responsive text-based data items and one or more relationships, wherein the responding comprises generating a time-sorted list of information based on two or more of a plurality of sets of relationship-related metadata;
generating one or more clusters of related data items, wherein each of the one or more clusters is generated using the information, and each cluster is grouped based upon a respective search term of the one or more search terms; and
displaying the one or more clusters to a user.
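The parse, fingerprint, and compare steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not say how an item is parsed or which fingerprint function is used, so splitting on lines and hashing with SHA-256 are assumptions, and the function names split_into_pieces, fingerprint, and shared_pieces are hypothetical.

```python
import hashlib


def split_into_pieces(text):
    """Parse a text item into pieces. One piece per non-empty line is an assumption."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def fingerprint(piece):
    """SHA-256 hex digest, standing in for the claim's unspecified fingerprint."""
    return hashlib.sha256(piece.encode("utf-8")).hexdigest()


def shared_pieces(first_item, second_item):
    """Return the pieces of first_item whose fingerprints also occur in second_item."""
    second_prints = {fingerprint(p) for p in split_into_pieces(second_item)}
    return [p for p in split_into_pieces(first_item) if fingerprint(p) in second_prints]


# Two made-up items from different sources that share one identical line.
email = "Quarterly numbers attached.\nRevenue grew 4% over Q2.\nRegards, Ann"
report = "Summary\nRevenue grew 4% over Q2.\nCosts were flat."
print(shared_pieces(email, report))  # ['Revenue grew 4% over Q2.']
```

Comparing fixed-length digests rather than the raw pieces keeps each comparison cheap regardless of piece size, which is one plausible reason to fingerprint at all. Matching digests are treated here as identical pieces.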
7 Assignments
0 Petitions
Abstract
Various automated data flow tracking techniques can involve obtaining metadata identifying the data items from multiple data sources and using that information to identify the relationships among the data items. This information can then be provided to users. For example, a method can involve receiving a query; accessing metadata, which identifies data items generated by multiple data sources; identifying a set of responsive data items from among the available data items; identifying one or more relationships between the data items in the set of responsive data items; and responding to the query with information identifying the set of responsive data items and the one or more relationships.
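The query flow summarized in the abstract, together with the time-sorted list and per-term clusters recited in claim 1, might be sketched as follows. This is a simplified illustration under stated assumptions, not the method of the patent: the Item record and its fields, the answer_query function, the substring match used to decide responsiveness, and the "shares an identical line" test used as a relationship are all hypothetical choices made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Item:
    item_id: str   # hypothetical identifier
    source: str    # data source that produced the item, e.g. "email"
    created: int   # creation time as a Unix timestamp
    text: str


def answer_query(terms, items):
    # Responsive set: items containing at least one search term (case-insensitive).
    lowered = [t.lower() for t in terms]
    responsive = [i for i in items if any(t in i.text.lower() for t in lowered)]

    # Relationships (assumed rule): pairs of responsive items sharing an identical line.
    related = [
        (a.item_id, b.item_id)
        for a, b in combinations(responsive, 2)
        if set(a.text.splitlines()) & set(b.text.splitlines())
    ]

    # Time-sorted list of the responsive items, oldest first.
    timeline = [i.item_id for i in sorted(responsive, key=lambda i: i.created)]

    # One cluster per search term, holding the ids of items that mention it.
    clusters = defaultdict(list)
    for term in lowered:
        for i in responsive:
            if term in i.text.lower():
                clusters[term].append(i.item_id)
    return timeline, related, dict(clusters)


# Made-up items from three different sources.
items = [
    Item("doc-2", "file server", 200, "Budget draft\nTotal: 40k"),
    Item("mail-1", "email", 100, "See the budget\nTotal: 40k"),
    Item("chat-3", "chat", 300, "Lunch at noon?"),
]
timeline, related, clusters = answer_query(["budget", "total"], items)
print(timeline)  # ['mail-1', 'doc-2']
print(related)   # [('doc-2', 'mail-1')]
print(clusters)  # {'budget': ['doc-2', 'mail-1'], 'total': ['doc-2', 'mail-1']}
```

In this toy data the timeline suggests that the shared line appeared in the email before it appeared in the document, which is the kind of data flow the title refers to. The claim's time-sorted list is built from relationship-related metadata; sorting the items themselves by timestamp is a simplification.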
37 Citations
19 Claims
1. A method comprising:
receiving a query comprising one or more search terms;
identifying, using the one or more search terms, a set of responsive text-based data items from a plurality of text-based data items, wherein the plurality of text-based data items were generated by a plurality of data sources, and the set of responsive text-based data items comprises a plurality of the text-based data items that are responsive to the query;
identifying one or more related text-based data items in the set of responsive text-based data items, wherein each text-based data item comprises a plurality of text-based data pieces, and the identifying the one or more related text-based data items comprises parsing a first text-based data item of the plurality of text-based data items into at least two first text-based data pieces, parsing a second text-based data item of the plurality of text-based data items into at least two second text-based data pieces, generating a fingerprint for each of the first text-based data pieces and for each of the second text-based data pieces, and comparing the fingerprints corresponding to the first text-based data pieces with the fingerprints corresponding to the second text-based data pieces to determine whether any of the first text-based data pieces are identical to any of the second text-based data pieces;
responding to the query with information identifying the set of responsive text-based data items and one or more relationships, wherein the responding comprises generating a time-sorted list of information based on two or more of a plurality of sets of relationship-related metadata;
generating one or more clusters of related data items, wherein each of the one or more clusters is generated using the information, and each cluster is grouped based upon a respective search term of the one or more search terms; and
displaying the one or more clusters to a user.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer readable storage medium comprising program instructions executable to:
receive a query comprising one or more search terms;
identify, using the one or more search terms, a set of responsive text-based data items from a plurality of text-based data items, wherein the plurality of text-based data items were generated by a plurality of data sources, and the set of responsive text-based data items comprises a plurality of the text-based data items that are responsive to the query;
identify one or more related text-based data items in the set of responsive text-based data items, wherein each text-based data item comprises a plurality of text-based data pieces, and identifying the one or more related text-based data items comprises parsing a first text-based data item of the plurality of text-based data items into at least two first text-based data pieces, parsing a second text-based data item of the plurality of text-based data items into at least two second text-based data pieces, generating a fingerprint for each of the first text-based data pieces and for each of the second text-based data pieces, and comparing the fingerprints corresponding to the first text-based data pieces with the fingerprints corresponding to the second text-based data pieces to determine whether any of the first text-based data pieces are identical to any of the second text-based data pieces;
respond to the query with information identifying the set of responsive text-based data items and one or more relationships, wherein responding to the query comprises generating a time-sorted list of information based on two or more of a plurality of sets of relationship-related metadata;
generate one or more clusters of related data items, wherein each of the one or more clusters is generated using the information, and each cluster is grouped based upon a respective search term of the one or more search terms; and
displaying the one or more clusters to a user.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A system comprising:
one or more processors; and
a memory coupled to the one or more processors, wherein the memory stores program instructions executable by the one or more processors to;
receive a query comprising one or more search terms;
identify, using the one or more search terms, a set of responsive text-based data items from a plurality of text-based data items, wherein the plurality of text-based data items were generated by a plurality of data sources, and the set of responsive text-based data items comprises a plurality of the text-based data items that are responsive to the query;
identify one or more related text-based data items in the set of responsive text-based data items, wherein each text-based data item comprises a plurality of text-based data pieces, and identifying the one or more related text-based data items comprises parsing a first text-based data item of the plurality of text-based data items into at least two first text-based data pieces, parsing a second text-based data item of the plurality of text-based data items into at least two second text-based data pieces, generating a fingerprint for each of the first text-based data pieces and for each of the second text-based data pieces, and comparing the fingerprints corresponding to the first text-based data pieces with the fingerprints corresponding to the second text-based data pieces to determine whether any of the first text-based data pieces are identical to any of the second text-based data pieces;
respond to the query with information identifying the set of responsive text-based data items and one or more relationships, wherein responding to the query comprises generating a time-sorted list of information based on two or more of a plurality of sets of relationship-related metadata;
generate one or more clusters of related data items, wherein each of the one or more clusters is generated using the information, and each cluster is grouped based upon a respective search term of the one or more search terms; and
displaying the one or more clusters to a user.
Dependent claims: 16, 17, 18, 19
Specification