Information extraction from a database
First Claim
Patent Images
1. A method performed by one or more server devices, the method comprising:
- searching, by the one or more server devices, a plurality of documents to identify a plurality of occurrences of a first tuple in text of the plurality of documents and a respective context for each occurrence of the first tuple in the text of the plurality of documents;
analyzing, by the one or more server devices, the identified plurality of occurrences and the respective context for each occurrence to identify a data pattern that corresponds to the first tuple;
extracting, by the one or more server devices, a second tuple from the text of the plurality of documents, using the data pattern, where the second tuple is different from the first tuple; and
storing, by the one or more server devices, the first tuple and the second tuple in a data storage.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for extracting information from a database are provided. A database such as the Web is searched for occurrences of tuples of information. The occurrences of the tuples of information that were found in the database are analyzed to identify a pattern in which the tuples of information were stored. Additional tuples of information can then be extracted from the database utilizing the pattern. This process can be repeated with the additional tuples of information, if desired.
-
Citations
24 Claims
-
1. A method performed by one or more server devices, the method comprising:
-
searching, by the one or more server devices, a plurality of documents to identify a plurality of occurrences of a first tuple in text of the plurality of documents and a respective context for each occurrence of the first tuple in the text of the plurality of documents; analyzing, by the one or more server devices, the identified plurality of occurrences and the respective context for each occurrence to identify a data pattern that corresponds to the first tuple; extracting, by the one or more server devices, a second tuple from the text of the plurality of documents, using the data pattern, where the second tuple is different from the first tuple; and storing, by the one or more server devices, the first tuple and the second tuple in a data storage. - View Dependent Claims (7, 8, 9, 10, 23)
-
-
2. A computer-readable memory device comprising:
one or more instructions that, when executed by one or more processors, cause the one or more processors to; search a plurality of documents to identify a plurality of occurrences of a first tuple in text of the plurality of documents and a respective context for each occurrence of the first tuple in the text of the plurality of documents; analyze the identified plurality of occurrences and the respective context for each occurrence to identify a data pattern that corresponds to the first tuple; extract a second tuple from the text of the plurality of documents, using the data pattern, where the second tuple is different from the first tuple; and store the first tuple and the second tuple in a data storage. - View Dependent Claims (11, 12)
-
3. A system, comprising:
-
one or more memory devices to store computer-executable instructions; and one or more processors to execute the computer-executable instructions, to; search a plurality of documents to identify a plurality of occurrences of a first tuple in text of the plurality of documents and a respective context for each occurrence of the first tuple in the text of the plurality of documents; analyze the identified plurality of occurrences and the respective context for each occurrence to identify a data pattern that corresponds to the first tuple; extract a second tuple from the text of the plurality of documents, using the data pattern, where the second tuple is different from the first tuple; and store the first tuple and the second tuple in a data storage. - View Dependent Claims (13, 14)
-
-
4. A method performed by one or more server devices, the method comprising:
-
searching a plurality of documents to identify a plurality of occurrences of a first tuple; identifying, using the identified plurality of occurrences, a pattern that corresponds to the first tuple; searching the plurality of documents to identify an occurrence of a second tuple that matches the identified pattern and that is different from the first tuple; and storing the first and second tuples in a data storage. - View Dependent Claims (15, 16, 17, 18, 24)
-
-
5. A computer-readable memory device comprising:
one or more instructions that, when executed by one or more processors, cause the one or more processors to ; search a plurality of documents to identify a plurality of occurrences of a first tuple; identify, using the identified plurality of occurrences, a pattern that corresponds to the first tuple; identify an occurrence of a second tuple in the plurality of documents that matches the identified pattern and that is different from the first tuple; and store the first and second tuples in a data storage. - View Dependent Claims (19, 20)
-
6. A system, comprising:
-
one or more memory devices to store computer-executable instructions; and one or more processors to execute the computer-executable instructions, to; search a plurality of documents to identify a plurality of occurrences of a first tuple; identify, using the identified plurality of occurrences, a pattern that corresponds to the first tuple; identify an occurrence of a second tuple in the plurality of documents that matches the identified pattern and that is different from the first tuple; and store the first and second tuples in a data storage. - View Dependent Claims (21, 22)
-
Specification