Autonomous citation indexing and literature browsing using citation context
First Claim
Patent Images
1. A computer-implemented citation indexing system comprising:
- means for locating and acquiring publications in electronic format;
a document parser for extracting semantic features, including citations, from acquired publications; and
means for identifying citations to the same publication, where said means for identifying citations to the same publication normalizes citations, sorts citations by length and processes citations in order beginning with the longest length citation, computes distance measures to all previously identified groups of citations for each citation, and either adds the citation to an existing group or creates a new group, where said distance measure comprises word matching.
2 Assignments
0 Petitions
Accused Products
Abstract
An autonomous citation indexing system which can be used as an assistant agent automates and enhances the task of finding publications in electronic form, including publications located on the world wide web. The system parses citations from papers and identifies citations to the same paper that may differ in syntax. The system also extracts and provides the context of citations to a given paper, allowing a researcher to determine what is published in other papers about a given paper. Common citations and word or string vector distance similarity are used to find related articles in a search.
392 Citations
8 Claims
-
1. A computer-implemented citation indexing system comprising:
-
means for locating and acquiring publications in electronic format;
a document parser for extracting semantic features, including citations, from acquired publications; and
means for identifying citations to the same publication, where said means for identifying citations to the same publication normalizes citations, sorts citations by length and processes citations in order beginning with the longest length citation, computes distance measures to all previously identified groups of citations for each citation, and either adds the citation to an existing group or creates a new group, where said distance measure comprises word matching.
-
-
2. A computer-implemented citation indexing system comprising:
-
means for locating and acquiring publications in electronic format;
a document parser for extracting semantic features, including citations, from acquired publications; and
means for identifying citations to the same publication, where said means for identifying citations to the same publication normalizes citations, sorts citations by length and processes citations in order beginning with the longest length citation, computes distance measures to all previously identified groups of citations for each citation, and either adds the citation to an existing group or creates a new group, where said distance measure comprises word and phrase matching.
-
-
3. A computer-implemented citation indexing system comprising:
-
means for locating and acquiring publications in electronic format, where said publications in electronic format are publications on the world wide web;
a document parser for extracting semantic features, including citations, from acquired publications; and
means for identifying citations to the same publication, where said means for identifying citations to the same publication normalizes citations, sorts citations by length and processes citations in order beginning with the longest length citation, computes distance measures to all previously identified groups of citations for each citation, and either adds the citation to an existing group or creates a new group, where said distance measure comprises word matching.
-
-
4. A computer-implemented citation indexing system comprising:
-
means for locating and acquiring publications in electronic format, where said publications in electronic format are publications on the world wide web;
a document parser for extracting semantic features, including citations, from acquired publications; and
means for identifying citations to the same publication, where said means for identifying citations to the same publication normalizes citations, sorts citations by length and processes citations in order beginning with the longest length citation, computes distance measures to all previously identified groups of citations for each citation, and either adds the citation to an existing group or creates a new group, where said distance measure comprises word and phrase matching.
-
-
5. A method of computer-implemented citation indexing for identifying different forms of citations to the same publication comprising the steps of:
-
searching for desired publications in electronic format;
locating and acquiring the desired publications;
parsing the acquired publications for extracting and storing semantic features, including citations, from the acquired publications; and
identifying citations to the same publication, said identifying citations to the same publication comprising normalizing citations, sorting citations by length and processing citations in order beginning with the longest length citation, computing for each citation, distance measures to all previously identified groups of citations, and either adding the citation to an existing group or creating a new group, where said computing distance measures comprises word matching.
-
-
6. A method of computer-implemented citation indexing for identifying different forms of citations to the same publication comprising the steps of:
-
searching for desired publications in electronic format;
locating and acquiring the desired publications;
parsing the acquired publications for extracting and storing semantic features, including citations, from the acquired publications; and
identifying citations to the same publication, said identifying citations to the same publication comprising normalizing citations, sorting citations by length and processing citations in order beginning with the longest length, computing for each citation, distance measures to all previously identified groups of citations, and either adding the citation to an existing group or creating a new group, where said computing distance measures comprises word and phrase matching.
-
-
7. A method of computer-implemented citation indexing for identifying different forms of citations to the same publication comprising the steps of:
-
searching for desired publications in electronic format, where said publications in electronic format are publications on the world wide web;
locating and acquiring the desired publications;
parsing the acquired publications for extracting and storing semantic features, including citations, from the acquired publications; and
identifying citations to the same publication, said identifying citations to the same publication comprising normalizing citations, sorting citations by length and processing citations in order beginning with the longest length citation, computing for each citation, distance measures to all previously identified groups of citations, and either adding the citation to an existing group or creating a new group, where said computing distance measures comprises word matching.
-
-
8. A method of computer-implemented citation indexing for identifying different forms of citations to the same publication comprising the steps of:
-
searching for desired publications in electronic format, where said publications in electronic format are publications on the world wide web;
locating and acquiring the desired publications;
parsing the acquired publications for extracting and storing semantic features, including citations, from the acquired publications; and
identifying citations to the same publication, said identifying citations to the same publication comprising normalizing citations, sorting citations by length and processing citations in order beginning with the longest length, computing for each citation, distance measures to all previously identified groups of citations, and either adding the citation to an existing group or creating a new group, where said computing distance measures comprises word and phrase matching.
-
Specification