Autonomous citation indexing and literature browsing using citation context
First Claim
Patent Images
1. A computer-implemented citation indexing system comprising:
- means for locating and acquiring publications in electronic format;
a document parser for extracting semantic features, including citations, from acquired publications; and
means for autonomously identifying variant forms of citations to the same publication, where said means for autonomously identifying variant forms of citations to the same publication normalizes citations, sorts citations by length and processes citations in order beginning with the longest length citation, computes distance measures to all previously identified groups of citations for each citation, and either adds the citation to an existing group or creates a new group, where said distance measure comprises word matching.
1 Assignment
0 Petitions
Accused Products
Abstract
An autonomous citation indexing system which can be used as an assistant agent automates and enhances the task of finding publications in electronic form, including publications located on the world wide web. The system parses citations from papers and identifies citations to the same paper that may differ in syntax. The system also extracts and provides the context of citations to a given paper, allowing a researcher to determine what is published in other papers about a given paper. Common citations and word or string vector distance similarity are used to find related articles in a search.
116 Citations
8 Claims
-
1. A computer-implemented citation indexing system comprising:
-
means for locating and acquiring publications in electronic format;
a document parser for extracting semantic features, including citations, from acquired publications; and
means for autonomously identifying variant forms of citations to the same publication, where said means for autonomously identifying variant forms of citations to the same publication normalizes citations, sorts citations by length and processes citations in order beginning with the longest length citation, computes distance measures to all previously identified groups of citations for each citation, and either adds the citation to an existing group or creates a new group, where said distance measure comprises word matching. - View Dependent Claims (2, 3, 4)
-
-
5. A method of computer-implemented citation indexing for identifying different forms of citations to the same publication comprising the steps of:
-
searching for desired publications in electronic format;
locating and acquiring the desired publications;
parsing the acquired publications for extracting and storing semantic features, including citations, from the acquired publications; and
autonomously identifying variant forms of citations to the same publication, said autonomously identifying variant forms of citations to the same publication comprising normalizing citations, sorting citations by length and processing citations in order beginning with the longest length citation, computing for each citation, distance measures to all previously identified groups of citations, and either adding the citation to an existing group or creating a new group, where said computing distance measures comprises word matching. - View Dependent Claims (6, 7, 8)
-
Specification