DOCUMENT ANALYSIS AND ASSOCIATION SYSTEM AND METHOD
First Claim
1. A method for indexing a plurality of documents, each document comprising a text portion, the method comprising:
- a) parsing the text portion of each of the plurality of documents to form a plurality of respective local document indexes each associated with a respective document, and storing the local document index in a database, wherein each local document index comprises a plurality of local text terms contained in the respective document and a local weighting associated with each text term;
b) from the plurality of local document indexes, forming a global document index comprising a plurality of global text terms contained in the plurality of documents, and a global weighting associated with each global text term;
wherein the global weighting associated with each of the global text terms is determined with respect to a parameter associated with a reference global text term.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for indexing a plurality of documents, each document comprising a text portion, the method parses the text portion of each of the plurality of documents to form a plurality of respective local document indexes each associated with a respective document, and stores the local document index in a database, Each local document index has a plurality of local text terms and a local weighting associated with each text term From the plurality of local document indexes, forming a global document index associated with each global text term. The global weighting is determined with respect to a parameter associated with a reference global text term. Also, methods and systems for analyzing a text portion, retrieving documents from a database relevant to the text portion and for refining the results of a search are disclosed.
-
Citations
60 Claims
-
1. A method for indexing a plurality of documents, each document comprising a text portion, the method comprising:
-
a) parsing the text portion of each of the plurality of documents to form a plurality of respective local document indexes each associated with a respective document, and storing the local document index in a database, wherein each local document index comprises a plurality of local text terms contained in the respective document and a local weighting associated with each text term; b) from the plurality of local document indexes, forming a global document index comprising a plurality of global text terms contained in the plurality of documents, and a global weighting associated with each global text term; wherein the global weighting associated with each of the global text terms is determined with respect to a parameter associated with a reference global text term. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for indexing a plurality of documents, each document comprising a text portion, the system comprising:
-
a parsing module for parsing the text portion of each of the plurality of document to form a plurality of respective local document indexes each associated with a respective document, wherein each local document index comprises a plurality of local text terms contained in the respective document and a local weighting associated with each text term; a database adapted for storing each of the local document indexes in a memory; a processor for analysing the plurality of local document indexes and forming a global document index from the plurality of local document indexes, the global document index comprising a plurality of global text terms contained in the plurality of documents, and a global weighting associated with each global text term;
wherein the global weighting associated with each of the global text terms is determined with respect to a parameter associated with a reference global text term; and
wherein the global document index is stored in the database and related to each of the local document indexes.
-
-
11. A method for analysing a text portion and retrieving documents relevant to the text portion, to the method comprising:
-
a) receiving an input comprising an input text portion; b) identify at least one text term in the text portion; c) assigning at least one weight associated with the at least one text term; d) forming an input local index of the at least one text term and at least one associated local term weight, wherein the at least one associated local term weight is determined with reference to a global term index stored in a database, the global term index comprising a plurality of global text terms and associated global text term weights, and being formed from a plurality of reference documents, wherein a representation of each of the reference documents is stored in the database; e) querying the database to identify one or more of the reference documents of relevance with respect to the input text portion; and f) outputting a representation of the identified relevant reference documents. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
-
43. A method for refining the results of a search, the search results comprising a representation of a selected plurality of reference documents, such reference documents displayed being of relevance to an input text portion comprising one or more search terms, the selected plurality of reference documents comprising a subset of a plurality of documents in a database, the method comprising the steps of:
-
a) forming a local term index from the search terms, the local term index comprising one or more text terms, each local text term associated with a local text term weight; b) receiving and displaying the search results on a user interface, the user interface comprising input means for receiving user input with respect to one or more of the plurality of the displayed reference documents; c) accepting user input on one or more of the displayed reference documents; d) re-forming the local term index on the basis of the user input; e) on the basis of the re-formed input local term index, querying the database to identify one or more documents of enhanced relevance to the input text portion; and f) outputting a representation of the further identified reference documents of enhanced relevance. - View Dependent Claims (44, 45, 46, 47, 48, 49)
-
-
50. A system for refining the results of a search, the search results comprising a representation of a selected plurality of documents of relevance to one or more search terms, the selected plurality of documents comprising a subset of a plurality of documents in a database, the system comprising:
-
means for forming a local term index from the search terms, the local term index comprising one or more text terms, each local text term associated with a local text term weight; means for receiving and displaying the search results on a user interface, the user interface comprising input means for receiving user input with respect to each of the displayed reference documents; user input means for accepting user input on one or more of the displayed documents; processing means for analysing the user input re-forming the input local term index on the basis of the user input; query means for querying the database on the basis of the re-formed input local term index to identify one or more documents of enhanced relevance to the input text portion; and output means for outputting a representation of the further identified reference documents of enhanced relevance.
-
-
51. A system for analysing an input text portion and retrieving documents relevant to the text portion, the system comprising:
-
input means for receiving an input comprising an input text portion; identification means to identify at least one text term in the text portion; assignment means for assigning at least one weight associated with the at least one text term; indexing means for forming an input local term index of the at least one text term and at least one associated local term weight, wherein the at least one associated local text term weights is determined with reference to a global term index stored in a database, the global term index comprising a plurality of global text terms and associated global text term weights, and being formed from a plurality of reference documents, wherein a representation of each of the reference documents is stored in the database; query means for querying the database to identify one or more relevant reference documents with respect to the input text portion; and output means for outputting a representation of the identified relevant reference documents. - View Dependent Claims (52, 53, 54, 55, 56, 57)
-
-
58. A computer readable medium comprising a program for analysing a text portion and retrieving documents relevant to the text portion, said program controlling the operation of a data processing apparatus on which the program executes to perform the steps of:
-
a) receiving an input comprising an input text portion; b) identify at least one text term in the text portion; c) assigning at least one weight associated with the at least one text term; d) forming an input local index of the at least one text term and at least one associated local term weight, wherein the at least one associated local term weight is determined with reference to a global term index stored in a database, the global term index comprising a plurality of global text terms and associated global text term weights, and being formed from a plurality of reference documents, wherein a representation of each of the reference documents is stored in the database e) querying the database to identify one or more of the reference documents of relevance with respect to the input text portion; and f) outputting a representation of the identified relevant reference documents. - View Dependent Claims (59)
-
-
60. A computer readable medium comprising a program for refining the results of a search, the search results comprising a representation of a selected plurality of reference documents, such reference documents displayed being of relevance to an input text portion comprising one or more search terms, the selected plurality of documents comprising a subset of a plurality of documents in a database, said program controlling the operation of a data processing apparatus on which the program executes to perform the steps of:
-
a) forming a local term index from the search terms, the local term index comprising one or more text terms, each local text term associated with a local text term weight; b) receiving and displaying the search results on a user interface, the user interface comprising input means for receiving user input with respect to one or more of the plurality of the displayed reference documents; c) accepting user input on one or more of the displayed documents; d) re-forming the input local term index on the basis of the user input; e) on the basis of the re-formed input local term index, querying the database to identify one or more documents of enhanced relevance to the input text portion; and f) outputting a representation of the further identified reference documents of enhanced relevance.
-
Specification