Document analysis and association system and method
First Claim
1. A method for analysing a text portion and retrieving documents relevant to the text portion, the method comprising:
- a) receiving an input comprising an input text portion;
b) using a processor to identify at least one text term in the input text portion;
c) using the processor, assigning at least one weight associated with the at least one text term;
d) using the processor, to form an input local index of the at least one text term in the input text portion and at least one associated local term weight, wherein the at least one associated local term weight is determined with reference to a global term index stored in a database, the global term index comprising a plurality of global text terms and associated global text term weights, and being formed from a plurality of reference documents, wherein a representation of each of the reference documents is stored in the database;
e) using the processor, querying the database to identify one or more of the reference documents of relevance with respect to the input text portion;
f) outputting a representation of the identified relevant reference documents;
g) displaying the relevant reference documents on a user interface, the user interface comprising input means for receiving user input with respect to each of the displayed reference documents;
h) accepting user-input with respect to one or more of the displayed documents;
i) using the processor, re-forming the input local term index on the basis of the user input;
j) on the basis of the re-formed input local term index, using the processor to query the database to identify one or more relevant reference documents of enhanced relevance to the input text portion; and
k) using the processor, outputting a representation of the further identified reference documents of enhanced relevance to the user interface for presentation to a user;
wherein, the processor comprises a programmable data processing apparatus comprising instructions which execute via the processor to implement the steps (a) to (k).
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for indexing a plurality of documents, each document comprising a text portion, the method parses the text portion of each of the plurality of documents to form a plurality of respective local document indexes each associated with a respective document, and stores the local document index in a database, Each local document index has a plurality of local text terms and a local weighting associated with each text term From the plurality of local document indexes, forming a global document index associated with each global text term. The global weighting is determined with respect to a parameter associated with a reference global text term. Also, methods and systems for analyzing a text portion, retrieving documents from a database relevant to the text portion and for refining the results of a search are disclosed.
-
Citations
25 Claims
-
1. A method for analysing a text portion and retrieving documents relevant to the text portion, the method comprising:
-
a) receiving an input comprising an input text portion; b) using a processor to identify at least one text term in the input text portion; c) using the processor, assigning at least one weight associated with the at least one text term; d) using the processor, to form an input local index of the at least one text term in the input text portion and at least one associated local term weight, wherein the at least one associated local term weight is determined with reference to a global term index stored in a database, the global term index comprising a plurality of global text terms and associated global text term weights, and being formed from a plurality of reference documents, wherein a representation of each of the reference documents is stored in the database; e) using the processor, querying the database to identify one or more of the reference documents of relevance with respect to the input text portion; f) outputting a representation of the identified relevant reference documents; g) displaying the relevant reference documents on a user interface, the user interface comprising input means for receiving user input with respect to each of the displayed reference documents; h) accepting user-input with respect to one or more of the displayed documents; i) using the processor, re-forming the input local term index on the basis of the user input; j) on the basis of the re-formed input local term index, using the processor to query the database to identify one or more relevant reference documents of enhanced relevance to the input text portion; and k) using the processor, outputting a representation of the further identified reference documents of enhanced relevance to the user interface for presentation to a user;
wherein, the processor comprises a programmable data processing apparatus comprising instructions which execute via the processor to implement the steps (a) to (k). - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. The method for refining the results of a search, the search results comprising a representation of a selected plurality of reference documents, such reference documents displayed being of relevance to an input text portion comprising one or more search terms, the selected plurality of reference documents comprising a subset of a plurality of documents in a database, the method comprising the steps of:
-
a) using a processor to form a local term index from the search terms in the input text portion, the local term index comprising one or more text terms, each local text term associated with a local text term weight; b) using the processor, receiving and displaying the search results on a user interface, the user interface comprising input means for receiving user input with respect to one or more of the plurality of the displayed reference documents; c) accepting user input on one or more of the displayed reference documents; d) using the processor, re-forming the local term index on the basis of the user input; e) on the basis of the re-formed input local term index, using the processor to query the database to identify one or more documents of enhanced relevance to the input text portion; and f) outputting a representation of the further identified reference documents of enhanced relevance to the user interface for presentation to a user; wherein, the processor comprises a programmable data processing apparatus comprising instructions which execute via the processor to implement the steps (a) to (f); wherein in step a) the local text term weights for each of the local text terms are derived from a global text term index, the global text term index comprising a plurality of text terms associated with global text term weights, wherein the global text term weights are derived from text term analysis of a plurality of documents. - View Dependent Claims (16, 17, 18, 19)
-
-
20. The system for refining the results of a search, the search results comprising a representation of a selected plurality of documents of relevance to one or more search terms, the selected plurality of documents comprising a subset of a plurality of documents in a database, the system comprising:
-
processing means for forming a local term index from the search terms, the local term index comprising one or more text terms, each local text term associated with a local text term weight; processing means for receiving and displaying the search results on a user interface, the user interface comprising input means for receiving user input with respect to each of the displayed reference documents, weights for each of the local text terms are derived from a global text term index, the global text term index comprising a plurality of text terms associated with global text term weights, wherein the global text term weights are derived from text term analysis of a plurality of documents; user input means for accepting user input on one or more of the displayed documents; processing means for analysing the user input and re-forming the input local term index on the basis of the user input; query means for querying the database on the basis of the re-formed input local term index to identify one or more documents of enhanced relevance to the input text portion; and output means for outputting a representation of the further identified reference documents of enhanced relevance.
-
-
21. A system for analysing an input text portion and retrieving documents relevant to the text portion, the system comprising:
-
input means for receiving an input comprising an input text portion; identification means to identify at least one text term in the text portion; assignment means for assigning at least one weight associated with the at least one text term; indexing means for forming an input local term index of the at least one text term in the input text portion and at least one associated local term weight, wherein the at least one associated local text term weights is determined with reference to a global term index stored in a database, the global term index comprising a plurality of global text terms and associated global text term weights, and being formed from a plurality of reference documents, wherein a representation of each of the reference documents is stored in the database; query means for querying the database to identify one or more relevant reference documents with respect to the input text portion; output means for outputting a representation of the identified relevant reference documents; display means for displaying the relevant reference documents on a user interface, the user interface comprising input means for receiving user input with respect to each of the displayed reference documents; user input means accepting user-input on one or more of the displayed documents; processing means for analysing the user input and re-forming the input local text term index; query means for querying the database on the basis of the re-formed input local text term index to identify one or more relevant reference documents of enhanced relevance to the input text portion; and output means for outputting a representation of the further identified reference documents of enhanced relevance. - View Dependent Claims (22, 23)
-
-
24. A non-transitory computer readable medium comprising a program for analysing a text portion and retrieving documents relevant to the text portion, said program controlling the operation of a data processing apparatus on which the program executes on a processor to perform the steps of:
-
a) receiving an input comprising an input text portion; b) identify at least one text term in the text portion; c) assigning at least one weight associated with the at least one text term; d) forming an input local index of the at least one text term and at least one associated local term weight, wherein the at least one associated local term weight is determined with reference to a global term index stored in a database, the global term index comprising a plurality of global text terms and associated global text term weights, and being formed from a plurality of reference documents, wherein a representation of each of the reference documents is stored in the database e) querying the database to identify one or more of the reference documents of relevance with respect to the input text portion; f) outputting a representation of the identified relevant reference documents; g) displaying the relevant reference documents on a user interface, the user interface comprising input means for receiving user input with respect to each of the displayed reference documents; h) accepting user-input with respect to one or more of the displayed documents; i) re-forming the input local term index on the basis of the user input; j) on the basis of the re-formed input local term index, querying the database to identify one or more relevant reference documents of enhanced relevance to the input text portion; and k) outputting a representation of the further identified reference documents of enhanced relevance.
-
-
25. A non-transitory computer readable medium comprising a program for refining the results of a search, the search results comprising a representation of a selected plurality of reference documents, such reference documents displayed being of relevance to an input text portion comprising one or more search terms, the selected plurality of documents comprising a subset of a plurality of documents in a database, said program controlling the operation of a data processing apparatus on which the program executes on a processor to perform the steps of:
-
a) forming a local term index from the search terms, the local term index comprising one or more text terms, each local text term associated with a local text term weight, wherein the local text term weights for each of the local text terms are derived from a global text term index, the global text term index comprising a plurality of text terms associated with global text term weights, wherein the global text term weights are derived from text term analysis of a plurality of documents; b) receiving and displaying the search results on a user interface, the user interface comprising input means for receiving user input with respect to one or more of the plurality of the displayed reference documents; c) accepting user input on one or more of the displayed documents; d) re-forming the input local term index on the basis of the user input; e) on the basis of the re-formed input local term index, querying the database to identify one or more documents of enhanced relevance to the input text portion; and f) outputting a representation of the further identified reference documents of enhanced relevance.
-
Specification