Method and system for document presentation and analysis
First Claim
1. A document analysis and search system for searching through a plurality of documents and for analyzing documents located as a result of a conducted search, the search being directed to a predetermined subject matter, the system comprising:
- a client device comprising at least one of a non-transitory computer readable medium and a memory;
a program module stored on the client device, the client device being positioned in communication with a network, and the network being in communication with a document provider database and a thesaurus database, the program module comprising instructions executable by a processor of the client device to locate at least one document from among the plurality of documents, the program module comprisingan interface module; and
a document analysis module;
wherein the interface module receives concept data relating to the subject matter of the search, the concept data including at least one concept, the concept including a plurality of keywords used to conduct the search;
wherein the interface module receives a plurality of documents relating to the concept data from the document provider database;
wherein the interface module generates and displays a document analysis graphical user interface, the document analysis graphical user interface comprisinga keyword entry interface,a document relevancy interface,a document management interface, anda document image window,wherein the document analysis module generates statistical data based on the at least one concept, and wherein the statistical data is used to assess relevancy of each of the documents located in the search so that each of the documents can be displayed using the document relevancy interface and wherein the statistical data includes a count of a number of instances that each of the keywords appears in a document located in the search;
wherein the document analysis module transmits the statistical data to the interface module to be displayed;
wherein the document analysis module transforms the count of the number of instances of each of the keywords into a normalized count; and
wherein the normalized count is calculated by dividing a total number of text characters in each document by five to provide a normalized word count, dividing the count of the number of instances of each of the keywords by the normalized word count to provide a density of each of the keywords, and multiplying the density of each of the keywords by a predetermined value to provide the normalized count;
wherein the keyword entry interface allows entry of one or more keyword groups, and wherein each keyword group includes a plurality of keywords that are conceptually related to one another.
0 Assignments
0 Petitions
Accused Products
Abstract
A document analysis system receives multiple concepts along with multiple reference documents and generates sensory indicators that assist a researcher in assessing the relevance of each of the documents to the concepts. In one exemplary aspect, the document analysis system displays a table of keywords separated into blocks, each block of keywords corresponding to one of the concepts. Each block is colored according to the prevalence of any keyword within a given keyword group. The color of a block thus indicates the relative presence of a concept in the document. The document analysis system also determines a unique color for each block of keywords for highlighting in the text of the document. In this manner a researcher can quickly identify passages that contain multiple concepts. Additionally, the researcher is provided the ability to quickly locate reference characters, figure numbers and patent numbers in the document.
43 Citations
45 Claims
-
1. A document analysis and search system for searching through a plurality of documents and for analyzing documents located as a result of a conducted search, the search being directed to a predetermined subject matter, the system comprising:
-
a client device comprising at least one of a non-transitory computer readable medium and a memory; a program module stored on the client device, the client device being positioned in communication with a network, and the network being in communication with a document provider database and a thesaurus database, the program module comprising instructions executable by a processor of the client device to locate at least one document from among the plurality of documents, the program module comprising an interface module; and a document analysis module; wherein the interface module receives concept data relating to the subject matter of the search, the concept data including at least one concept, the concept including a plurality of keywords used to conduct the search; wherein the interface module receives a plurality of documents relating to the concept data from the document provider database; wherein the interface module generates and displays a document analysis graphical user interface, the document analysis graphical user interface comprising a keyword entry interface, a document relevancy interface, a document management interface, and a document image window, wherein the document analysis module generates statistical data based on the at least one concept, and wherein the statistical data is used to assess relevancy of each of the documents located in the search so that each of the documents can be displayed using the document relevancy interface and wherein the statistical data includes a count of a number of instances that each of the keywords appears in a document located in the search; wherein the document analysis module transmits the statistical data to the interface module to be displayed; wherein the document analysis module transforms the count of the number of instances of each of the keywords into a normalized count; and
wherein the normalized count is calculated by dividing a total number of text characters in each document by five to provide a normalized word count, dividing the count of the number of instances of each of the keywords by the normalized word count to provide a density of each of the keywords, and multiplying the density of each of the keywords by a predetermined value to provide the normalized count;wherein the keyword entry interface allows entry of one or more keyword groups, and wherein each keyword group includes a plurality of keywords that are conceptually related to one another. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A method for conducting a search through a plurality of documents and for analyzing the documents located as a result of the search, wherein the search is directed to a predetermined subject matter, the method comprising:
-
using a program module stored on at least one of a non-transitory computer readable medium and a memory of a client device to locate at least one document from among a plurality of documents; communicating from the client device being to a network, and the network being in communication with a document provider database and a thesaurus database; receiving concept data relating to the subject matter of the search, the concept data including at least one concept, the concept including a plurality of keywords used to conduct the search; receiving a plurality of documents relating to the concept data from the document provider database; generating and displaying a document analysis graphical user interface, the document analysis graphical user interface comprising a keyword entry interface, a document relevancy interface, a document management interface, and a document image window; generating statistical data based on the at least one concept, the statistical data being used to assess relevancy of the at least one document located in the search so that the at least one document can be displayed using the document relevancy interface, wherein the statistical data includes a count of a number of instances that each of the keywords appears in a document located in the search; transforming the count of the number of instances of each of the keywords into a normalized count; and
wherein the normalized count is calculated by dividing a total number of text characters in each document by five to provide a normalized word count, dividing the count of the number of instances of each of the keywords by the normalized word count to provide a density of each of the keywords, and multiplying the density of each of the keywords by a predetermined value to provide the normalized count;transmitting the statistical data to be displayed; and wherein the keyword entry interface allows entry of one or more keyword groups, and wherein each keyword group includes a plurality of keywords that are conceptually related to one another. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45)
-
Specification