Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects

US 9,075,849 B2
Filed: 07/22/2014
Issued: 07/07/2015
Est. Priority Date: 09/27/2005
Status: Active Grant

First Claim

Patent Images

1. A computerized search engine for identifying and ranking relevant documents from a corpus of citationally-related documents, said computerized search engine comprising:

an input interface that enables a user to select a first set of identification information identifying one or more input documents from said corpus of citationally-related documents;

a computer-accessible index stored in a computer-readable storage device, said computer-accessible index comprising identification information identifying each potential input document from said corpus of citationally-related documents and, for each said potential input document, identification information identifying a selected number of citationally-related potential output documents from said corpus of citationally-related documents, said computer-accessible index further comprising for each pair of citationally-related potential input document and potential output document a first numerical score that is statistically correlated to the probability that a direct citation exists between each said pair of citationally related documents and wherein said first numerical score is calculated based at least in part on how many indirect citations exist between each said pair of citationally related documents and, for each indirect citation, how many citation links separate each said pair of citationally-related documents;

a computer processor configured to execute instructions stored in a computer-readable storage device, said instructions configured to cause said computer processor to;

access, from said computer-accessible index, a second set of identification information identifying one or more output documents corresponding to each of said one or more input documents and, for each identified pair of citationally-related input document and output document, said corresponding first numerical score; and

calculate, for each identified output document, a second numerical score that is statistically correlated to the probability that a direct citation exists between any of said one or more input documents and each said identified output document, and wherein said second numerical score is calculated based at least in part on said first numerical score; and

an output interface to display search results comprising identification information corresponding to said one or more output documents and wherein said search results are ranked in accordance with said second numerical score.

View all claims

1 Assignment

Timeline View

Assignment View

1 Petition

Accused Products

Abstract

In one embodiment a method for probabilistically quantifying a degree of relevance between two or more citationally or contextually related data objects, such as patent documents, non-patent documents, web pages, personal and corporate contacts information, product information, consumer to behavior, technical or scientific information, address information, and the like is provided. In another embodiment a method for visualizing and displaying relevance between two or more citationally or contextually related data objects is provided. In another embodiment a search input/output interface that utilizes an iterative self-organizing mapping technique to automatically generate a visual map of relevant patents and/or other related documents desired to be explored, searched or analyzed is provided. In another embodiment, a search input/output interface that displays and/or communicates search input criteria and corresponding search results in a way that facilitates intuitive understanding and visualization of the logical relationships between two or more related concepts being searched is provided.

Citations

20 Claims

1. A computerized search engine for identifying and ranking relevant documents from a corpus of citationally-related documents, said computerized search engine comprising:
- an input interface that enables a user to select a first set of identification information identifying one or more input documents from said corpus of citationally-related documents;
  
  a computer-accessible index stored in a computer-readable storage device, said computer-accessible index comprising identification information identifying each potential input document from said corpus of citationally-related documents and, for each said potential input document, identification information identifying a selected number of citationally-related potential output documents from said corpus of citationally-related documents, said computer-accessible index further comprising for each pair of citationally-related potential input document and potential output document a first numerical score that is statistically correlated to the probability that a direct citation exists between each said pair of citationally related documents and wherein said first numerical score is calculated based at least in part on how many indirect citations exist between each said pair of citationally related documents and, for each indirect citation, how many citation links separate each said pair of citationally-related documents;
  
  a computer processor configured to execute instructions stored in a computer-readable storage device, said instructions configured to cause said computer processor to;
  
  access, from said computer-accessible index, a second set of identification information identifying one or more output documents corresponding to each of said one or more input documents and, for each identified pair of citationally-related input document and output document, said corresponding first numerical score; and
  
  calculate, for each identified output document, a second numerical score that is statistically correlated to the probability that a direct citation exists between any of said one or more input documents and each said identified output document, and wherein said second numerical score is calculated based at least in part on said first numerical score; and
  
  an output interface to display search results comprising identification information corresponding to said one or more output documents and wherein said search results are ranked in accordance with said second numerical score.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. The computerized search engine of claim 1 wherein said computer-accessible index comprises, for each said potential input document, identification information identifying each citationally-related potential output document extending at least three generations and not more than five generations from each said potential input document.
  - 3. The computerized search engine of claim 1 wherein said instructions are configured to cause said computer processor to calculate said second numerical score for each said identified output document by calculating the mathematical sum of said first numerical score for each corresponding identified pair of citationally-related input document and output document.
  - 4. The computerized search engine of claim 1 wherein said instructions are configured to cause said computer processor to calculate said second numerical score for each said identified output document by calculating the statistical probability that each said identified output document is citationally related at the first generation to said one or more input documents.
  - 5. The computerized search engine of claim 1 wherein said input interface and/or said output interface enable said user to select said first set of identification information at least in part from said search results.
  - 6. The computerized search engine of claim 1 wherein said output interface visually displays said search results in the form of an interactive chart or graph.
  - 7. The computerized search engine of claim 1 wherein said output interface visually displays said search results in the form of an interactive self-organizing map.

8. A computer system for enabling a user to execute one or more search queries to identify and rank relevant documents from a corpus of citationally-related documents, said computer system comprising:
- an input interface that enables said user to select a first set of identification information identifying one or more input documents from said corpus of citationally-related documents;
  
  a computer-accessible index stored in a computer-readable storage device, said computer-accessible index comprising identification information identifying each potential input document from said corpus of citationally-related documents and, for each said potential input document, identification information identifying a selected number of citationally-related potential output documents from said corpus of citationally-related documents, said computer-accessible index further comprising for each pair of citationally-related potential input document and potential output document a first numerical score configured to have a statistical correlation to whether a direct citation exists between said corresponding pair of citationally-related documents and wherein said first numerical score is calculated based at least in part on how many indirect citations exist between each said pair of citationally related documents and, for each indirect citation, how many citation links separate each said pair of citationally-related documents;
  
  a computer processor configured to execute instructions stored in a computer-readable storage device, said instructions configured to cause said computer processor to;
  
  use said first set of identification information to ascertain, from said computer-accessible index, a second set of identification information identifying, for each of said one or more input documents, a selected number of citationally-related output documents and, for each pair of citationally-related input document and output document, said first numerical score; and
  
  calculate, for each of said citationally-related output documents, a second numerical score configured to have a statistical correlation to whether a direct citation exists between any of said one or more input documents and each of said citationally-related output documents, and wherein said second numerical score is calculated based at least in part on said first numerical score; and
  
  an output interface to present search query results comprising a third set of identification information identifying one or more of said citationally-related output documents and wherein said search query results are sorted and displayed in accordance with said second numerical score.
- View Dependent Claims (9, 10, 11, 12, 13, 14)
- - 9. The computer system of claim 8 wherein said computer-accessible index comprises, for each said potential input document, identification information identifying each citationally-related potential output document extending at least three generations and not more than five generations from each said potential input document.
  - 10. The computer system of claim 8 wherein said instructions are configured to cause said computer processor to calculate said second numerical score for each of said citationally-related output documents by calculating the mathematical sum of said first numerical score for each corresponding pair of citationally-related input document and output document.
  - 11. The computer system of claim 8 wherein said instructions are configured to cause said computer processor to calculate said second numerical score for each of said citationally-related output documents by calculating the statistical probability that each of said citationally-related output documents is citationally related at the first generation to said one or more input documents.
  - 12. The computer system of claim 8 wherein said input interface and said output interface enable said user to select said first set of identification information at least in part from said search query results comprising said third set of identification information.
  - 13. The computer system of claim 8 wherein said output interface presents said search query results in the form of an interactive chart or graph.
  - 14. The computer system of claim 8 wherein said output presents said search query results in the form of an interactive self-organizing map.

15. A computer-implemented method for identifying and ranking relevant documents from a corpus of citationally-related documents, said computer-implemented method comprising:
- under control of a computing device configured with specific computer-executable instructions;
  
  receiving a first set of identification information identifying one or more input documents from said corpus of citationally-related documents;
  
  using said first set of identification information to ascertain, from a computer-accessible index;
  
  i) a second set of identification information identifying, for each of said one or more input documents, a selected number of citationally-related output documents, and ii) for each identified pair of citationally-related input document and output document, a first numerical score having a statistical correlation to whether a direct citation exists between each said identified pair of citationally-related documents,said computer-accessible index comprising;
  
  i) identification information identifying each potential input document from said corpus of citationally-related documents, ii) identification information identifying, for each said potential input document, a selected number of citationally-related potential output documents from said corpus of citationally-related documents, and iii) said first numerical score pre-calculated for each potential pair of citationally-related potential input document and potential output document and wherein said first numerical score is calculated based at least in part on how many indirect citations exist between each said potential pair of citationally related documents and, for each indirect citation, how many citation links separate each said potential pair of citationally-related documents;
  
  calculating, for each of said citationally-related output documents, a second numerical score configured to have a statistical correlation to whether a direct citation exists between any of said one or more input documents and each of said citationally-related output documents, and wherein said second numerical score is calculated based at least in part on said first numerical score; and
  
  displaying a search query result comprising a third set of identification information identifying one or more of said citationally-related output documents and wherein said search query results are sorted in accordance with each said corresponding second numerical score.
- View Dependent Claims (16, 17, 18, 19, 20)
- - 16. The computer-implemented method of claim 15 wherein said computer-accessible index comprises, for each said potential input document, identification information identifying each citationally-related potential output document extending at least three generations and no more than five generations from each said potential input document.
  - 17. The computer-implemented method of claim 15 wherein calculating said second numerical score comprises calculating, for each output document, the mathematical sum of said first numerical score for each corresponding identified pair of citationally-related input document and output document.
  - 18. The computer-implemented method of claim 15 wherein calculating said second numerical score comprises calculating, for each output document, the statistical probability that each said output document is citationally related at the first generation to at least one of said one or more input documents.
  - 19. The computer-implemented method of claim 15 further comprising selecting said first set of identification information at least in part from said displayed search query results comprising said third set of identification information.
  - 20. The computer-implemented method of claim 15 further comprising visually displaying said search query results in the form of an interactive chart, graph, or map.

Specification

Resources

Litigation Campaign Assessment

Litigation Data

Current Assignee
PatentRatings, LLC
Original Assignee
PatentRatings, LLC
Inventors
Barney, Jonathan A.
Primary Examiner(s)
Jami, Hares

Application Number

US14/338,208
Publication Number

US 20150046420A1
Time in Patent Office

350 Days
Field of Search

707/705, 707/722, 707/726, 707/728, 707/731, 707/923, 707/930, 707/933, 707/937, 707/999.1
US Class Current

1/1
CPC Class Codes

G06F 16/14   Details of searching files ...

G06F 16/2228   Indexing structures

G06F 16/24578   using ranking

G06F 16/2465   Query processing support fo...

G06F 16/248   Presentation of query results

G06F 16/26   Visual data mining; Browsin...

G06F 16/334   Query execution G06F16/335 ...

G06F 16/3346   using probabilistic model

G06F 16/34   Browsing; Visualisation the...

G06F 16/382   using citations hypermedia ...

G06F 16/93   Document management systems

G06F 16/95   Retrieval from the web

G06F 16/951   Indexing; Web crawling tech...

G06F 2216/11   Patent retrieval

Y10S 707/912   Applications of a database

Y10S 707/923   Intellectual property

Y10S 707/93   intellectual property analysis

Y10S 707/933   Citation analysis

Y10S 707/937   intellectual property searc...

Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects

First Claim

1 Assignment

1 Petition

Accused Products

Abstract

Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

Method and system for probabilistically quantifying and visualizing relevance between two or more citationally or contextually related data objects

First Claim

1 Assignment

Subscription Required

Subscription Required

1 Petition

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links