System and method for generating summary of research document

US 11,157,538 B2
Filed: 09/28/2018
Issued: 10/26/2021
Est. Priority Date: 04/30/2018
Status: Active Grant

First Claim

Patent Images

1. A system for generating a summary of at least one research document, the system comprising:

a computing device associated with an entity, wherein the computing device, comprises a computer readable program code, configured to;

upload the at least one research document,acquire informatory data related to the at least one research document, andpreprocess the at least one research document to extract information included;

a data repository comprising an ontological database and a synonym database; and

a server arrangement communicably coupled via one or more data communication networks with the computing device and the data repository, the server arrangement configured to;

acquire, from the computing device, the information included in the at least one research document,analyze, the information using the ontological database and the synonym database to identify a set of keywords corresponding to the at least one research document,assign a first score to each of the keywords based on a document-centric property, the informatory data and a popularity index of each of the keyword, wherein the popularity index of each of the keyword is a metric for quantifying number of times the keyword is included in a web-activity, and wherein the document-centric property of a keyword includes at least one of;

a location of the keyword in the at least one research document, an occurrence-frequency of the keyword in the at least one research document,assign a second score to one or more relationships between the keywords based on a relationship-centric property, wherein assigning the second score to one or more relationships between the keywords, based on the relationship-centric property, includes;

identifying one or more relationships between the keywords,identifying semantics of the one or more relationships in the at least one research document, andanalyzing world knowledge to determine a cognizance-index of the semantics of each of the one or more relationships, wherein the cognizance-index denotes an awareness of the one or more relationships, andgenerate the summary for the at least one research document, wherein the summary comprises;

a first portion generated based upon the informatory data,a second portion generated based on the keywords in the one or more relationships having the second score below a predefined threshold, anda third portion generated, using a machine learning algorithm, based on the first score of the keywords.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Disclosed is a system for generating summary of at least one research document. The system comprising computing device associated with an entity, data repository comprising ontological database and synonym database and server arrangement communicably coupled via one or more data communication networks with the computing device and the data repository. The server arrangement is configured to acquire information included in at least one research document, analyze information using ontological database and synonym database to identify set of keywords corresponding to at least one research document, assign first score to each of the keywords based on document-centric property, assign second score to one or more relationships between the keywords based on relationship-centric property and generate summary for at least one research document. Summary comprises: first portion generated based upon informatory data, second portion generated based on keywords and third portion generated using machine learning algorithm based on first score of keywords.

12 Citations

9 Claims

1. A system for generating a summary of at least one research document, the system comprising:
- a computing device associated with an entity, wherein the computing device, comprises a computer readable program code, configured to;
  
  upload the at least one research document,acquire informatory data related to the at least one research document, andpreprocess the at least one research document to extract information included;
  
  a data repository comprising an ontological database and a synonym database; and
  
  a server arrangement communicably coupled via one or more data communication networks with the computing device and the data repository, the server arrangement configured to;
  
  acquire, from the computing device, the information included in the at least one research document,analyze, the information using the ontological database and the synonym database to identify a set of keywords corresponding to the at least one research document,assign a first score to each of the keywords based on a document-centric property, the informatory data and a popularity index of each of the keyword, wherein the popularity index of each of the keyword is a metric for quantifying number of times the keyword is included in a web-activity, and wherein the document-centric property of a keyword includes at least one of;
  
  a location of the keyword in the at least one research document, an occurrence-frequency of the keyword in the at least one research document,assign a second score to one or more relationships between the keywords based on a relationship-centric property, wherein assigning the second score to one or more relationships between the keywords, based on the relationship-centric property, includes;
  
  identifying one or more relationships between the keywords,identifying semantics of the one or more relationships in the at least one research document, andanalyzing world knowledge to determine a cognizance-index of the semantics of each of the one or more relationships, wherein the cognizance-index denotes an awareness of the one or more relationships, andgenerate the summary for the at least one research document, wherein the summary comprises;
  
  a first portion generated based upon the informatory data,a second portion generated based on the keywords in the one or more relationships having the second score below a predefined threshold, anda third portion generated, using a machine learning algorithm, based on the first score of the keywords.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The system according to the claim 1, wherein preprocessing includes extracting entire content pertaining to the at least one research document.
  - 3. The system according to the claim 1, wherein preprocessing includes extracting selective content pertaining to the at least one research document.
  - 4. The system according to the claim 1, wherein the informatory data includes:
    - metadata related to the at least one research document,hypotheses of the at least one research document, andstatistical significance of the hypotheses.
  - 5. The system according to the claim 1, wherein the machine learning algorithm is implemented as a natural language generator.

6. A method for generating a summary of at least one research document, wherein the method is implemented using a system comprising:
- a computing device associated with an entity, wherein the computing device, comprises a computer readable program code, configured to;
  
  upload the at least one research document,acquire informatory data related to the at least one research document, andpreprocess the at least one research document to extract information;
  
  a data repository comprising an ontological database and a synonym database; and
  
  a server arrangement communicably coupled via one or more data communication networks with the computing device and the data repository, wherein the method comprises;
  
  acquiring, from the computing device, the information included in the at least one research document,analyzing, the information using the ontological database and the synonym database to identify a set of keywords corresponding to the at least one research document,assigning a first score to each of the keywords based on a document-centric property, the informatory data and a popularity index of each of the keyword, wherein the popularity index of each of the keyword is a metric for quantifying number of times the keyword is included in a web-activity, and wherein the document-centric property of a keyword includes at least one of;
  
  a location of the keyword in the at least one research document, an occurrence-frequency of the keyword in the at least one research document,assigning a second score to one or more relationships between the keywords based on a relationship-centric property, wherein assigning the second score to one or more relationships between the keywords, based on the relationship-centric property, includes;
  
  identifying one or more relationships between the keywords,identifying semantics of the one or more relationships in the at least one research document, andanalyzing world knowledge to determine a cognizance-index of the semantics of each of the one or more relationship, wherein the cognizance-index denotes an awareness of the one or more relationships, andgenerating the summary for the at least one research document, wherein the summary comprises;
  
  a first portion generated based upon the informatory data,a second portion generated based on the keywords in the one or more relationships having the second score below a predefined threshold, anda third portion generated, using a machine learning algorithm, based on the first score of the keywords.
- View Dependent Claims (7, 8, 9)
- - 7. The method according to the claim 6, wherein preprocessing includes extracting entire content pertaining to the at least one research document.
  - 8. The method according to the claim 6, wherein preprocessing includes extracting selective content pertaining to the at least one research document.
  - 9. The method according to the claim 6, wherein the informatory data includes:
    - metadata related to the at least one research document,hypotheses of the at least one research document, andstatistical significance of the hypotheses.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Innoplexus AG
Original Assignee
Innoplexus AG
Inventors
Keskar, Abhijit
Primary Examiner(s)
Kuddus, Daniel A

Application Number

US16/145,670
Publication Number

US 20190332719A1
Time in Patent Office

1,124 Days
Field of Search
US Class Current
CPC Class Codes

G06F 16/313   Selection or weighting of t...

G06F 16/345   Summarisation for human users

G06F 16/38   Retrieval characterised by ...

G06F 40/247   Thesauruses; Synonyms

G06F 40/30   Semantic analysis

G06N 20/00   Machine learning

G06N 5/022   Knowledge engineering; Know...

G06N 5/048   Fuzzy inferencing

System and method for generating summary of research document

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

12 Citations

9 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for generating summary of research document

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

12 Citations

9 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links