Query-independent entity importance in books
First Claim
1. A system for presenting entities mentioned in books, comprising:
- a non-transitory computer-readable storage medium storing executable computer program instructions comprising;
an entity importance engine for assigning query-independent importance scores to entities of an entity type mentioned by books in a corpus, the entity importance engine comprising a references module for identifying third party references to sections of a book in which an entity is mentioned, assigning importance values to the sections of the book, and calculating an importance score for the entity responsive at least in part to the importance values, wherein the importance values for sections of the book referenced by third parties are elevated relative to the importance values for sections of the book not referenced by third parties;
a search module for receiving a search query from a requestor and identifying a list of a plurality of books in the corpus that at least partially satisfy the query, and ranking the books in the list in an order based at least in part on the query-independent importance scores assigned to entities mentioned by the books; and
a presentation module for presenting the plurality of books to the requestor in the ranked order; and
a processor for executing the computer program instructions.
2 Assignments
0 Petitions
Accused Products
Abstract
A corpus contains information including text from books and metadata about the books. The book texts mention entities of various types, such as location, date, event, and person entities. An entity importance engine analyzes the information in the corpus to identify the entities mentioned therein, and ranks the entities using query-independent importance scores. The importance scores can be based in part on the contexts in which the entities are mentioned by the books. A presentation engine enables searching of the corpus to identify books satisfying a search query. The presentation engine presents information about a designated book. Entities mentioned in the book are presented in a style selected to enhance comprehension and utility. For example, location entities can be presented on a map with markers showing the locations of the entities. The number of entities presented are limited and ordered based on the query-independent importance scores.
93 Citations
25 Claims
-
1. A system for presenting entities mentioned in books, comprising:
a non-transitory computer-readable storage medium storing executable computer program instructions comprising; an entity importance engine for assigning query-independent importance scores to entities of an entity type mentioned by books in a corpus, the entity importance engine comprising a references module for identifying third party references to sections of a book in which an entity is mentioned, assigning importance values to the sections of the book, and calculating an importance score for the entity responsive at least in part to the importance values, wherein the importance values for sections of the book referenced by third parties are elevated relative to the importance values for sections of the book not referenced by third parties; a search module for receiving a search query from a requestor and identifying a list of a plurality of books in the corpus that at least partially satisfy the query, and ranking the books in the list in an order based at least in part on the query-independent importance scores assigned to entities mentioned by the books; and a presentation module for presenting the plurality of books to the requestor in the ranked order; and a processor for executing the computer program instructions. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
8. A computer program product stored on a non-transitory computer-readable storage medium having executable computer program instructions for presenting entities mentioned in books, comprising:
-
an entity importance engine for assigning query-independent importance scores to entities mentioned by books in a corpus, the entity importance engine comprising a library classification module for determining whether library classification data mention an entity in a book and calculating an importance score for the entity responsive at least in part to the library classification data, wherein the importance scores of entities mentioned in the library classification data are elevated relative to importance scores of entities not mentioned in the library classification data; a search module for receiving a search query from a requestor and identifying a list of a plurality of books in the corpus that at least partially satisfy the query, and ranking the books in the list in an order based at least in part on the query-independent importance scores assigned to entities mentioned by the books; and a presentation module for presenting the plurality of books to the requestor in the ranked order. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-implemented method of determining an importance score for an entity mentioned in a book, comprising:
-
identifying third-party references to sections of the book, the third party references external to the book; assigning importance values to the sections of the book, wherein the importance values for sections of the book referenced by third parties are elevated relative to the importance values for sections of the book not referenced by third parties; identifying one or more sections of the book that mention the entity; and calculating the importance score for the entity responsive at least in part to the importance values of the one or more sections of the book that mention the entity, the importance score indicating an importance of the entity relative to other entities mentioned in the book. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A computer-implemented method of determining an importance score for an entity mentioned in a book, comprising:
-
identifying library classification data indicating a classification of the book; determining whether the library classification data mention entities from the book; assigning query-independent importance scores to the entities of an entity type mentioned in the book, wherein the importance scores of entities mentioned in the library classification data are elevated relative to importance scores of entities not mentioned in the library classification data; selecting a subset of the plurality of entities responsive to the entities'"'"' importance scores and entity type; and presenting the selected subset of entities to a user. - View Dependent Claims (23, 24, 25)
-
Specification