NLP-based systems and methods for providing quotations
First Claim
1. A method in a content recommendation system, the method comprising:
- under control of a computing system having a computer processor, using the computer processor to provide quotation information by automatically,under control of the computer processor, extracting a quotation from a text document in a corpus of text documents indexed by the computing system;
under control of the computer processor, identifying one or more entities that are referenced by the text document, each of the determined entities being electronically represented by the content recommendation system, wherein identifying the one or more entities includes linking together multiple mentions of a same entity across the text document, the linking including resolving pronoun coreference to identify an entity that is a speaker of the quotation;
under control of the computer processor, attributing the quotation to the speaker of the quotation by storing data that associates the quotation with the identified entity that is the speaker of the quotation; and
under control of the computer processor, providing the quotation by transmitting text that represents the quotation and the attributed speaker.
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques for providing quotations obtained from text documents using natural language processing techniques are described. Some embodiments provide a content recommendation system (“CRS”) configured to provide quotations by extracting quotations from a corpus text documents, and providing access to the extracted quotations in response to search requests received from users. The CRS may extract quotations by using natural language processing-based techniques to identify one or more entities, such as people, places, objects, concepts, or the like, that are referenced by the extracted quotations. The CRS may then store the extracted quotations along with identified entities, such as quotation speakers and subjects, for later access via search requests.
-
Citations
30 Claims
-
1. A method in a content recommendation system, the method comprising:
-
under control of a computing system having a computer processor, using the computer processor to provide quotation information by automatically, under control of the computer processor, extracting a quotation from a text document in a corpus of text documents indexed by the computing system; under control of the computer processor, identifying one or more entities that are referenced by the text document, each of the determined entities being electronically represented by the content recommendation system, wherein identifying the one or more entities includes linking together multiple mentions of a same entity across the text document, the linking including resolving pronoun coreference to identify an entity that is a speaker of the quotation; under control of the computer processor, attributing the quotation to the speaker of the quotation by storing data that associates the quotation with the identified entity that is the speaker of the quotation; and under control of the computer processor, providing the quotation by transmitting text that represents the quotation and the attributed speaker. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A computing system configured to recommend content, comprising:
-
a memory; a module stored on the memory that is configured, when executed, to; extract a quotation from a text document in a corpus of text documents; identify one or more entities that are referenced by the text document, each of the determined entities being electronically represented by the content recommendation system, wherein identifying the one or more entities includes linking together multiple mentions of a same entity across the text document, the linking including resolving pronoun coreference to identify an entity that is a speaker of the quotation; attributing the quotation to the speaker of the quotation by storing data that associates the quotation with the identified entity that is the speaker of the quotation; and provide the quotation by transmitting text that represents the quotation and the attributed speaker. - View Dependent Claims (25, 26, 27)
-
-
28. A non-transitory computer-readable medium including:
contents that, when executed, cause a computing system to recommend content, by performing a method comprising; extracting a quotation from a text document in a corpus of text documents; identifying one or more entities that are referenced by the text document, each of the determined entities being electronically represented by the content recommendation system, wherein identifying the one or more entities includes linking together multiple mentions of a same entity across the text document, the linking including resolving pronoun coreference to identify an entity that is a speaker of the quotation; attributing the quotation to the speaker of the quotation by storing data that associates the quotation with the identified entity that is the speaker of the quotation; and providing the quotation by transmitting text that represents the quotation and the attributed speaker. - View Dependent Claims (29, 30)
Specification