FAST TITLE/SUMMARY EXTRACTION FROM LONG DESCRIPTIONS
First Claim
1. A method comprising:
- receiving one or more sentences;
based on the one or more sentences, generating a tree comprising a plurality of nodes representing the one or more sentences;
determining a first set of one or more nodes to remove from the tree;
wherein said determining is based, at least in part, on which nodes in the tree comprises a recognized named entity;
based on said determining, removing said first set of one or more nodes of the plurality of nodes from the tree;
after said removing, reconstructing, from remaining nodes of the plurality of nodes, a first set of one or more sub-sentences of said one or more sentences, wherein the first set of one or more sub-sentences are shorter in length than said one or more sentences;
wherein the method is performed by one or more special-purpose computing devices.
6 Assignments
0 Petitions
Accused Products
Abstract
Techniques are described herein for automatic generation of a title or summary from a long body of text. A grammatical tree representing one or more sentences of the long body of text is generated. One or more nodes from the grammatical tree are selected to be removed. According to one embodiment, a particular node is selected to be removed based on its position in the grammatical tree and its node-type, where the node type represents a grammatical element of the sentence. Once the particular node is selected, a branch of the tree is cut at the node. After branch has been cut, one or more sub-sentences are generated from the remaining nodes in the grammatical tree. The one or more sub-sentences may be returned as a title or summary.
64 Citations
20 Claims
-
1. A method comprising:
-
receiving one or more sentences; based on the one or more sentences, generating a tree comprising a plurality of nodes representing the one or more sentences; determining a first set of one or more nodes to remove from the tree; wherein said determining is based, at least in part, on which nodes in the tree comprises a recognized named entity; based on said determining, removing said first set of one or more nodes of the plurality of nodes from the tree; after said removing, reconstructing, from remaining nodes of the plurality of nodes, a first set of one or more sub-sentences of said one or more sentences, wherein the first set of one or more sub-sentences are shorter in length than said one or more sentences; wherein the method is performed by one or more special-purpose computing devices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15)
-
-
11. A non-transitory computer-readable storage medium that stores instructions which, when executed by one or more processors, cause the one or more processors to perform steps comprising:
-
receiving one or more sentences; based on the one or more sentences, generating a tree comprising a plurality of nodes representing the one or more sentences; determining a first set of one or more nodes to remove from the tree; wherein said determining is based, at least in part, on which nodes in the tree comprises a recognized named entity; based on said determining, removing said first set of one or more nodes of the plurality of nodes from the tree; after said removing, reconstructing, from remaining nodes of the plurality of nodes, a first set of one or more sub-sentences of said one or more sentences, wherein the first set of one or more sub-sentences are shorter in length than said one or more sentences; wherein the method is performed by one or more special-purpose computing devices. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification