Human language analyzer for detecting clauses, clause types, and clause relationships
First Claim
1. A computer-program product tangibly embodied in a non-transitory machine-readable storage medium, the computer-program product including instructions operable to cause a clause analyzer to:
- receive a clause request, at the clause analyzer, to locate clauses within text data representing information in a human language, wherein the text data comprises a plurality of clauses of a sentence portion;
generate, responsive to the clause request, a dependency request to a dependency parser to determine dependency information for a plurality of tokens of the plurality of clauses of the sentence portion, wherein each token of the plurality of tokens comprises one or more characters of the text data, and the dependency information indicates a dependency of a respective token of the sentence portion on at least one other token of the sentence portion;
receive, responsive to the dependency request, token information in a token data set, wherein the token information comprises one or more token identifiers of identified tokens in the sentence portion of the text data, and wherein the token information comprises dependency information indicating a dependency of a respective token of the identified tokens on at least one other token of the sentence portion;
determine a location for each of the plurality of clauses of the sentence portion in a hierarchy of clauses by associating to each clause in the sentence portion;
one of a plurality of levels of the hierarchy of clauses; and
any respective connection to a clause associated to a different level of the plurality of levels;
generate a new data set based on the token data set and the hierarchy of clauses, wherein the new data set comprises information representing the token information, and the new data set comprises one or more location identifiers indicating a location of a respective token of the token information according to a location, in the hierarchy of clauses, of a clause comprising the respective token of the token information; and
output the new data set.
1 Assignment
0 Petitions
Accused Products
Abstract
A human language analyzer receives, at the human language analyzer, text data representing information in a human language. The human language analyzer receives a computer command for identifying a text data component of the text data. The computer command comprises at least two requirements for the text data component. The human language analyzer, responsive to identifying that the first requirement and the second requirement is met, locates the text data component from one of two clauses. A clause analyzer receives a clause request to locate clauses within text data representing information in a human language. The clause analyzer receives, responsive to a dependency request, token information in a token data set. The clause analyzer determines a location for each clause of the sentence portion in a hierarchy of clauses. The clause analyzer generates and outputs a new data set based on the token data set and the hierarchy of clauses.
54 Citations
30 Claims
-
1. A computer-program product tangibly embodied in a non-transitory machine-readable storage medium, the computer-program product including instructions operable to cause a clause analyzer to:
-
receive a clause request, at the clause analyzer, to locate clauses within text data representing information in a human language, wherein the text data comprises a plurality of clauses of a sentence portion; generate, responsive to the clause request, a dependency request to a dependency parser to determine dependency information for a plurality of tokens of the plurality of clauses of the sentence portion, wherein each token of the plurality of tokens comprises one or more characters of the text data, and the dependency information indicates a dependency of a respective token of the sentence portion on at least one other token of the sentence portion; receive, responsive to the dependency request, token information in a token data set, wherein the token information comprises one or more token identifiers of identified tokens in the sentence portion of the text data, and wherein the token information comprises dependency information indicating a dependency of a respective token of the identified tokens on at least one other token of the sentence portion; determine a location for each of the plurality of clauses of the sentence portion in a hierarchy of clauses by associating to each clause in the sentence portion; one of a plurality of levels of the hierarchy of clauses; and any respective connection to a clause associated to a different level of the plurality of levels; generate a new data set based on the token data set and the hierarchy of clauses, wherein the new data set comprises information representing the token information, and the new data set comprises one or more location identifiers indicating a location of a respective token of the token information according to a location, in the hierarchy of clauses, of a clause comprising the respective token of the token information; and output the new data set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer-implemented method comprising:
-
receiving a clause request, at a clause analyzer, to locate clauses within text data representing information in a human language, wherein the text data comprises a plurality of clauses of a sentence portion; generating, responsive to the clause request, a dependency request to a dependency parser to determine dependency information for a plurality of tokens of the plurality of clauses of the sentence portion, wherein each token of the plurality of tokens comprises one or more characters of the text data, and the dependency information indicates a dependency of a respective token of the sentence portion on at least one other token of the sentence portion; receiving, responsive to the dependency request, token information in a token data set, wherein the token information comprises one or more token identifiers of identified tokens in the sentence portion of the text data, and wherein the token information comprises dependency information indicating a dependency of a respective token of the identified tokens on at least one other token of the sentence portion; determining a location for each of the plurality of clauses of the sentence portion in a hierarchy of clauses by associating to each clause in the sentence portion; one of a plurality of levels of the hierarchy of clauses; and any respective connection to a clause associated to a different level of the plurality of levels; generating a new data set based on the token data set and the hierarchy of clauses, wherein the new data set comprises information representing the token information, and the new data set comprises one or more location identifiers indicating a location of a respective token of the token information according to a location, in the hierarchy of clauses, of a clause comprising the respective token of the token information; and outputting the new data set. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A clause analyzer comprising processor and memory, the memory containing instructions executable by the processor wherein the clause analyzer is configured to:
-
receive a clause request, at the clause analyzer, to locate clauses within text data representing information in a human language, wherein the text data comprises a plurality of clauses of a sentence portion; generate, responsive to the clause request, a dependency request to a dependency parser to determine dependency information for a plurality of tokens of the plurality of clauses of the sentence portion, wherein each token of the plurality of tokens comprises one or more characters of the text data, and the dependency information indicates a dependency of a respective token of the sentence portion on at least one other token of the sentence portion; receive, responsive to the dependency request, token information in a token data set, wherein the token information comprises one or more token identifiers of identified tokens in the sentence portion of the text data, and wherein the token information comprises dependency information indicating a dependency of a respective token of the identified tokens on at least one other token of the sentence portion; determine a location for each of the plurality of clauses of the sentence portion in a hierarchy of clauses by associating to each clause in the sentence portion; one of a plurality of levels of the hierarchy of clauses; and any respective connection to a clause associated to a different level of the plurality of levels; generate a new data set based on the token data set and the hierarchy of clauses, wherein the new data set comprises information representing the token information, and the new data set comprises one or more location identifiers indicating a location of a respective token of the token information according to a location, in the hierarchy of clauses, of a clause comprising the respective token of the token information; and output the new data set.
-
Specification