Graphical models for representing text documents for computer analysis
First Claim
Patent Images
1. A method, comprising:
- receiving a document including a plurality of ordered words;
creating a graph data structure for the document, wherein the graph data structure includes a plurality of nodes and edges, each node representing a distinct word in the document and each edge identifying a number of times two nodes occur within at least one word from each other;
storing the graph data structure in an information repository;
receiving a request to perform text analysis on the document; and
performing text analysis on the graph data structure and providing a result that is responsive to the request.
1 Assignment
0 Petitions
Accused Products
Abstract
In a method for representing a text document with a graphical model, a document including a plurality of ordered words is received and a graph data structure for the document is created. The graph data structure includes a plurality of nodes and edges, with each node representing a distinct word in the document and each edge identifying a number of times two nodes occur within a predetermined distance from each other. The graph data structure is stored in an information repository.
-
Citations
9 Claims
-
1. A method, comprising:
-
receiving a document including a plurality of ordered words; creating a graph data structure for the document, wherein the graph data structure includes a plurality of nodes and edges, each node representing a distinct word in the document and each edge identifying a number of times two nodes occur within at least one word from each other; storing the graph data structure in an information repository; receiving a request to perform text analysis on the document; and performing text analysis on the graph data structure and providing a result that is responsive to the request. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program product, comprising:
-
a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising; computer readable program code configured to receive a document including a plurality of ordered words; and computer readable program code configured to create a graph data structure for the document, wherein the graph data structure includes a plurality of nodes and edges, each node representing a distinct word in the document and each edge identifying a number of times two nodes occur within at least one word from each other. - View Dependent Claims (7, 8, 9)
-
Specification