DATA PROVENANCE SYSTEM
First Claim
Patent Images
1. A method comprising:
- accessing an electronic artifact comprising content of a particular type of media;
determining text corresponding to the content;
performing natural language processing on the text to identify at least a subset of words in a statement within the text and determine meanings of each word in the subset of words; and
generate a context image for the electronic artifact based on the natural language processing, wherein the context image comprises a graph comprising nodes corresponding to the subset of words and the context image defines relationships between the subset of words.
1 Assignment
0 Petitions
Accused Products
Abstract
An electronic artifact is accessed which includes content of a particular type of media. Text is determined corresponding to the content and natural language processing is performed on the text to identify at least a subset of words in a statement within the text and determine meanings of each word in the subset of words. A context image is generated for the electronic artifact based on the natural language processing, where the context image includes a graph including nodes corresponding to the subset of words and the context image defines relationships between the subset of words.
4 Citations
20 Claims
-
1. A method comprising:
-
accessing an electronic artifact comprising content of a particular type of media; determining text corresponding to the content; performing natural language processing on the text to identify at least a subset of words in a statement within the text and determine meanings of each word in the subset of words; and generate a context image for the electronic artifact based on the natural language processing, wherein the context image comprises a graph comprising nodes corresponding to the subset of words and the context image defines relationships between the subset of words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product comprising a computer readable storage medium comprising computer readable program code embodied therewith, the computer readable program code comprising:
-
computer readable program code configured to identify digital media of a particular type; computer readable program code configured to determine text statements from content of the digital media; computer readable program code configured to perform natural language processing on the text statements to; identify a first word in a particular one of the text statements as a key term in the particular text statement, wherein the key term represents a topic of the particular text statement; and identify a set of second words in the particular text statement representing attributes of the topic; computer readable program code configured to generate a context image for the statement, wherein the context image comprises a graph comprising nodes corresponding to the first word and the set of second words and defining relationships between the nodes to indicate that the set of second words represent attributes of the topic represented by the first word; and computer readable program code configured to determine a similarity score for the particular text statement based on a comparison of the context image with a plurality of other context images generated from other digital media.
-
-
18. A system comprising:
-
a data processing apparatus; a memory element storing data comprising an electronic artifact; a text extractor, executable by the data processing apparatus to determine a text statement from content of the electronic artifact; a natural language processor, executable by the data processing apparatus to assess the text statement to; determine meanings of a set of words included in the text statement; identify a first word in the set of words as a key term in the text statement, wherein the key term represents a topic of the text statement; and identify a set of second words in the text statement representing attributes of the topic; and a context image generator, executable by the data processing apparatus to generate a context image for the text statement, wherein the context image comprises a graph comprising nodes corresponding to the first word and the set of second words and defining relationships between the nodes to indicate that the set of second words represent attributes of the topic represented by the first word. - View Dependent Claims (19, 20)
-
Specification