System and method for creating and editing, an on-line publication
First Claim
1. A method of creating a dynamic representation from data received from an information source, comprising the steps of:
- storing the data received from the information source;
identifying at least one text instance in the data using a text processing module;
finding one or more images in an image database module relating to the at least one text instance;
selecting at least one image from the one or more found images to display with the data, comprising the steps of;
identifying a number of unique text instances with associated images in the data;
comparing the number of unique text instances with a minimum number and terminating the method if the number of unique text instances is less than the minimum number;
storing the number of unique text instances in a temporary structure;
ranking the unique text instances by their frequency in the data;
processing paragraphs of the data beginning with the paragraph having the most unique text instances;
indicating in a document object that an image corresponding to the most frequent unique text instance in a paragraph will be displayed in the dynamic representation of the paragraph; and
grouping paragraphs into sections, wherein two consecutive paragraphs are grouped together if one paragraph has a unique text instance and an image link so that each section has a unique text instance and an image to display; and
generating the dynamic representation of the data from the at least one selected image and the data.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method automatically generate an on-line document from raw text into an engaging, interactive form for a plurality of viewers. Unstructured articles are read from an information feed. A computation process extracts and tags proper names of people, products, organizations, and places and categorizes them. An image database is used to link these proper names with image files. The image database consists of a series of attribute-value pairs for active searching of names. A URL query string is input to the database to extract the location of the image in the database file system. An Extensible Markup Language (XML) file is created from the raw text of the article, the list of proper names in the processed data and the image file references. The XML file is stored in a file system. An Extensible Stylesheet Language (XSL) file provides templates containing computational relationships between the text and images. The XML and XSL style sheets are combined to generate a Hypertext Markup Language (HTML) file containing an on-line story of the unstructured articles in a Java Applet which allows the system to provide a variety of interactive behaviors for a final presentation available by a viewer from a browser.
43 Citations
16 Claims
-
1. A method of creating a dynamic representation from data received from an information source, comprising the steps of:
-
storing the data received from the information source;
identifying at least one text instance in the data using a text processing module;
finding one or more images in an image database module relating to the at least one text instance;
selecting at least one image from the one or more found images to display with the data, comprising the steps of;
identifying a number of unique text instances with associated images in the data;
comparing the number of unique text instances with a minimum number and terminating the method if the number of unique text instances is less than the minimum number;
storing the number of unique text instances in a temporary structure;
ranking the unique text instances by their frequency in the data;
processing paragraphs of the data beginning with the paragraph having the most unique text instances;
indicating in a document object that an image corresponding to the most frequent unique text instance in a paragraph will be displayed in the dynamic representation of the paragraph; and
grouping paragraphs into sections, wherein two consecutive paragraphs are grouped together if one paragraph has a unique text instance and an image link so that each section has a unique text instance and an image to display; and
generating the dynamic representation of the data from the at least one selected image and the data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. An apparatus for creating a dynamic representation from data received from an information source, the apparatus comprising:
-
a memory; and
at least one processor coupled to the memory and operative to;
(i) store the data received from the information source in a file system;
(ii) identify at least one text instance using a text processing module;
(iii) find one or more images in an image database module relating to the at least one text instance;
(iv) select at least one image from the one or more found images to display with the data, comprising the operations of;
(a) identifying a number of unique text instances with associated images in the data;
(b) comparing the number of unique text instances with a minimum number and terminating the method if the number of unique text instances is less than the minimum number;
(c) storing the number of unique text instances in a temporary structure;
(d) ranking the unique text instances by their frequency in the data;
(e) processing paragraphs of the data beginning with the paragraph having the most unique text instances;
(f) indicating in a document object that an image corresponding to the most frequent unique text instance in a paragraph will be displayed in the dynamic representation of the paragraph; and
(g) grouping paragraphs into sections, wherein two consecutive paragraphs are grouped together if one paragraph has a unique text instance and an image link so that each section has a unique text instance and an image to display; and
(v) generate the dynamic representation of the data from the at least one selected image and the data.
-
-
16. An article of manufacture for creating a dynamic representation from data received from an information source, comprising a machine readable medium containing one or more programs which when executed implement the steps of:
-
storing the data received from the information source in a file system;
identifying at least one text instance in the data using a text processing module;
finding image in one or more images in an image database module relating to the at least one text instance;
selecting at least one image from the one or more found images to display with the data, comprising the steps of;
identifying a number of unique text instances with associated images in the data;
comparing the number of unique text instances with a minimum number and terminating the method if the number of unique text instances is less than the minimum number;
storing the number of unique text instances in a temporary structure;
ranking the unique text instances by their frequency in the data;
processing paragraphs of the data beginning with the paragraph having the most unique text instances;
indicating in a document object that an image corresponding to the most frequent unique text instance in a paragraph will be displayed in the dynamic representation of the paragraph; and
grouping paragraphs into sections, wherein two consecutive paragraphs are grouped together if one paragraph has a unique text instance and an image link so that each section has a unique text instance and an image to display; and
generating a dynamic representation of the data from the at least one selected image and the data.
-
Specification