System and method for web text content aggregation and presentation
Abstract
A system and method for aggregating text-based content and presenting the text-based content as spoken audio is described herein, where a server module retrieves and aggregates web content from web content providers that may include text-based web content that is then extracted, filtered, and categorized for a client module to retrieve and play as spoken audio.
50 Claims
1. A system for processing and presenting web content comprising:
a hardware network interface configured to receive web content from one or more web content providers, the web content including text-based content;
a processor; and
a computer-readable storage medium storing instructions executable by the processor to:
extract the text-based content from the web content;
parse the web content to extract the text-based content with a web content processing module by applying a different parsing strategy for each particular content provider associated with the web content, each parsing strategy being configured to filter and extract text-based content from an associated content provider for that parsing strategy based on a web document structure unique to the associated content provider;
select and retrieve a template document for encoding the text-based content based on one or more of a type of the text-based content and a content provider that provided the text-based content, the template document including placeholders to be replaced with respective portions of the text-based content, a configuration of the template document and the placeholders of the template document being selected based on the type of the text-based content, where different template document configurations having one or more different placeholders from other template document configurations are selectively employed to encode different types of text-based content; and
encode the text-based content to obtain encoded content that includes the text-based content with an encoding module using the retrieved template document, the retrieved template document being configured for the text-based content to encode and format the text-based content according to an encoding schema that is adapted for the type of the text-based content, and the encoded content having a format suitable for presenting the text-based content as spoken audio,
the system further including a content database including one or more web text tables for storing each item of the text-based content in respective rows of the web text tables, and including a category table for storing available categories of text-based content in respective rows of the category table, where each web text table of the one or more web text tables is associated with a different type of web text stored in that web text table, and each web text table includes different fields from other web text tables based on the type of web text stored in that web text table.
Dependent claims: 2-28.
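As an illustration only, the parse, select, and encode steps recited in claim 1 might be sketched as follows. The provider host names, the page structures, the `PARSING_STRATEGIES` and `TEMPLATES` tables, and the XML-like output markup are all hypothetical; the claim does not name any of them, and this is a minimal sketch rather than the patented implementation.

```python
import re
from string import Template
from xml.sax.saxutils import escape

# Hypothetical per-provider parsing strategies. Each one relies on a page
# structure assumed (for this sketch) to be specific to that provider.
def parse_example_news(html: str) -> dict:
    """Assumes stories use <h1> for the title and <p class="body"> for text."""
    title = re.search(r"<h1>(.*?)</h1>", html, re.S)
    paragraphs = re.findall(r'<p class="body">(.*?)</p>', html, re.S)
    return {
        "type": "news",
        "title": title.group(1).strip() if title else "",
        "body": " ".join(p.strip() for p in paragraphs),
    }

def parse_example_weather(html: str) -> dict:
    """Assumes forecasts use <span id="city"> and <div id="forecast">."""
    city = re.search(r'<span id="city">(.*?)</span>', html, re.S)
    forecast = re.search(r'<div id="forecast">(.*?)</div>', html, re.S)
    return {
        "type": "weather",
        "city": city.group(1).strip() if city else "",
        "forecast": forecast.group(1).strip() if forecast else "",
    }

# One strategy per content provider (hypothetical host names).
PARSING_STRATEGIES = {
    "news.example.com": parse_example_news,
    "weather.example.com": parse_example_weather,
}

# One template per content type; each has different placeholders.
TEMPLATES = {
    "news": Template('<item type="news"><title>$title</title><body>$body</body></item>'),
    "weather": Template('<item type="weather"><city>$city</city><forecast>$forecast</forecast></item>'),
}

def process(provider: str, html: str) -> str:
    """Parse with the provider's strategy, then encode with the type's template."""
    item = PARSING_STRATEGIES[provider](html)
    template = TEMPLATES[item["type"]]
    # Escape field values so the filled-in template stays well-formed markup.
    fields = {key: escape(value) for key, value in item.items() if key != "type"}
    return template.substitute(fields)
```

Keying the strategy on the provider and the template on the content type keeps the two choices independent, so supporting a new provider that publishes an existing content type would only need a new parsing function.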
29. A device for presenting web content comprising:
a hardware network interface configured to receive encoded content, the encoded content includes text-based content extracted from web content, and the encoded content is encoded by an encoding device in a format suitable for presenting the text-based content as spoken audio using a template automatically selected by an encoding device processor of the encoding device based on one or more of a type of the text-based content and a content provider for the text-based content, the template including placeholders to be replaced with respective portions of the text-based content, a configuration of the template and the placeholders in the template being selected based on the type of the text-based content and selectively employed to generate the encoded content;
a display configured to display a user interface for requesting, presenting, and selecting web text items;
an audio output device configured to output speech audio;
a processor; and
a computer-readable storage medium storing instructions executable by the processor to:
decode the encoded content with a decoding module to access the text-based content, the text-based content being accessed, via the hardware network interface, from a content database including one or more web text tables for storing each item of the text-based content in respective rows of the web text tables, the content database including a category table for storing available categories of text-based content in respective rows of the category table, where each web text table of the one or more web text tables is associated with a different type of web text stored in that web text table, and each web text table includes different fields from other web text tables based on the type of web text stored in that web text table;
display the user interface including a plurality of panels, the plurality of panels including an item list panel for displaying one or more web text items of the text-based content, and a content panel for displaying at least a portion of a selected web text item of the text-based content;
generate a speech audio signal based on the text-based content with a text-to-speech module responsive to receiving input to the user interface requesting a first web text item to be output as spoken audio; and
output the speech audio signal via the audio output device to present the first web text item as spoken audio.
Dependent claims: 30-49.
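The content database recited in the claim above (a category table plus one web text table per type of web text, each with its own fields) could be laid out roughly as in the sketch below. The table names, column names, and the choice of SQLite are assumptions made for illustration; the claim only requires that the web text tables differ in their fields and that categories are stored in rows of a category table.

```python
import sqlite3

def create_content_db(conn: sqlite3.Connection) -> None:
    """Create a hypothetical schema: one category table, two web text tables."""
    conn.executescript(
        """
        CREATE TABLE category (
            id   INTEGER PRIMARY KEY,
            name TEXT NOT NULL UNIQUE
        );
        -- Web text table for news items (assumed fields).
        CREATE TABLE news_text (
            id          INTEGER PRIMARY KEY,
            category_id INTEGER NOT NULL REFERENCES category(id),
            title       TEXT NOT NULL,
            body        TEXT NOT NULL,
            published   TEXT
        );
        -- Web text table for weather items, with different fields.
        CREATE TABLE weather_text (
            id          INTEGER PRIMARY KEY,
            category_id INTEGER NOT NULL REFERENCES category(id),
            city        TEXT NOT NULL,
            forecast    TEXT NOT NULL
        );
        """
    )

def add_category(conn: sqlite3.Connection, name: str) -> int:
    """Store one available category as a row and return its id."""
    cursor = conn.execute("INSERT INTO category (name) VALUES (?)", (name,))
    return cursor.lastrowid

def column_names(conn: sqlite3.Connection, table: str) -> list:
    """List a table's columns; the second field of PRAGMA table_info is the name."""
    return [row[1] for row in conn.execute(f"PRAGMA table_info({table})")]
```

A client could use `column_names` to discover which fields a given web text table offers before building the item list panel, although the claim does not say how the client learns the table layout.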
50. A system for processing and presenting web content comprising:
a server module that includes a first processor and a first computer-readable storage medium, the first computer-readable storage medium storing first instructions executable by the first processor to receive web content from one or more web content providers, the web content including text-based content, and the first instructions further executable by the first processor to provide:
an aggregation module that generates one or more requests for web content from the one or more web content providers; and
a web content processing module that parses the web content received at the server module to extract the text-based content;
the server module further including a content database including one or more web text tables for storing each item of the text-based content in respective rows of the web text tables, and including a category table for storing available categories of text-based content in respective rows of the category table, where each web text table of the one or more web text tables is associated with a different type of web text stored in that web text table, and each web text table includes different fields from other web text tables based on the type of web text stored in that web text table;
an encoding module that encodes the text-based content in a markup language to obtain encoded content as a document formatted in the markup language, the document including the text-based content based on a template document retrieved from a database storing a plurality of template documents, the template document including placeholders to be replaced with respective portions of the text-based content, a configuration of the template document and the placeholders of the template document being selected based on a type of the text-based content, the encoding module using a schema for the markup language that is adapted for a type of web text requested, and the encoded content having a format suitable for presenting the text-based content as spoken audio; and
a client module in signal communication with the server module that includes a second processor and a second computer-readable storage medium, the second computer-readable storage medium storing second instructions executable by the second processor to receive the encoded content from the server module, the second instructions further executable by the second processor to provide:
a decoding module that decodes the encoded content to access the text-based content;
a translation module that receives the text-based content from the decoding module and translates the text-based content from a first language to a second language, the first language being different from the second language, and the first language being a language in which the client module provided a request for the text-based content; and
a text-to-speech module that receives the text-based content from the translation module and generates a speech audio signal based on the text-based content such that output of the speech audio signal presents the web text as spoken audio in the second language.
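The client-side chain recited in claim 50 (decoding module, then translation module, then text-to-speech module) can be pictured with the sketch below. It assumes the encoded content is an XML-like document; `toy_translate` and `toy_synthesize` are hypothetical stand-ins for a real translation service and a real speech synthesizer, neither of which the claim identifies.

```python
import xml.etree.ElementTree as ET
from typing import Callable

def decode(encoded: str) -> str:
    """Decoding module: recover the plain text from the encoded markup."""
    root = ET.fromstring(encoded)
    return " ".join(chunk.strip() for chunk in root.itertext() if chunk.strip())

def make_client(
    translate: Callable[[str], str],
    synthesize: Callable[[str], bytes],
) -> Callable[[str], bytes]:
    """Wire the three client modules into one decode/translate/speak function."""
    def present(encoded: str) -> bytes:
        text = decode(encoded)        # decoding module
        translated = translate(text)  # translation module
        return synthesize(translated) # text-to-speech module
    return present

# Stand-in translation: a tiny word glossary, not a real translator.
def toy_translate(text: str) -> str:
    glossary = {"rain": "lluvia", "today": "hoy"}
    return " ".join(glossary.get(word.lower(), word) for word in text.split())

# Stand-in synthesis: returns UTF-8 bytes in place of an audio signal.
def toy_synthesize(text: str) -> bytes:
    return text.encode("utf-8")
```

Passing the translation and synthesis steps in as functions mirrors the claim's ordering, in which each module receives its input from the previous one, while leaving the actual engines open.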
Specification