System and method for web text content aggregation and presentation

US 9,754,045 B2
Filed: 12/02/2011
Issued: 09/05/2017
Est. Priority Date: 04/01/2011
Status: Active Grant

Abstract

A system and method for aggregating text-based content and presenting the text-based content as spoken audio are described herein. A server module retrieves and aggregates web content from web content providers. The web content may include text-based web content, which is extracted, filtered, and categorized for a client module to retrieve and play as spoken audio.

50 Claims

1. A system for processing and presenting web content comprising:
- a hardware network interface configured to receive web content from one or more web content providers, the web content including text-based content;
  
  a processor; and
  
  a computer-readable storage medium storing instructions executable by the processor to:
  
  extract the text-based content from the web content;
  
  parse the web content to extract the text-based content with a web content processing module by applying a different parsing strategy for each particular content provider associated with the web content, each parsing strategy being configured to filter and extract text-based content from an associated content provider for that parsing strategy based on a web document structure unique to the associated content provider;
  
  select and retrieve a template document for encoding the text-based content based on one or more of a type of the text-based content and a content provider that provided the text-based content, the template document including placeholders to be replaced with respective portions of the text-based content, a configuration of the template document and the placeholders of the template document being selected based on the type of the text-based content, where different template document configurations having one or more different placeholders from other template document configurations are selectively employed to encode different types of text-based content; and
  
  encode the text-based content to obtain encoded content that includes the text-based content with an encoding module using the retrieved template document, the retrieved template document being configured for the text-based content to encode and format the text-based content according to an encoding schema that is adapted for the type of the text-based content, and the encoded content having a format suitable for presenting the text-based content as spoken audio, the system further including a content database including one or more web text tables for storing each item of the text-based content in respective rows of the web text tables, and including a category table for storing available categories of text-based content in respective rows of the category table, where each web text table of the one or more web text tables is associated with a different type of web text stored in that web text table, and each web text table includes different fields from other web text tables based on the type of web text stored in that web text table.
- - 2. The system of claim 1 where the encoded content is transmitted to a client in response to receipt of a request for text-based content from the client where output of a speech audio signal that is generated based on the text-based content included in the encoded content presents the text-based content as spoken audio.
  - 3. The system of claim 2 where the request for text-based content includes a URL having one or more parameters associated with the requested text-based content.
  - 4. The system of claim 2 where:
    - the request for text-based content indicates a predetermined web content provider; and
      
      the encoded content transmitted to the client includes text-based content associated with the predetermined web content provider.
  - 5. The system of claim 2 where:
    - the request for text-based content indicates a predetermined category; and
      
      the encoded content transmitted to the client includes text-based content associated with the predetermined category.
  - 6. The system of claim 2 where:
    - the request for text-based content includes location information that relates to a determined geographic location of the client; and
      
      the encoded content transmitted to the client includes text-based content associated with the location information.
  - 7. The system of claim 6 where the text-based content associated with the location information includes traffic information that relates to one or more roads near the determined geographic location of the client.
  - 8. The system of claim 1 where the encoded content is automatically transmitted to a client in response to a trigger.
  - 9. The system of claim 8 where the trigger is an end to a periodic interval or receipt of web content from one of the web content providers.
  - 10. The system of claim 1 where:
    - the encoding module encodes the text-based content in an XML-format to obtain the encoded content as an XML-formatted document, the encoding module using an XML schema that is indicated in the template document and adapted for a type of web text requested, the template document being selected based on the type of web text requested; and
      
      the XML-formatted document is transmitted to a client for presentation of the text-based content as spoken audio.
  - 11. The system of claim 1 where, for each template document of a plurality of template documents, a database indicates one or more of a type of web text and a content provider associated with that template document, and where all text-based content having a type of web text or originating from a content provider is encoded via an encoding scheme from the template document associated with that type of web text or that content provider.
  - 12. The system of claim 10 wherein the instructions are further executable by the processor to compress the encoded content with a compression module to obtain compressed content that includes the encoded content, and wherein each parsing strategy is configured to analyze and distinguish HTML structure and HTML tags used by the associated content provider for that parsing strategy.
  - 13. The system of claim 1 wherein the instructions are further executable by the processor to generate one or more requests for web content from the one or more web content providers with an aggregation module.
  - 14. The system of claim 13 where the aggregation module automatically generates the one or more requests for web content at a periodic interval.
  - 15. The system of claim 14 where:
    - the aggregation module receives web content from one of the web content providers in response to receipt of one of the requests for web content at the web content provider; and
      
      the aggregation module withholds generating subsequent requests for non-text-based web content associated with the web content received from the web content provider.
  - 16. The system of claim 13 where the aggregation module generates at least one of the requests for web content in response to receipt of a request for text-based content from a client.
  - 17. The system of claim 13 where at least one of the requests for web content includes a request for a web page document.
  - 18. The system of claim 13 where at least one of the requests for web content includes a request for a web feed document.
  - 19. The system of claim 18 where:
    - the web feed document includes a list of one or more web page documents; and
      
      the aggregation module iterates through the list and generates a request for at least one of the web page documents in the list.
  - 20. The system of claim 19 where:
    - the web content received from one of the web content providers has a predetermined format; and
      
      the web content processing module applies a parsing strategy based on the predetermined format to extract the text-based content from the web content.
  - 21. The system of claim 19 where the web content processing module categorizes the text-based content.
  - 22. The system of claim 19 further comprising a data storage module that stores the text-based content.
  - 23. The system of claim 22 where the data storage module is one or more relational databases comprising one or more web text tables having fields depending on the type of the text-based content.
  - 24. The system of claim 1 further comprising an account storage module that stores one or more client profiles that are respectively associated with one or more clients.
  - 25. The system of claim 24 where the one or more client profiles respectively indicate one or more predetermined web content providers.
  - 26. The system of claim 25 where the encoded content includes text-based content associated with at least one of the predetermined web content providers indicated in one of the client profiles.
  - 27. The system of claim 1 where access to the system is subscription-based such that:
    - the system transmits the encoded content in response to receipt of a request for text-based content where the request is associated with a valid subscription; and
      
      the system ignores the request for text-based content where the request is not associated with a valid subscription.
  - 28. The system of claim 27 further comprising an access control module that determines whether the request for text-based content is associated with a valid subscription.
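
As an illustration only, the per-provider parsing strategies and type-specific templates recited in claims 1 and 10 might be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the provider names (`news.example`, `weather.example`), the tag-based extraction rule, the XML element names, and the helper names `extract` and `encode` are all hypothetical.

```python
from html.parser import HTMLParser
from string import Template
from xml.sax.saxutils import escape


class TagTextParser(HTMLParser):
    """Collects the text found inside one tag name (a hypothetical per-provider rule)."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        # Keep text only while inside the wanted tag; everything else is filtered out.
        if self.depth > 0 and data.strip():
            self.chunks.append(data.strip())


def extract(html, tag):
    parser = TagTextParser(tag)
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# One parsing strategy per content provider (provider names are made up).
PARSING_STRATEGIES = {
    "news.example": lambda html: {
        "title": extract(html, "h1"),
        "body": extract(html, "p"),
    },
    "weather.example": lambda html: {
        "city": extract(html, "h2"),
        "forecast": extract(html, "span"),
    },
}

# One template per content type; each template has its own placeholders.
TEMPLATES = {
    "news": Template('<item type="news"><title>$title</title><body>$body</body></item>'),
    "weather": Template(
        '<item type="weather"><city>$city</city><forecast>$forecast</forecast></item>'
    ),
}


def encode(provider, content_type, html):
    """Parse with the provider's strategy, then fill the template for the content type."""
    fields = PARSING_STRATEGIES[provider](html)
    safe = {name: escape(value) for name, value in fields.items()}
    return TEMPLATES[content_type].substitute(safe)


if __name__ == "__main__":
    page = "<html><body><h1>Storm warning</h1><div>advert</div><p>Heavy rain expected.</p></body></html>"
    print(encode("news.example", "news", page))
```

Keeping strategies and templates in separate lookup tables mirrors the claim language, where the parsing strategy depends on the content provider while the template configuration depends on the type of text-based content.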

29. A device for presenting web content comprising:
- a hardware network interface configured to receive encoded content, the encoded content includes text-based content extracted from web content, and the encoded content is encoded by an encoding device in a format suitable for presenting the text-based content as spoken audio using a template automatically selected by an encoding device processor of the encoding device based on one or more of a type of the text-based content and a content provider for the text-based content, the template including placeholders to be replaced with respective portions of the text-based content, a configuration of the template and the placeholders in the template being selected based on the type of the text-based content and selectively employed to generate the encoded content;
  
  a display configured to display a user interface for requesting, presenting, and selecting web text items;
  
  an audio output device configured to output speech audio;
  
  a processor; and
  
  a computer-readable storage medium storing instructions executable by the processor to:
  
  decode the encoded content with a decoding module to access the text-based content, the text-based content being accessed, via the hardware network interface, from a content database including one or more web text tables for storing each item of the text-based content in respective rows of the web text tables, the content database including a category table for storing available categories of text-based content in respective rows of the category table, where each web text table of the one or more web text tables is associated with a different type of web text stored in that web text table, and each web text table includes different fields from other web text tables based on the type of web text stored in that web text table;
  
  display the user interface including a plurality of panels, the plurality of panels including an item list panel for displaying one or more web text items of the text-based content, and a content panel for displaying at least a portion of a selected web text item of the text-based content;
  
  generate a speech audio signal based on the text-based content with a text-to-speech module responsive to receiving input to the user interface requesting a first web text item to be output as spoken audio; and
  
  output the speech audio signal via the audio output device to present the first web text item as spoken audio.
- - 30. The device of claim 29 where the encoded content includes text-based content that is in a first natural language, the instructions further executable by the processor to:
    - translate the text-based content in the first natural language into translated text-based content in a second natural language that is different from the first natural language with a translation module.
  - 31. The device of claim 29 where the device transmits a request for text-based content to a server and receives the encoded content in response to receipt of the request at the server.
  - 32. The device of claim 31 where the request for the text-based content includes a URL having one or more parameters associated with the requested text-based content.
  - 33. The device of claim 31 where:
    - the request for the text-based content indicates a predetermined content provider; and
      
      the encoded content received in response to the request includes text-based content associated with the predetermined content provider.
  - 34. The device of claim 31 where:
    - the request for the text-based content indicates a predetermined category; and
      
      the encoded content received in response to the request includes text-based content associated with the predetermined category.
  - 35. The device of claim 31 where:
    - the device is in signal communication with a positioning module that determines a geographic location of the device;
      
      the request for the text-based content includes location information that relates to the determined geographic location of the device; and
      
      the encoded content received in response to the request includes text-based content associated with the location information.
  - 36. The device of claim 35 where the text-based content associated with the location information includes traffic information that relates to one or more roads near the determined geographic location of the device.
  - 37. The device of claim 29 where the encoded content is an XML-formatted document.
  - 38. The device of claim 29 where the device receives compressed content that includes the encoded content and the instructions are further executable by the processor to:
    - decompress the compressed content with a decompression module to access the encoded content.
  - 39. The device of claim 29 where the speech audio signal is a pulse code modulated signal.
  - 40. The device of claim 29 where the audio output device receives the speech audio signal and outputs the speech audio signal such that the text-based content is presented as spoken audio.
  - 41. The device of claim 40 where the speech audio signal is output at the audio output device in response to receipt of user input at the device.
  - 42. The device of claim 41 where, in response to the user input at the device:
    - output of a first speech audio signal associated with first text-based content is terminated; and
      
      output of a second speech audio signal associated with second text-based content is automatically initiated.
  - 43. The device of claim 40 where the speech audio signal is automatically output at the audio output device in response to receipt of the encoded content at the device.
  - 44. The device of claim 40 where a plurality of speech audio signals respectively associated with a plurality of text-based content associated with a plurality of different web text categories is randomly output in succession at the audio output device responsive to receiving user input selecting a “random” user interface button.
  - 45. The device of claim 29 where the device is installed in a vehicle.
  - 46. The device of claim 45 where the device is in signal communication with a vehicle audio system and the speech audio signal is output at the vehicle audio system.
  - 47. The device of claim 29 where the encoded content includes text-based content associated with one or more predetermined web content providers that are indicated in a profile associated with the device.
  - 48. The device of claim 29 where receipt of the encoded content is subscription-based such that the device receives the encoded content in response to receipt at a server of a request for text-based content that is associated with a valid subscription.
  - 49. The device of claim 48 where the request for text-based content includes access credentials that indicate the request is associated with a valid subscription.
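
As an illustration only, the client-side steps recited in claims 32, 37, and 38 (a request URL carrying parameters, decompression, and decoding of an XML-formatted document) might look like the sketch below. The server address, the parameter names, the `<items>`/`<item>` element names, and the use of zlib are assumptions made for the example; the claims do not specify a compression format or an XML vocabulary.

```python
import zlib
import xml.etree.ElementTree as ET
from urllib.parse import urlencode


def build_request_url(base, **params):
    """Return a request URL whose query parameters describe the wanted content."""
    return f"{base}?{urlencode(params)}"


def decode_payload(compressed):
    """Decompress the payload, parse the XML, and return one dict of fields per item."""
    xml_text = zlib.decompress(compressed).decode("utf-8")
    root = ET.fromstring(xml_text)
    return [
        {child.tag: (child.text or "") for child in item}
        for item in root.iter("item")
    ]


def speech_queue(items):
    """Turn decoded items into plain strings that a text-to-speech engine could read."""
    return [". ".join(value for value in item.values() if value) for item in items]


if __name__ == "__main__":
    # Hypothetical request for traffic text near a location (compare claims 35 and 36).
    print(build_request_url("https://server.example/webtext",
                            category="traffic", lat="31.23", lon="121.47"))

    # Simulated server response: an XML document compressed with zlib.
    payload = zlib.compress(
        b"<items><item><title>Traffic</title><body>Slow on Ring Road</body></item></items>"
    )
    print(speech_queue(decode_payload(payload)))
```

The text-to-speech step itself is left out, since the claims do not name a speech engine; `speech_queue` only prepares the strings such an engine would receive.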

50. A system for processing and presenting web content comprising:
- a server module that includes a first processor and a first computer-readable storage medium, the first computer-readable storage medium storing first instructions executable by the first processor to receive web content from one or more web content providers, the web content including text-based content, and the first instructions further executable by the first processor to provide:
  
  an aggregation module that generates one or more requests for web content from the one or more web content providers; and
  
  a web content processing module that parses the web content received at the server module to extract the text-based content;
  
  the server module further including a content database including one or more web text tables for storing each item of the text-based content in respective rows of the web text tables, and including a category table for storing available categories of text-based content in respective rows of the category table, where each web text table of the one or more web text tables is associated with a different type of web text stored in that web text table, and each web text table includes different fields from other web text tables based on the type of web text stored in that web text table;
  
  an encoding module that encodes the text-based content in a markup language to obtain encoded content as a document formatted in the markup language, the document including the text-based content based on a template document retrieved from a database storing a plurality of template documents, the template document including placeholders to be replaced with respective portions of the text-based content, a configuration of the template document and the placeholders of the template document being selected based on a type of the text-based content, the encoding module using a schema for the markup language that is adapted for a type of web text requested, and the encoded content having a format suitable for presenting the text-based content as spoken audio; and
  
  a client module in signal communication with the server module that includes a second processor and a second computer-readable storage medium, the second computer-readable storage medium storing second instructions executable by the second processor to receive the encoded content from the server module, the second instructions further executable by the second processor to provide:
  
  a decoding module that decodes the encoded content to access the text-based content;
  
  a translation module that receives the text-based content from the decoding module and translates the text-based content from a first language to a second language, the first language being different from the second language, and the first language being a language in which the client module provided a request for the text-based content; and
  
  a text-to-speech module that receives the text-based content from the translation module and generates a speech audio signal based on the text-based content such that output of the speech audio signal presents the web text as spoken audio in the second language.
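
As an illustration only, the content database recited in claims 1, 29, and 50 (a category table plus one web text table per type of web text, each with different fields) could be laid out as in the sketch below. The choice of SQLite and every table and column name are hypothetical; the claims only require that each web text table have fields suited to its type.

```python
import sqlite3

# Hypothetical schema: one category table and two type-specific web text tables.
SCHEMA = """
CREATE TABLE category (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE news_text (
    id          INTEGER PRIMARY KEY,
    category_id INTEGER NOT NULL REFERENCES category(id),
    provider    TEXT NOT NULL,
    title       TEXT NOT NULL,
    body        TEXT NOT NULL
);
CREATE TABLE traffic_text (
    id          INTEGER PRIMARY KEY,
    category_id INTEGER NOT NULL REFERENCES category(id),
    provider    TEXT NOT NULL,
    road        TEXT NOT NULL,
    status      TEXT NOT NULL
);
"""


def open_content_db():
    """Create an in-memory database holding the sketch schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def columns(conn, table):
    """List the column names of a table, to show how the web text tables differ."""
    return [row[1] for row in conn.execute(f"PRAGMA table_info({table})")]


if __name__ == "__main__":
    conn = open_content_db()
    conn.execute("INSERT INTO category (name) VALUES ('traffic')")
    conn.execute(
        "INSERT INTO traffic_text (category_id, provider, road, status) "
        "VALUES (1, 'roads.example', 'Ring Road', 'slow')"
    )
    print(columns(conn, "news_text"))
    print(columns(conn, "traffic_text"))
```

Each item of text-based content occupies one row of the table for its type, and each available category occupies one row of `category`, which matches the row-per-item wording of the claims.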

Current Assignee: Harman International (China) Holdings Co. Ltd. (Samsung Electronics Co. Ltd.)
Original Assignee: Harman International (China) Holdings Co. Ltd. (Samsung Electronics Co. Ltd.)
Inventors: Wang, Charles Chuanming; Ling, Yong
Primary Examiner(s): ADESANYA, OLUJIMI A
Application Number: US13/310,615
Publication Number: US 20120253814A1
Time in Patent Office: 2,104 Days
Field of Search: 704258, 704260
US Class Current
CPC Class Codes

G06F 16/9577 Optimising the visualizatio...

G10L 13/02 Methods for producing synth...
