System and method for dynamically translating HTML to VoiceXML intelligently

US 7,185,276 B2
Filed: 08/09/2001
Issued: 02/27/2007
Est. Priority Date: 08/09/2001
Status: Expired due to Fees



Abstract

A system and method for dynamically translating a Hypertext Markup Language (HTML) document to Voice eXtensible Markup Language (VoiceXML) form includes a VoiceXML server for receiving a user request and, in response to the user request, making a Hypertext Transfer Protocol (HTTP) request, a VoiceXML session manager for receiving the HTTP request from the voice server and, in response to the HTTP request, accessing the HTML document, translating the HTML document to a VoiceXML document after performing document structure analysis (DSA) and text summarization (TS) of the HTML document and including user profile information with the VoiceXML document and sending the VoiceXML document to the voice server, so that the voice server can send the VoiceXML document to the user in an audible form.

206 Citations

21 Claims

1. A system for dynamically translating a Hypertext Markup Language (HTML) document to Voice eXtensible Markup Language (VoiceXML) form comprising:
- a voice server for receiving a user request and, in response to the user request, making a Hypertext Transfer Protocol (HTTP) request;
  
  a voice session manager for receiving the HTTP request from the voice server and, in response to the HTTP request, accessing the HTML document, translating the HTML document to a VoiceXML document and sending the VoiceXML document to the voice server, so that the voice server can send the VoiceXML document to the user in an audible form; and
  
  a document structure analyzer java server page (DSA JSP) for partitioning the HTML document into a plurality of text sections and a plurality of link sections;
  
  wherein the DSA JSP differentiates between the plurality of text sections and the plurality of link sections by calculating a link density D1 of a section, where the section may be a link section if the link density D1 is greater than about 0.75, or otherwise the section may be a text section;
  
  wherein the link density D1 is given by the equation D1 = (Hc − K·I1)/Sc, where Hc is a number of non-tag characters in a section that appears inside HREF, a link tag in html, K is a weight value equal to about 5, I1 is a number of links within image maps in the section, and Sc is a total number of non-tag characters in the section.
- Dependent claims (2, 3, 4, 5, 6, 7):
  - 2. The system for dynamically translating an HTML document to VoiceXML form according to claim 1, further comprising a text summarization java server page (TS JSP) for performing summarization of the plurality of text sections of the HTML document.
  - 3. The system for dynamically translating an HTML document to VoiceXML form according to claim 2, wherein the TS JSP provides text highlights or an abstract that contains important clauses or sentences from the plurality of text sections.
  - 4. The system for dynamically translating an HTML document to VoiceXML form according to claim 1, wherein a plurality of earcons are provided for the user to differentiate between the plurality of text sections and the plurality of link sections.
  - 5. The system for dynamically translating an HTML document to VoiceXML form according to claim 1, further comprising a user profile java server page for interpreting user profile information stored in a database.
  - 6. The system for dynamically translating an HTML document to VoiceXML form according to claim 5, wherein the user profile information includes one or more of authentication information, bookmarks, a list of favorite sites, e-mail account information and user default options.
  - 7. The system for dynamically translating an HTML document to VoiceXML form according to claim 1, wherein the voice session manager calls an HTML parser that parses and corrects the HTML document.
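
The link density test recited in claim 1 can be illustrated with a short Python sketch. The function and parameter names below are hypothetical, the handling of an empty section is an assumption the claim does not address, and the patent itself describes a Java Server Page rather than Python.

```python
def link_density(href_chars, image_map_links, section_chars, k=5.0):
    """Compute D1 = (Hc - K*I1) / Sc as recited in claim 1.

    href_chars      -- Hc, non-tag characters that appear inside HREF links
    image_map_links -- I1, number of links within image maps in the section
    section_chars   -- Sc, total non-tag characters in the section
    k               -- K, weight value ("about 5" in the claim)
    """
    if section_chars <= 0:
        # Assumption: a section with no visible characters has density 0.
        return 0.0
    return (href_chars - k * image_map_links) / section_chars


def classify_section(href_chars, image_map_links, section_chars,
                     k=5.0, threshold=0.75):
    """Label a section 'link' if D1 exceeds the threshold, else 'text'."""
    d1 = link_density(href_chars, image_map_links, section_chars, k)
    return "link" if d1 > threshold else "text"
```

For example, a section with 100 non-tag characters, 80 of them inside links and no image-map links, has D1 = 0.8 and is classified as a link section; with two image-map links, D1 drops to 0.7 and the section is classified as text.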

8. A method for dynamically translating an HTML document to VoiceXML form, comprising the steps of:
- making an HTTP request in response to a request by a user;
  
  accessing the HTML document in response to the HTTP request;
  
  translating the HTML document to a VoiceXML document; and
  
  sending the VoiceXML document to the user in an audible form; and
  
  partitioning the HTML document into a plurality of text sections and a plurality of link sections;
  
  wherein the plurality of text sections and the plurality of link sections are differentiated by calculating a link density D1 of a section, where the section may be a link section if the link density D1 is greater than about 0.75, or otherwise the section may be a text section;
  
  wherein the link density D1 is given by the equation D1 = (Hc − K·I1)/Sc, where Hc is a number of non-tag characters in a section that appears inside HREF, a link tag in html, K is a weight value equal to about 5, I1 is a number of links within image maps in the section, and Sc is a total number of non-tag characters in the section.
- Dependent claims (9, 10, 11, 12, 13, 14, 15, 16):
  - 9. The method for dynamically translating an HTML document to VoiceXML form according to claim 8, further comprising the step of performing summarization of the plurality of text sections of the HTML document.
  - 10. The method for dynamically translating an HTML document to VoiceXML form according to claim 8, further comprising the step of providing text highlights or an abstract that contains important clauses or sentences from the plurality of text sections.
  - 11. The method for dynamically translating an HTML document to VoiceXML form according to claim 8, further comprising the step of:
    - providing a plurality of earcons for the user to differentiate between the plurality of text sections and the plurality of link sections.
  - 12. The method for dynamically translating an HTML document to VoiceXML form according to claim 8, further comprising the steps of:
    - extracting a segment from the HTML document, the segment including a plurality of tag sequences;
      
      processing the plurality of tag sequences;
      
      finding the largest tag sequence of the plurality of tag sequences, if the plurality of tag sequences are section titles or text tags; and
      
      forming a plurality of segment sections, if the plurality of tag sequences are not section titles or text tags; and
      
      collecting the plurality of segment sections.
  - 13. The method for dynamically translating an HTML document to VoiceXML form according to claim 12, further comprising the steps of:
    - processing the plurality of segment sections;
      
      obtaining an HTML markup of a segment section if the segment section is a text section;
      
      summarizing the HTML markup of the segment section; and
      
      forming an HTML markup object structure from the summarized HTML markup.
  - 14. The method for dynamically translating an HTML document to VoiceXML form according to claim 8, further comprising the step of interpreting user profile information.
  - 15. The method for dynamically translating an HTML document to VoiceXML form according to claim 14, wherein the user profile information includes one or more of authentication information, bookmarks, a list of favorite sites, e-mail account information and user default options.
  - 16. The method for dynamically translating an HTML document to VoiceXML form according to claim 8, further comprising the steps of:
    - making an HTTP connection and accessing a universal resource allocator (URL);
      
      parsing an HTTP header of the HTML document;
      
      correcting the HTML document if HTML is ill-formed; and
      
      converting the HTML document to object representation.
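
The correction and object-conversion steps of claim 16 could look roughly like the following Python sketch, built on the standard library's `html.parser`. The names `Node`, `TreeBuilder`, `parse_html`, and `VOID_TAGS` are hypothetical, and the recovery rule (close the nearest matching open element, ignore stray end tags) is an assumption; the claims do not say how ill-formed HTML is corrected. The HTTP connection and header-parsing steps are left out.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser

# Elements that never have a closing tag (partial list, for illustration).
VOID_TAGS = {"area", "base", "br", "col", "hr", "img", "input", "link", "meta"}


@dataclass
class Node:
    tag: str
    attrs: dict = field(default_factory=dict)
    children: list = field(default_factory=list)  # Node objects or str


class TreeBuilder(HTMLParser):
    """Builds a Node tree while tolerating unclosed and stray tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.root = Node("document")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, dict(attrs))
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close the nearest matching open element, which implicitly closes
        # anything left open inside it. A stray end tag is ignored.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.stack[-1].children.append(text)


def parse_html(markup):
    """Return the root Node of a (possibly ill-formed) HTML string."""
    builder = TreeBuilder()
    builder.feed(markup)
    builder.close()  # flushes any buffered trailing text
    return builder.root
```

Calling `parse_html("<p>Hello <b>world</p></div><p>Next")` yields a root with two `p` children: the unclosed `b` is closed along with its parent, the stray `</div>` is dropped, and the final unclosed `p` still receives its text.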

17. A method for dynamically translating an HTML document to VoiceXML form, comprising the steps of:
- making an HTTP request in response to a request by a user;
  
  accessing the HTML document in response to the HTTP request;
  
  translating the HTML document to a VoiceXML document;
  
  sending the VoiceXML document to the user in an audible form;
  
  partitioning the HTML document into a plurality of text sections and a plurality of link sections;
  
  extracting a segment from the HTML document, the segment including a plurality of tag sequences;
  
  processing the plurality of tag sequences;
  
  finding the largest tag sequence of the plurality of tag sequences, if the plurality of tag sequences are section titles or text tags;
  
  forming a plurality of segment sections, if the plurality of tag sequences are not section titles or text tags; and
  
  collecting the plurality of segment sections;
  
  processing the plurality of segment sections;
  
  obtaining an HTML markup of a segment section if the segment section is a text section;
  
  summarizing the HTML markup of the segment section;
  
  forming an HTML markup object structure from the summarized HTML markup; and
  
  further comprising the steps of;
  
  processing a plurality of tags in the HTML markup object structure;
  
  adding a VoiceXML audio tag from a paragraph or text earcon;
  
  creating java speech markup language (JSML) text for a text-to-speech (TTS) engine;
  
  creating a grammar from embedded tags;
  
  creating a VoiceXML prompt tag if a tag among the plurality of tags is a paragraph tag or a text tag; and
  
  creating a VoiceXML form tag.
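
The final steps of claim 17 (audio tag from an earcon, prompt tag per text section, enclosing form tag) can be sketched in Python with `xml.etree.ElementTree`. The function name, the `EARCONS` file names, the `version` and `id` attribute values, and the `(kind, text)` section tuples are all assumptions made for illustration. JSML text creation and grammar creation are not shown.

```python
import xml.etree.ElementTree as ET

# Hypothetical earcon audio files; the claims do not name any.
EARCONS = {"text": "text.wav", "link": "link.wav"}


def sections_to_vxml(sections, earcons=EARCONS):
    """Build a VoiceXML string with one prompt per (kind, text) section."""
    vxml = ET.Element("vxml", {"version": "2.0"})
    form = ET.SubElement(vxml, "form", {"id": "page"})
    for kind, text in sections:
        block = ET.SubElement(form, "block")
        prompt = ET.SubElement(block, "prompt")
        # The earcon plays first, then the section text is spoken.
        audio = ET.SubElement(prompt, "audio", {"src": earcons[kind]})
        audio.tail = text
    return ET.tostring(vxml, encoding="unicode")
```

Passing `[("text", "Welcome."), ("link", "News")]` produces a single form containing two blocks, each with a prompt that begins with the earcon matching the section type.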

18. A system for dynamically translating a Hypertext Markup Language (HTML) document to Voice eXtensible Markup Language (VoiceXML) form comprising:
- a voice server for receiving a user request and, in response to the user request, making a Hypertext Transfer Protocol (HTTP) request;
  
  a voice session manager for receiving the HTTP request from the voice server and, in response to the HTTP request, accessing the HTML document, translating the HTML document to a VoiceXML document and sending the VoiceXML document to the voice server, so that the voice server can send the VoiceXML document to the user in an audible form;
  
  a document structure analyzer java server page (DSA JSP) for partitioning the HTML document into a plurality of text sections and a plurality of link sections;
  
  a text summarization java server page (TS JSP) for performing summarization of the plurality of text sections of the HTML document; and
  
  a user profile java server page for interpreting user profile information stored in a database, including one or more of authentication information, bookmarks, a list of favorite Web sites, e-mail account information and user default options;
  
  wherein the DSA JSP differentiates between the plurality of text sections and the plurality of link sections by calculating a link density D1 of a section, where the section may be a link section if the link density D1 is greater than about 0.75, or otherwise the section may be a text section;
  
  wherein the link density D1 is given by the equation D1 = (Hc − K·I1)/Sc, where Hc is a number of non-tag characters in a section that appears inside HREF, a link tag in html, K is a weight value equal to about 5, I1 is a number of links within image maps in the section, and Sc is a total number of non-tag characters in the section.
- Dependent claims (19, 20, 21):
  - 19. The system for dynamically translating an HTML document to VoiceXML form according to claim 18, wherein a plurality of earcons are provided for the user to differentiate between the plurality of text sections and the plurality of link sections.
  - 20. The system for dynamically translating an HTML document to VoiceXML form according to claim 18, wherein the TS JSP provides text highlights or an abstract that contains important clauses or sentences from the plurality of text sections.
  - 21. The system for dynamically translating an HTML document to VoiceXML form according to claim 18, wherein the voice session manager calls an HTML parser that parses and corrects the HTML document.
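
Claims 3 and 20 recite text highlights that contain important sentences from the text sections, without fixing a summarization algorithm. As one possible illustration only, the Python sketch below uses simple word-frequency scoring, a generic extractive technique that is not taken from the patent. The function name, the stopword list, and the sentence-splitting rule are assumptions.

```python
import re
from collections import Counter

# Small illustrative stopword list (assumption, not from the patent).
STOPWORDS = {"a", "an", "and", "are", "in", "is", "of", "the", "to"}


def _content_words(sentence):
    """Lowercased words of a sentence, with stopwords removed."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower())
            if w not in STOPWORDS]


def highlights(text, max_sentences=2):
    """Return the highest-scoring sentences, kept in original order."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text)
                 if s.strip()]
    freq = Counter(w for s in sentences for w in _content_words(s))

    def score(sentence):
        words = _content_words(sentence)
        # Average word frequency, so long sentences are not favored.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:max_sentences])]
```

Sentences that share vocabulary with the rest of the section score higher, so an off-topic sentence is the first to be dropped when only a few highlights are read aloud.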


Current Assignee
Voxera Corporation
Original Assignee
Voxera Corporation
Inventors
Keswa, Mduduzi
Primary Examiner(s)
Hong; Stephen
Assistant Examiner(s)
Campbell; Joshua D

Application Number

US09/924,445
Publication Number

US 20040205614A1
Time in Patent Office

2,028 Days
Field of Search

715/500, 715/500.1, 715/511-513, 715/523, 715/539, 715/747, 707/1, 707/10, 709/218, 709/224
US Class Current

715/239
CPC Class Codes

H04L 63/08   for authentication of entit...

H04L 67/02   based on web technology, e....

H04L 67/56   Provisioning of proxy servi...

H04L 67/565   Conversion or adaptation of...

H04M 3/4938   comprising a voice browser ...

Y10S 707/99931   Database or file accessing
