SYSTEMS AND METHODS FOR NORMALIZING INPUT MEDIA
First Claim
1. A system for processing input media for provision to a text to speech engine comprising:
- a rules engine configured to maintain and update rules for processing the input media, wherein the rules comprise pre-parsing rules, parsing rules, tagging rules, and post-parsing rules;
a pre-parsing filter module configured to determine one or more metadata attributes using pre-parsing rules, wherein one metadata attribute is an application type;
a parsing filter module configured to query the rules engine for parsing rules associated with the one or more metadata attributes and to identify a content component from the input media using the parsing rules;
a context and language detector configured to determine a default context and a default language for at least part of the content component;
a learning agent configured to divide the content component into units of interest;
a tagging module configured to query the rules engine for tagging rules associated with the default context and the default language and to iteratively assign tags to the units of interest using the tagging rules, wherein each tag is associated with a post-parsing rule;
a post-parsing filter module configured to modify the content component by executing the post-parsing rules identified by the tags assigned to the units of interest;
wherein the context and language detector, tagging module, learning agent and post-parsing filter module are configured to iteratively process the content component and modifications thereto until there are no further modifications or a threshold number of iterations are performed; and
an output module configured to transmit the modified content component.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for processing input media for provision to a text to speech engine comprising: a rules engine configured to maintain and update rules for processing the input media; a pre-parsing filter module configured to determine one or more metadata attributes using pre-parsing rules; a parsing filter module configured to identify content component from the input media using the parsing rules; a context and language detector configured to determine a default context and a default language; a learning agent configured to divide the content component into units of interest; a tagging module configured to iteratively assign tags to the units of interest using the tagging rules, wherein each tag is associated with a post-parsing rule; a post-parsing filter module configured to modify the content component by executing the post-parsing rules identified by the tags assigned to the phrases and strings. The context and language detector, tagging module, learning agent and post-parsing filter module are configured to iteratively process the content component and modifications thereto until there are no further modifications or a threshold number of iterations are performed.
-
Citations
29 Claims
-
1. A system for processing input media for provision to a text to speech engine comprising:
-
a rules engine configured to maintain and update rules for processing the input media, wherein the rules comprise pre-parsing rules, parsing rules, tagging rules, and post-parsing rules; a pre-parsing filter module configured to determine one or more metadata attributes using pre-parsing rules, wherein one metadata attribute is an application type; a parsing filter module configured to query the rules engine for parsing rules associated with the one or more metadata attributes and to identify a content component from the input media using the parsing rules; a context and language detector configured to determine a default context and a default language for at least part of the content component; a learning agent configured to divide the content component into units of interest; a tagging module configured to query the rules engine for tagging rules associated with the default context and the default language and to iteratively assign tags to the units of interest using the tagging rules, wherein each tag is associated with a post-parsing rule; a post-parsing filter module configured to modify the content component by executing the post-parsing rules identified by the tags assigned to the units of interest; wherein the context and language detector, tagging module, learning agent and post-parsing filter module are configured to iteratively process the content component and modifications thereto until there are no further modifications or a threshold number of iterations are performed; and an output module configured to transmit the modified content component. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A method for processing input media for provision to a text to speech engine comprising:
-
maintaining and updating rules for processing the input media, wherein the rules comprise pre-parsing rules, parsing rules, tagging rules, and post-parsing rules; determining one or more metadata attributes using pre-parsing rules, wherein one metadata attribute is an application type; identifying a content component from the input media using parsing rules associated with the one or more metadata attributes; determining, for at least part of the content component, a default context and a default language; dividing the content component into units of interest; iteratively assigning tags to the units of interest using the tagging rules associated with the default context and the default language, wherein each tag is associated with a post-parsing rule; modifying the content component by executing the post-parsing rules identified by the tags assigned to the phrases and strings; iteratively processing the content component and modifications thereto until there are no further modifications or a threshold number of iterations are performed; and outputting the modified content component. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A non-transitory computer-readable medium upon which a plurality of instructions are stored, the instructions for performing the steps of:
-
maintaining and updating rules for processing the input media, wherein the rules comprise pre-parsing rules, parsing rules, tagging rules, and post-parsing rules; determining one or more metadata attributes using pre-parsing rules, wherein one metadata attribute is an application type; identifying a content component from the input media using parsing rules associated with the one or more metadata attributes; determining, for at least part of the content component, a default context and a default language; dividing the content component into units of interest; iteratively assigning tags to the units of interest using the tagging rules associated with the default context and the default language, wherein each tag is associated with a post-parsing rule; modifying the content component by executing the post-parsing rules identified by the tags assigned to the phrases and strings; iteratively processing the content component and modifications thereto until there are no further modifications or a threshold number of iterations are performed; and outputting the modified content component.
-
Specification