Analyzing web site for translation
First Claim
1. A machine implemented method for providing statistics characterizing content for language translation, comprising the steps of:
- obtaining content in a first language from an Internet source by crawling the Internet source via a publicly accessible network path;
automatically identifying a portion of the content in the first language that is not yet translated into a second language; and
generating statistics based only on the portion of the content in the first language identified automatically that is not yet translated in order to estimate the amount of work involved in language translation from the first language to the second language.
4 Assignments
0 Petitions
Accused Products
Abstract
A system, method and computer readable medium for synchronizing web content is disclosed. The method includes retrieving a first web content in a first language from a web site, the first web content corresponding to a second web content wherein the second web content is a translation in a second language of the first web content. The method further includes dividing the first web content into a plurality of translatable components and generating a unique identifier for each of the plurality of translatable components. The method further includes matching each of the plurality of translatable components to a plurality of translated components of the second web content using the unique identifier of each of the plurality of translatable components. If a translatable component is not matched to a translated component, the method further includes designating the translatable component for translation into the second language.
-
Citations
9 Claims
-
1. A machine implemented method for providing statistics characterizing content for language translation, comprising the steps of:
-
obtaining content in a first language from an Internet source by crawling the Internet source via a publicly accessible network path; automatically identifying a portion of the content in the first language that is not yet translated into a second language; and generating statistics based only on the portion of the content in the first language identified automatically that is not yet translated in order to estimate the amount of work involved in language translation from the first language to the second language. - View Dependent Claims (2, 3, 4)
-
-
5. A system for providing statistics characterizing content for language translation, comprising:
-
a content retrieval mechanism configured for obtaining content in a first language from an Internet source by crawling the Internet source via a publicly accessible network path; a content analysis mechanism configured for automatically identifying a portion of the content in the first language that is not yet translated into a second language; and a content sizing mechanism configured for generating statistics based only on the portion of the content in the first language identified automatically that is not yet translated in order to estimate the amount of work involved in language translation from the first language to the second language. - View Dependent Claims (6, 7, 8, 9)
-
Specification