Methods for automatic generation of parallel corpora

  • US 10,552,548 B2
  • Filed: 01/30/2018
  • Issued: 02/04/2020
  • Est. Priority Date: 02/28/2014
  • Status: Active Grant
First Claim

1. A computer implemented method comprising:

    obtaining a first item listing that is posted by a first seller in a first language and that is related to selling a particular item that is a product, a service, or a combination of a product and service;

    obtaining a second item listing that is posted by a second seller in a second language and that is also related to selling the particular item;

    aligning the first item listing with the second item listing in response to the first item listing and the second item listing both being related to selling the same item;

    identifying a first organizational structure with respect to first hierarchal relationships between first hypertext markup language (HTML) tags of first HTML code of a first description of the first item listing;

    identifying a second organizational structure with respect to second hierarchal relationships between second HTML tags of second HTML code of a second description of the second item listing;

    measuring, based on the aligning of the first item listing with the second item listing, an organizational structural similarity of the first HTML code with respect to the second HTML code by comparing the first organizational structure against the second organizational structure, the comparing including comparing the first hierarchal relationships against the second hierarchal relationships by comparing first nodes and first edges of a first tree that represents the first hierarchal relationships against second nodes and second edges of a second tree that represents the second hierarchal relationships; and

    in response to the first HTML code and the second HTML code being determined as being organizationally structurally similar based on the measured organizational structural similarity, forming the first description into a first sentence in the first language as a translation of the second description into the first language.
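
The structural comparison recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not name a similarity metric, so the multiset Jaccard score over tag nodes and parent-to-child edges, the equal weighting of the two scores, and the 0.8 threshold are all assumptions chosen for illustration. The names TagTreeBuilder, structure, structural_similarity, and is_structurally_similar are likewise hypothetical. Only the Python standard library is used.

```python
from collections import Counter
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagTreeBuilder(HTMLParser):
    """Records tag nodes and parent->child edges of an HTML fragment (text is ignored)."""

    def __init__(self):
        super().__init__()
        self.stack = ["#root"]   # synthetic root so top-level tags have a parent
        self.nodes = Counter()   # tag name -> number of occurrences
        self.edges = Counter()   # (parent tag, child tag) -> number of occurrences

    def handle_starttag(self, tag, attrs):
        self.nodes[tag] += 1
        self.edges[(self.stack[-1], tag)] += 1
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the nearest matching open tag; ignore stray closing tags.
        if tag in self.stack[1:]:
            while self.stack.pop() != tag:
                pass


def structure(html):
    """Return (nodes, edges) multisets describing the tag tree of `html`."""
    builder = TagTreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.nodes, builder.edges


def multiset_jaccard(a, b):
    """Jaccard similarity of two Counters; two empty multisets count as identical."""
    union = sum((a | b).values())
    if union == 0:
        return 1.0
    return sum((a & b).values()) / union


def structural_similarity(html_a, html_b):
    """Assumed metric: average of node-level and edge-level Jaccard scores."""
    nodes_a, edges_a = structure(html_a)
    nodes_b, edges_b = structure(html_b)
    return 0.5 * multiset_jaccard(nodes_a, nodes_b) + 0.5 * multiset_jaccard(edges_a, edges_b)


def is_structurally_similar(html_a, html_b, threshold=0.8):
    """The threshold is an illustrative assumption, not a value from the claim."""
    return structural_similarity(html_a, html_b) >= threshold


# Two descriptions of the same item, already aligned, in different languages.
english = "<div><h1>Red shoes</h1><ul><li>Size 9</li><li>Leather</li></ul></div>"
spanish = "<div><h1>Zapatos rojos</h1><ul><li>Talla 9</li><li>Cuero</li></ul></div>"
unrelated = "<p>Zapatos rojos de cuero</p>"

print(structural_similarity(english, spanish))
print(structural_similarity(english, unrelated))
```

Because only tag names and nesting are compared, the score is independent of the language of the text, which is what lets a structurally matching pair of descriptions be treated as candidate translations of each other.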

  • 2 Assignments