×

Identifying documents which form translated pairs, within a document collection

  • US 20070033001A1
  • Filed: 08/03/2005
  • Published: 02/08/2007
  • Est. Priority Date: 08/03/2005
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • obtaining a group of documents;

    determining reduced size versions of said documents; and

    comparing said reduced size versions, to determine documents that represent similar information; and

    using said documents that represent similar information for training for a text-to-text application.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×