METHOD FOR COMPUTING SIMILARITY BETWEEN TEXT SPANS USING FACTORED WORD SEQUENCE KERNELS

  • US 20090175545A1
  • Filed: 01/04/2008
  • Published: 07/09/2009
  • Est. Priority Date: 01/04/2008
  • Status: Active Grant
First Claim

1. A method of comparing spans of text comprising:

  • computing a similarity measure between a first sequence of symbols representing a first text span and a second sequence of symbols representing a second text span as a function of the occurrences of optionally noncontiguous subsequences of symbols shared by the two sequences of symbols, wherein each of the symbols comprises at least one consecutive word, the words being enriched with linguistic information allowing them to be defined according to a set of linguistic factors, whereby pairs of symbols in the first and second sequences forming a shared subsequence of symbols are each matched according to at least one of the factors and wherein all pairs of matching symbols in a shared subsequence need not match according to the same factor.
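
To make the claim concrete, the following is a minimal sketch of one way such a similarity measure could be computed. It is not taken from the patent text: it reuses the well-known dynamic-programming recursion for gap-weighted subsequence kernels and swaps the exact symbol test for a weighted sum of per-factor matches. The names `token_match` and `factored_kernel`, the `decay` gap penalty, the `weights` vector, and the choice of three factors (surface form, lemma, part of speech) are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

# Hypothetical token layout: one value per linguistic factor,
# here assumed to be (surface form, lemma, part of speech).
Token = Tuple[str, ...]


def token_match(x: Token, y: Token, weights: Sequence[float]) -> float:
    """Sum the weights of the factors on which the two tokens agree."""
    return sum(w for w, fx, fy in zip(weights, x, y) if fx == fy)


def factored_kernel(s: List[Token], t: List[Token], n: int,
                    decay: float, weights: Sequence[float]) -> float:
    """Gap-weighted kernel over shared subsequences of length n.

    Each pair of index tuples (one in s, one in t) contributes
    decay ** (span in s + span in t) times the product of token_match
    over the aligned pairs. Because token_match is a sum over factors,
    expanding that product covers every combination of factors, so
    different pairs in one subsequence may match on different factors.
    """
    m, p = len(s), len(t)
    match = [[token_match(x, y, weights) for y in t] for x in s]

    # kp[a][b] holds the auxiliary kernel for prefixes s[:a], t[:b].
    # For subsequence length 0 it is 1 everywhere.
    kp = [[1.0] * (p + 1) for _ in range(m + 1)]
    for _ in range(1, n):
        new_kp = [[0.0] * (p + 1) for _ in range(m + 1)]
        for a in range(1, m + 1):
            kpp = 0.0  # running sum along t for the current row
            for b in range(1, p + 1):
                kpp = decay * kpp + decay ** 2 * kp[a - 1][b - 1] * match[a - 1][b - 1]
                new_kp[a][b] = decay * new_kp[a - 1][b] + kpp
        kp = new_kp

    # Close each subsequence with its final aligned pair of tokens.
    return sum(decay ** 2 * kp[a - 1][b - 1] * match[a - 1][b - 1]
               for a in range(1, m + 1) for b in range(1, p + 1))


first = [("dogs", "dog", "NOUN"), ("ran", "run", "VERB")]
second = [("dog", "dog", "NOUN"), ("quickly", "quickly", "ADV"),
          ("sprinted", "sprint", "VERB")]

# All three factors weighted equally (an arbitrary choice).
similarity = factored_kernel(first, second, n=2, decay=0.5, weights=(1.0, 1.0, 1.0))
# Surface forms only: no shared subsequence of length 2 remains.
surface_only = factored_kernel(first, second, n=2, decay=0.5, weights=(1.0, 0.0, 0.0))
print(similarity, surface_only)
```

In this example the two spans share no surface word, yet the similarity is nonzero: "dogs"/"dog" agree on lemma and part of speech, while "ran"/"sprinted" agree on part of speech only, and the intervening "quickly" is skipped at the cost of one extra `decay` factor. Setting a weight to zero switches the corresponding factor off.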
