×

Header-token driven automatic text segmentation

  • US 9,529,862 B2
  • Filed: 05/28/2015
  • Issued: 12/27/2016
  • Est. Priority Date: 12/28/2006
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising:

  • a processor-implemented segmentation module configured to;

    receive data from a client machine, the data comprising a product title and a product description;

    identify a first token in the product title;

    receive a token probability value associated with the first token;

    assign a value to the first token, the value indicating that, one of;

    the first token also occurs in the product description, a lexical association exists between the first token and a second token in the product description, andthe lexical association does not exist and the first token is absent from the product description;

    compute a relevance value of a segmented group of tokens that occur in the product description and include the first token with the assigned value without requiring previously defined data tagging of the data beforehand of an unstructured text, the relevance value of the segmented group computed based on the value assigned to the first token; and

    determine and store in memory an indication that the segmented group of tokens is a most relevant segmented group of tokens in the product description;

    wherein the assigning of the value to the first token includes;

    initially assigning and storing a default value that indicates the lexical association does not exist and the first token is absent from the product description; and

    overwriting the stored initially assigned default value based on the first token occurring in the product description.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×