×

Method, apparatus, and storage medium for text information processing

  • US 10,262,059 B2
  • Filed: 06/06/2016
  • Issued: 04/16/2019
  • Est. Priority Date: 03/14/2014
  • Status: Active Grant
First Claim
Patent Images

1. A text information processing method, applied to a terminal, the terminal comprising one or more processors, a memory, and program instructions stored in the memory, the program instructions being executed by the one or more processors, and the method comprising:

  • performing word segmentation on a target text according to a preset fixed word segmentation policy, to obtain a word segmentation result;

    comparing the word segmentation result with a preset word segmentation list, and obtaining a word, which is not in the preset word segmentation list, as a new word;

    adding the new word to the preset word segmentation list, to obtain a test word segmentation list;

    classifying a test text according to the preset word segmentation list, to obtain a first text, and classifying the test text according to the test word segmentation list, to obtain a second text;

    calculating classification accuracy of the first text and classification accuracy of the second text;

    comparing the classification accuracy of the first text with the classification accuracy of the second text, and determining a target new word from the new word according to a comparison result;

    adding the target new word to the preset word segmentation list, to obtain a target preset word segmentation list; and

    classifying the target text according to the target preset word segmentation list,wherein the classifying a test text according to the preset word segmentation list, to obtain a first text, and classifying the test text according to the test word segmentation list, to obtain a second text comprises;

    classifying the test text according to a preset classification algorithm, to obtain the first text, wherein the preset classification algorithm is associated with the preset word segmentation list; and

    classifying the test text according to the preset classification algorithm, to obtain the second text, wherein the preset classification algorithm is associated with the test word segmentation list; and

    the classifying the target text according to the target preset word segmentation list comprises;

    calibrating the preset classification algorithm according to the target preset word segmentation list, and classifying the target text according to the calibrated preset classification algorithm.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×