
Method and system for fast, generic, online and offline, multi-source text analysis and visualization

  • US 7,792,816 B2
  • Filed: 01/31/2008
  • Issued: 09/07/2010
  • Est. Priority Date: 02/01/2007
  • Status: Expired due to Fees
First Claim

1. In a computer system having at least one user interface including at least one output device and at least one input device, a method comprising:

    a) receiving from a user through at least one input device an identification of at least one text source;

    b) from each said identified text source, retrieving at least one text passage;

    c) for each said retrieved text passage, parsing the said passage into words, identifying multi-word expressions in the said passage and applying a stemming algorithm to the said passage;

    d) for each word from the said text passages, determining a number of times the said word appears in the said passages; and

    e) causing to be displayed on an output device a predetermined number of words from the said text passages, wherein distances between the said predetermined number of words in a display on the said output device are determined at least in part by a word weight for each said displayed word and by a link weight for each pair of said displayed words, and wherein the word weight for each said displayed word is determined at least in part by a number of times the said word appears in the said passages; and

    wherein the link weight for each said pair of said displayed words is determined at least in part by the number of times each said word appears in the said passages and by a number of times the said word pair appears in a same window in the said passages; and

    wherein the method, further comprising receiving the said predetermined number from a user through at least one input device; and

    wherein the method further comprising

    f) receiving from a user an instruction to delete at least one word from the said display; and

    g) causing to be displayed on an output device the predetermined number of words from the said text passages without the at least one word which the said user instructed to be deleted;

    wherein distances between the said displayed words are determined at least in part by the word weight for each said displayed word and by the link weight for each pair of said displayed words, and wherein the word weight for each said displayed word is determined at least in part by a number of times the said word appears in the said passages; and

    wherein the link weight for each said pair of said displayed words is determined at least in part by the number of times each said word appears in the said passages and by a number of times the said word pair appears in a same window in the said passages.
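
The counting steps recited in c) through g) can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything named here is an assumption made for illustration: the functions `naive_stem`, `tokenize`, `analyze`, `link_weight` and `top_words`, the crude suffix-stripping stand-in for a stemming algorithm, the sliding window of four tokens, and the normalization used for the link weight (the claim only says the link weight depends on the individual word counts and the same-window pair count, without giving a formula). Multi-word expression detection and the on-screen layout that turns weights into distances are omitted.

```python
import re
from collections import Counter
from math import sqrt


def naive_stem(word):
    # Hypothetical stand-in for a real stemming algorithm: strips a few
    # common English suffixes when at least three letters remain.
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(passage):
    # Step c): parse the passage into lowercase words and stem each one.
    return [naive_stem(w) for w in re.findall(r"[a-z]+", passage.lower())]


def analyze(passages, window=4):
    # Step d): count how often each word appears, and how often each pair
    # of distinct words falls inside the same sliding window of tokens.
    # The window size of 4 is an assumed default, not taken from the claim.
    word_counts = Counter()
    pair_counts = Counter()
    for passage in passages:
        tokens = tokenize(passage)
        word_counts.update(tokens)
        for i, a in enumerate(tokens):
            for b in tokens[i + 1 : i + window]:
                if a != b:
                    pair_counts[tuple(sorted((a, b)))] += 1
    return word_counts, pair_counts


def link_weight(a, b, word_counts, pair_counts):
    # Assumed formula: same-window pair count normalized by the geometric
    # mean of the two word counts.
    pair = pair_counts[tuple(sorted((a, b)))]
    return pair / sqrt(word_counts[a] * word_counts[b])


def top_words(word_counts, n, deleted=()):
    # Steps e) and g): pick the n most frequent words, skipping any word
    # the user asked to delete, so the display is refilled up to n words.
    kept = [w for w, _ in word_counts.most_common() if w not in deleted]
    return kept[:n]


passages = ["cats chase dogs", "dogs chase cats and cats sleep"]
word_counts, pair_counts = analyze(passages)
shown = top_words(word_counts, 3)
shown_after_delete = top_words(word_counts, 3, deleted={"chase"})
```

In this sketch the word weight is simply the raw count in `word_counts`. A display layer could then place words with a high `link_weight` closer together, for example with a force-directed layout, but that choice is likewise an assumption rather than something the claim specifies.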
