×

Conceptual world representation natural language understanding system and method

  • US 8,442,814 B2
  • Filed: 03/01/2011
  • Issued: 05/14/2013
  • Est. Priority Date: 07/12/2002
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of processing free text documents for indexing, said method comprising:

  • typographically segmenting a free text document using a computer-based ontology management system, said typographically segmenting comprising;

    delimiting said free text document into words, sentences, titles, list items and paragraphs based on character patterns in said free text document;

    functionally segmenting said free text document using the computer-based ontology management system, said functionally segmenting comprising;

    grouping words into multi-word terms, segmenting said sentences into clause-phrase segments, and grouping words into noun phrases, wherein said grouping words into multi-word terms is accomplished by identifying at least two adjacent words;

    re-writing at least one of said at least two adjacent words to generate a pairing of at least two adjacent words containing at least one re-written word;

    searching a lexicon of terms for said pairing of at least two adjacent words containing at least one re-written word;

    if said pairing of at least two adjacent words containing at least one re-written word is found in said lexicon, replacing said pairing of at least two adjacent words with said pairing of at least two adjacent words containing at least one re-written word; and

    tagging said pairing of at least two adjacent words containing at least one re-written word as a multi-word term.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×