
Method and system for generating lexicon of cooccurrence relations in natural language

  • US 4,942,526 A
  • Filed: 10/24/1986
  • Issued: 07/17/1990
  • Est. Priority Date: 10/25/1985
  • Status: Expired due to Fees
First Claim

1. A method, using a computer including a processor and a memory, of generating cooccurrence relation information indicating whether a sequence of words in a given sentence described in a natural language is semantically correct or not, said method comprising the steps of:

    (a) defining categories of sentences on the basis of the types of documents in which the sentences appear;

    (b) defining fields of sentences on the basis of the subject matters of the sentences;

    (c) preparing a text corpus by collecting input textual sentences belonging to the same category or the same field as the given sentence;

    (d) preparing a cooccurrence relation table containing grammar or a set of grammatical rules for analyzing the textual sentences of the text corpus to permit determining a cooccurrence relation between words in the textual sentences;

    (e) determining a hypothesized cooccurrence relation between words in the sequence of words in the given sentence on the basis of a cooccurrence relation from said cooccurrence relation table, the hypothesized cooccurrence relation indicating a particular possible cooccurrence relation between words in the given sentence;

    (f) deriving an actual cooccurrence relation between words in the sequence of words in the given sentence from the determined hypothesized cooccurrence relation;

    (g) determining whether the actual cooccurrence relation exceeds a predetermined threshold condition for a valid cooccurrence relation; and

    (h) when the actual cooccurrence relation exceeds the predetermined threshold condition, outputting information indicating the actual cooccurrence relation as a valid cooccurrence relation.
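
As an informal illustration only, steps (c) through (h) can be sketched in a few lines of Python. This is not the patented implementation. The claim relies on a table of grammatical rules in step (d), whereas the sketch below substitutes a simple word-window pairing as a stand-in, and it treats the raw corpus count of a pair as the "actual" cooccurrence relation. The function names, the `window` parameter, and the sample corpus are all hypothetical.

```python
from collections import Counter


def hypothesize_pairs(sentence, window=1):
    """Stand-in for step (e): propose ordered word pairs that lie within
    `window` tokens of each other. The claim uses grammatical rules here;
    a token window is an assumption made for illustration."""
    words = sentence.lower().split()
    pairs = []
    for i, word in enumerate(words):
        for j in range(i + 1, min(i + 1 + window, len(words))):
            pairs.append((word, words[j]))
    return pairs


def build_counts(corpus, window=1):
    """Stand-in for steps (c)-(d): count every hypothesized pair over a
    corpus assumed to share the category or field of the given sentence."""
    counts = Counter()
    for sentence in corpus:
        counts.update(hypothesize_pairs(sentence, window))
    return counts


def valid_relations(sentence, counts, threshold=1, window=1):
    """Stand-in for steps (e)-(h): keep the pairs of the given sentence
    whose corpus count exceeds the predetermined threshold."""
    return [
        (pair, counts[pair])
        for pair in hypothesize_pairs(sentence, window)
        if counts[pair] > threshold
    ]


# Hypothetical same-field corpus (printer documentation).
corpus = [
    "the printer prints the page",
    "the printer prints a report",
    "the printer jams",
]
counts = build_counts(corpus)
result = valid_relations("the printer prints", counts, threshold=1)
print(result)
```

With this sample corpus, both adjacent pairs of the given sentence occur more than once, so both are reported as valid. A realistic system would replace the window pairing with syntactic analysis and would likely normalize counts by corpus size.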
