Natural language processing system and method

  • US 9,152,623 B2
  • Filed: 11/04/2013
  • Issued: 10/06/2015
  • Est. Priority Date: 11/02/2012
  • Status: Active Grant
First Claim

1. A system for natural language processing comprising:

  • a processor configured to execute a natural language processing method, the method comprising,

    the processor receiving input data from one or more data sources, wherein the input data comprises one or more of plain text, and tokenized text;

    the processor tokenizing the text;

    the processor aggregating tokens into a three-level structure comprising a word level, a phrase level, and a clause level wherein,

    tokens are aggregated into words, words are aggregated into phrases, and phrases are aggregated into clauses;

    each element on a higher level comprises one or more elements of a lower level;

    tokens that are one or more of coordinating and subordinating words within a phrase are aggregated into one or more words that are separate from respective, coordinated or subordinated words, and the one or more of coordinating and subordinating words and their respective, coordinated or subordinated words are aggregated into one phrase;

    words that are one or more of coordinating and subordinating phrases within a clause are aggregated into one or more phrases that are separate from respective, coordinated or subordinated phrases, and the one or more of coordinating and subordinating phrases and their respective, coordinated or subordinated phrases are aggregated into one clause; and

    phrases that are one or more of coordinating and subordinating clauses are aggregated into one or more clauses that are separate from their respective coordinated or subordinated clauses;

    the processor determining syntactic connections between at least,

    every clause and its syntactically superior clause in the same sentence;

    every clause and its syntactically superior phrase, if they exist within the same sentence;

    every phrase and its syntactically superior phrase within the same clause; and

    every word and its syntactically superior word within the same phrase; and

    wherein,

    an element without its syntactically superior element becomes a root element;

    if one or more coordinated elements have a common syntactically superior element, the coordinated elements are connected to their respective coordinating element, and the coordinating element is connected to the element which is syntactically superior to the respective coordinated elements; and

    if one or more coordinated elements are all syntactically superior to an element, the syntactically subordinated element is connected to the element coordinating its respective syntactically superior elements; and

    the processor classifying each word, phrase and clause, wherein

    classification reflects a syntactic function of the word, the phrase and the clause; and

    phrases and clauses share a partially common set of possible syntactic functions comprising at least subject, object, complement and attribute function.
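
The structure recited in the claim can be illustrated with a small sketch. This is not the patented implementation. It is a minimal Python illustration under stated assumptions: the class `Element`, the helpers `connect`, `connect_coordinated`, `word` and `phrase`, and the labels "coordinator" and "predicate" are all hypothetical names chosen for the example (the claim itself names only the subject, object, complement and attribute functions). The sketch models one clause, "cats and dogs sleep", where the coordinating word "and" forms its own phrase, the coordinated phrases connect to it, and it connects to their common superior phrase.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Union


@dataclass(eq=False)
class Element:
    """One node on the word, phrase, or clause level (illustrative, not from the patent)."""
    level: str                                # "word", "phrase", or "clause"
    parts: List[Union["Element", str]]        # lower-level elements; raw tokens for words
    function: Optional[str] = None            # syntactic function label, e.g. "subject"
    superior: Optional["Element"] = field(default=None, repr=False)

    def is_root(self) -> bool:
        # An element without a syntactically superior element is a root element.
        return self.superior is None


def connect(subordinate: Element, superior: Element) -> None:
    # This sketch only links elements on the same level;
    # clause-to-phrase connections are left out for brevity.
    if subordinate.level != superior.level:
        raise ValueError("sketch only connects elements on the same level")
    subordinate.superior = superior


def connect_coordinated(coordinated: List[Element],
                        coordinator: Element,
                        superior: Element) -> None:
    # Coordinated elements attach to their coordinating element, and the
    # coordinating element attaches to their common superior element.
    for element in coordinated:
        connect(element, coordinator)
    connect(coordinator, superior)


def word(token: str) -> Element:
    # A word aggregates one or more tokens (one token here).
    return Element("word", [token])


def phrase(token: str, function: str) -> Element:
    # A phrase aggregates one or more words (one word here).
    return Element("phrase", [word(token)], function)


# Clause: "cats and dogs sleep"
cats = phrase("cats", "subject")
conj = phrase("and", "coordinator")    # hypothetical label
dogs = phrase("dogs", "subject")
sleep = phrase("sleep", "predicate")   # hypothetical label

connect_coordinated([cats, dogs], conj, sleep)

# The clause aggregates the coordinating phrase and the coordinated phrases.
clause = Element("clause", [cats, conj, dogs, sleep])
phrase_roots = [p for p in clause.parts if p.is_root()]
```

In this sketch `phrase_roots` contains only the `sleep` phrase, since every other phrase has a superior within the clause. The second coordination rule in the claim (a subordinated element whose superiors are all coordinated) would, under the same assumptions, be a single `connect(subordinate, coordinator)` call.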
