
Method and apparatus for normalizing and converting structured content

  • US 7,680,867 B2
  • Filed: 01/10/2006
  • Issued: 03/16/2010
  • Est. Priority Date: 06/26/2000
  • Status: Active Grant
First Claim

1. A method for use in converting content of electronic data from a source form to a target form, said electronic data having a machine format and an informational content independent of said machine format and any machine instructions, said method comprising the steps of:

  • defining, by utilizing a computer, a transformation matrix for use by a machine tool involving;

    providing a set of source content elements reflecting a source environment, wherein said source content elements include human readable informational content entered by one or more human users in a form free from compliance with any device format, said source content elements reflecting inconsistencies of linguistics, at least including different terms to identify same subject matter, and syntax, at least including different ordering of terms;

    providing a set of normalized content elements that are amenable to transformation to the target form;

    establishing a normalization structure for normalizing said set of source content elements to said set of normalized content elements with respect to linguistics and syntax, wherein the source content elements correspond to a single normalized content element, and wherein said normalization structure is based on a knowledge base developed from information about said set of source content elements, the establishing the normalization structure comprises utilizing grammar rules to identify one or more attributes of at least a subset of said set of source content elements and utilizing linguistics rules to identify attributes or attribute values of said source content elements that are expressed in a plurality of forms;

    defining a set of rules for converting said normalized content elements to target content elements;

    receiving an item of electronic data having a machine format and information content including at least one source content element and extracting said information content using said machine format;

    using a first operating of said machine tool to apply said transformation matrix by;

    identifying a first source content element under consideration;

    applying said normalization structure to said first source content element to identify a first normalized content element; and

    using said set of rules with respect to said first normalized content element to convert said first source content element to said target form, assisting in applying said normalization structure by associating contextual information with said normalized content elements, wherein said associating contextual information comprises providing tags for schematizing source information; and

    providing, by using second operating of said machine tool, an output including said first source content element converted to said target form.
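The steps recited in claim 1 (normalizing free-form source elements with respect to linguistics and syntax, tagging them with attributes, then applying conversion rules to reach a target form) can be illustrated with a minimal sketch. It is not the patented implementation: the `TERM_MAP` and `ATTRIBUTE_OF` tables, the `normalize` and `to_target` functions, the sample product descriptions, and the `key=value` target layout are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical knowledge base (assumption): different terms that identify
# the same subject matter are mapped to one normalized term.
TERM_MAP = {"oz": "ounce", "ounces": "ounce", "mug": "cup"}

# Hypothetical schema (assumption): normalized term -> attribute tag.
ATTRIBUTE_OF = {
    "ounce": "unit",
    "cup": "product",
    "coffee": "use",
    "ceramic": "material",
}

# Split free-form text into numbers and lowercase words, ignoring punctuation.
TOKEN = re.compile(r"\d+(?:\.\d+)?|[a-z]+")


def normalize(source: str) -> dict:
    """Map a free-form description to tagged, order-independent attributes."""
    tagged = {}
    for token in TOKEN.findall(source.lower()):
        if token[0].isdigit():
            tagged["quantity"] = token
            continue
        term = TERM_MAP.get(token, token)  # linguistic normalization
        tag = ATTRIBUTE_OF.get(term)       # contextual tag for the term
        if tag:
            tagged[tag] = term
    return tagged


def to_target(normalized: dict) -> str:
    """Conversion rule (assumption): emit attributes in a fixed target order."""
    order = ["product", "use", "material", "quantity", "unit"]
    return "|".join(f"{key}={normalized[key]}" for key in order if key in normalized)


# Two source elements that differ in terminology and in term ordering.
a = to_target(normalize("8 oz. ceramic coffee cup"))
b = to_target(normalize("Mug, coffee, ceramic - 8OZ"))
print(a)
print(a == b)
```

Because the attributes are stored by tag rather than by position, differences in word order disappear at the normalization stage, and the conversion rule only has to deal with a single normalized form.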
