System and method for future-proofing devices using metaschema
Abstract
Disclosed are arrangements which seek to update schemas in a controlled manner so that a device (3710) or service can accurately parse documents conforming to a new version of a schema (3724, 3734) without the need for translation services or extra information within the document to provide guides for translation. One embodiment provides a “best-effort” attempt to process documents of an unknown schema, based on a defined similarity threshold, instead of rejecting such documents as being invalid. The arrangements disclosed seek to provide a satisfactory output from an input document that conforms with an unexpected schema. A method for constraining evolution of a schema associated with a markup language includes the steps of defining the schema and defining a metaschema associated with the schema. The metaschema provides rules for valid changes to the schema.
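The abstract describes the metaschema only as a set of rules for valid changes to a schema and does not give a rule format. As a rough illustration of the idea, the Python sketch below treats a schema as a mapping from element names to attribute names and checks a proposed new version against a few rules. The `Metaschema` fields, the `violations` function, and the rule types themselves are assumptions made for this example, not the format used in the specification.

```python
from dataclasses import dataclass

# Assumed schema model: element name -> set of attribute names.
Schema = dict[str, set[str]]


@dataclass(frozen=True)
class Metaschema:
    """Hypothetical rules constraining how a schema may evolve."""
    may_add_elements: bool = True
    may_add_attributes: bool = True
    protected_elements: frozenset[str] = frozenset()  # may never be removed


def violations(old: Schema, new: Schema, meta: Metaschema) -> list[str]:
    """Return the rules that the change from `old` to `new` breaks."""
    problems = []
    for element in old.keys() - new.keys():
        if element in meta.protected_elements:
            problems.append(f"removed protected element <{element}>")
    for element in new.keys() - old.keys():
        if not meta.may_add_elements:
            problems.append(f"added element <{element}>")
    for element in old.keys() & new.keys():
        added = new[element] - old[element]
        if added and not meta.may_add_attributes:
            problems.append(f"added attributes to <{element}>: {sorted(added)}")
    return sorted(problems)


meta = Metaschema(protected_elements=frozenset({"title"}))
v1 = {"doc": {"version"}, "title": set()}
v2 = {"doc": {"version"}, "title": set(), "author": set()}  # adds an element
v3 = {"doc": {"version"}, "author": set()}                  # drops <title>

ok_change = violations(v1, v2, meta)
bad_change = violations(v1, v3, meta)
```

Under these assumed rules, the change from `v1` to `v2` is a valid evolution, while the change from `v1` to `v3` is not, because it removes an element the metaschema protects.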
16 Claims
1. A method for determining whether a document is acceptable or rejected for use by an apparatus, said apparatus comprising a parser for parsing said document, wherein the document is stored on a computer-readable storage medium and includes plural elements and attributes arranged according to a schema of elements and attributes, said method comprising the steps of:
using a computer to determine a version of the document;
compiling a metaschema based on the parser and the determined version of the document, said metaschema providing a set of rules for a valid change of the schema, each of the set of rules comprising a value and an expression of at least one of an element and an attribute of the schema;
using a computer to execute steps of:
(a) determining a similarity score for said document by comparing at least one of the elements and the attributes of said document to the value and the expression of at least one of an element and an attribute of the schema in the metaschema; and
(b) mapping the document to the schema to produce a list of data values for information requested from the compiled metaschema; and
using a computer to use the similarity score to accept or reject further use of the document, wherein the list of data values is used to extract information from the mapped document if the document is accepted for further use.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 11, 12, 13, 14.
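The steps recited in claim 1 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation disclosed in the specification: the `Rule` layout (an expression, a numeric value used as a weight, and a field name), the `version` attribute, the example rule table, and the 0.75 threshold are all hypothetical. Only the standard-library `xml.etree.ElementTree` parser is used.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    expression: str  # "path" for element text, "path/@attr" for an attribute
    value: float     # assumed meaning: weight contributed to the score
    field: str       # name under which the extracted data value is reported


def determine_version(root):
    # Assumption: the version is carried in a "version" attribute on the root.
    return root.get("version", "unknown")


def compile_metaschema(parser_version, doc_version):
    # Hypothetical rule table for a parser built against version 1.0.
    rules = [Rule("title", 2.0, "title"),
             Rule("author", 1.0, "author"),
             Rule("page/@size", 1.0, "page_size")]
    if doc_version != parser_version:
        # Assumed valid change: other versions may move author to an attribute.
        rules.append(Rule("./@creator", 1.0, "author"))
    return rules


def lookup(root, expression):
    path, _, attr = expression.partition("/@")
    node = root if path in ("", ".") else root.find(path)
    if node is None:
        return None
    return node.get(attr) if attr else (node.text or "").strip()


def map_document(root, rules):
    # Step (b): produce the data values requested by the metaschema.
    values = {}
    for rule in rules:
        found = lookup(root, rule.expression)
        if found and rule.field not in values:
            values[rule.field] = found
    return values


def similarity_score(root, rules):
    # Step (a): weighted fraction of requested fields the document satisfies.
    weights = {}
    for rule in rules:
        weights[rule.field] = max(weights.get(rule.field, 0.0), rule.value)
    matched = map_document(root, rules)
    return sum(weights[f] for f in matched) / sum(weights.values())


def process(xml_text, parser_version="1.0", threshold=0.75):
    root = ET.fromstring(xml_text)
    rules = compile_metaschema(parser_version, determine_version(root))
    score = similarity_score(root, rules)
    values = map_document(root, rules) if score >= threshold else None
    return values, score


accepted, score_a = process(
    '<doc version="1.1" creator="Ann"><title>Report</title><page size="A4"/></doc>')
rejected, score_r = process('<doc version="2.0"><heading>Report</heading></doc>')
```

In this sketch the first document comes from a schema version the parser was not built for, yet every requested field can still be located, so it is accepted and its data values are returned. The second document matches none of the rules and is rejected, in which case no values are extracted.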
9. A device having a processor and a memory, wherein the memory stores a computer-executable program executable by the processor for controlling the device to determine whether a document is acceptable or rejected for use by an apparatus, said apparatus comprising a parser for parsing said document, wherein the document is stored in the memory and includes plural elements and attributes arranged according to a schema of elements and attributes, and wherein the computer-executable program stored in the memory comprises:
computer-executable program code for determining a version of the document;
computer-executable program code for compiling a metaschema based on the parser and the determined version of the document, said metaschema providing a set of rules for a valid change of the schema, each of the set of rules comprising a value and an expression of at least one of an element and an attribute of the schema;
computer-executable program code for executing:
(a) computer-executable code for determining a similarity score for said document by comparing at least one of the elements and the attributes of said document to the value and the expression of at least one of an element and an attribute of the schema in the metaschema; and
(b) computer-executable code for mapping the document to the schema to produce a list of data values for information requested from the compiled metaschema; and
computer-executable program code for using the similarity score to accept or reject further use of the document, wherein the list of data values is used to extract information from the mapped document if the document is accepted for further use.
10. A computer readable storage medium having a computer executable program recorded therein for causing a computer to determine whether a document is acceptable or rejected for use by an apparatus, said apparatus comprising a parser for parsing said document, wherein the document is stored in the computer readable storage medium and includes plural elements and attributes arranged according to a schema of elements and attributes, said computer executable program comprising:
code for determining a version of a document;
code for compiling a metaschema based on the parser and the determined version of the document, said metaschema providing a set of rules for a valid change of the schema, each of the set of rules comprising a value and an expression of at least one of an element and an attribute of the schema;
code for executing:
(a) code for determining a similarity score for said document by comparing at least one of the elements and the attributes of said document to the value and the expression of at least one of an element and an attribute of the schema in the metaschema; and
(b) code for mapping the document to the schema to produce a list of data values for information requested from the compiled metaschema; and
code for using said similarity score to accept or reject further use of said document, wherein the list of data values is used to extract information from the mapped document if the document is accepted for further use.
15. A method of determining whether one or more documents are acceptable or rejected for use by an apparatus, said apparatus comprising a parser for parsing said document, wherein the documents are stored on a computer-readable storage medium and each include plural elements and attributes arranged according to a schema of elements and attributes, said method comprising the steps of:
using a computer to determine a version of the document;
using a computer to compile a metaschema based on the parser and the determined version of the document, said metaschema providing a set of rules for a valid change of the schema, each of the set of rules comprising a value and an expression of at least one of an element and an attribute of the schema;
using a computer to:
(a) determine a similarity score for each said document by comparing at least one of the elements and the attributes of said document to the value and the expression of at least one of an element and an attribute of the schema in the metaschema; and
(b) map the document to the schema to produce a list of data values for information requested from the compiled metaschema; and
using a computer to use the similarity score to accept or reject further use of the document, wherein the list of data values is used to extract information from the mapped document if the document is accepted for further use.
Dependent claim: 16.
Specification