System and method for using XML to normalize documents
Abstract
A system, method, and processor readable medium for normalizing documents using extensible markup language (XML). The system may determine a type of object repository storing at least one object. The object may include metadata. The system may then identify the object stored in the object repository. At least one portion of the object may be extracted from the repository, wherein the portion is extracted in XML format. Preferably, some of the metadata is preserved. The preserved metadata may include at least one of author, title, subject, date created, date modified, list of modifiers, and link list information. The portion may then be transmitted to a processor, which may perform one or more processes on it. A mapping may be performed that maps at least one field in the object to a field designation identifier. The processor may include at least one of a full-text engine, a metrics engine, and a taxonomy engine.
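The extraction and field mapping described in the abstract can be illustrated with a minimal sketch. It assumes an object is available as a plain Python dictionary; the source field names (`doc_author`, `doc_title`, and so on), the `FIELD_MAP` table, and the `normalize_to_xml` function are hypothetical and are not taken from the patent.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from repository-specific field names to
# field designation identifiers (illustrative names only).
FIELD_MAP = {
    "doc_author": "author",
    "doc_title": "title",
    "created_on": "date_created",
    "modified_on": "date_modified",
}


def normalize_to_xml(obj: dict, field_map: dict = FIELD_MAP) -> str:
    """Return an XML string holding the object's body and its mapped metadata."""
    root = ET.Element("document")
    meta = ET.SubElement(root, "metadata")
    # Preserve only the metadata fields that the object actually carries.
    for source_field, designation in field_map.items():
        if source_field in obj:
            ET.SubElement(meta, designation).text = str(obj[source_field])
    ET.SubElement(root, "body").text = obj.get("body", "")
    return ET.tostring(root, encoding="unicode")


sample = {"doc_author": "A. Writer", "doc_title": "Q3 Report", "body": "Revenue rose."}
xml_text = normalize_to_xml(sample)
print(xml_text)
```

Whatever the source repository calls its fields, the downstream processor in this sketch only ever sees the designation identifiers, which is the sense in which the output is normalized.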
20 Claims
1. A method for using extensible markup language to normalize documents, the method comprising the steps of:
determining a type of object repository storing at least one object, the object comprising metadata;
identifying the at least one object stored in the at least one object repository;
extracting at least one portion of the at least one object, wherein the at least one portion is extracted in extensible markup language (XML) format; and
transmitting the at least one portion to a processor; and
processing the at least one portion.
Dependent claims: 2, 3, 4, 5, 10
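The five steps of claim 1 can be sketched as a small pipeline. This is an illustration under stated assumptions, not the patented implementation: repositories are modeled as in-memory dictionaries with a hypothetical `kind` tag, the two extractors and their field names are invented for the example, "transmitting" is reduced to a function call, and a simple word counter stands in for a metrics engine.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Dict, List


def determine_repository_type(repo: dict) -> str:
    """Step 1: read the (hypothetical) type tag of the repository."""
    return repo["kind"]


def identify_objects(repo: dict) -> List[dict]:
    """Step 2: list the objects stored in the repository."""
    return list(repo["objects"])


def _to_xml(source: str, title: str, body: str) -> str:
    root = ET.Element("document", {"source": source})
    ET.SubElement(root, "title").text = title
    ET.SubElement(root, "body").text = body
    return ET.tostring(root, encoding="unicode")


# Step 3: one extractor per repository type, each emitting the same XML shape.
EXTRACTORS: Dict[str, Callable[[dict], str]] = {
    "filesystem": lambda o: _to_xml("filesystem", o["name"], o["content"]),
    "mail": lambda o: _to_xml("mail", o["subject"], o["text"]),
}


def word_count_processor(xml_text: str) -> int:
    """Step 5: a stand-in processor that counts words in the body."""
    body = ET.fromstring(xml_text).findtext("body", default="")
    return len(body.split())


def run_pipeline(repo: dict, processor: Callable[[str], int]) -> List[int]:
    extractor = EXTRACTORS[determine_repository_type(repo)]
    results = []
    for obj in identify_objects(repo):
        portion = extractor(obj)            # extract the portion as XML
        results.append(processor(portion))  # step 4: hand it to the processor
    return results


mail_repo = {"kind": "mail", "objects": [{"subject": "Hello", "text": "three short words"}]}
file_repo = {"kind": "filesystem", "objects": [{"name": "a.txt", "content": "one two"}]}
mail_counts = run_pipeline(mail_repo, word_count_processor)
file_counts = run_pipeline(file_repo, word_count_processor)
```

Because both extractors emit the same XML shape, the processor needs no knowledge of which repository type an object came from.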
6. A system for using extensible markup language to normalize documents, the system comprising:
a determining module that determines a type of object repository storing at least one object, the object comprising metadata;
an identifying module that identifies the at least one object stored in the at least one object repository;
an extracting module that extracts at least one portion of the at least one object, wherein the at least one portion is extracted in extensible markup language (XML) format; and
a transmitting module that transmits the at least one portion to a processor; and
a processing module that processes the at least one portion.
Dependent claims: 7, 8, 9
11. A system for using extensible markup language to normalize documents, the system comprising:
determining means for determining a type of object repository storing at least one object, the object comprising metadata;
identifying means for identifying the at least one object stored in the at least one object repository;
extracting means for extracting at least one portion of the at least one object, wherein the at least one portion is extracted in extensible markup language (XML) format; and
transmitting means for transmitting the at least one portion to a processor; and
processing means for processing the at least one portion.
Dependent claims: 12, 13, 14, 15
16. A processor readable medium comprising processor readable code for causing a processor to use extensible markup language to normalize documents, the medium comprising:
determining code that causes a processor to determine a type of object repository storing at least one object, the object comprising metadata;
identifying code that causes a processor to identify the at least one object stored in the at least one object repository;
extracting code that causes a processor to extract at least one portion of the at least one object, wherein the at least one portion is extracted in extensible markup language (XML) format;
transmitting code that causes a processor to transmit the at least one portion to a processor; and
processing code that causes a processor to process the at least one portion.
Dependent claims: 17, 18, 19, 20
Specification