Indexing and searching of information including handler chaining
First Claim
1. A computer readable medium including executable instructions which, when executed, manage information by:
- creating a plurality of information sources, each information source comprising a metadata definition document (MDD) having a uniform representation of metadata and including full-text search data (FTSData), wherein each information source can be linked to an underlying information source having a representative file format and used to provide additional content for use in searching and indexing operations;
using a schema to define the MDD for each information source, the schema including a FTSData element to specify FTSData, asset and content elements to identify a linked underlying information source, and a queries element to identify query to underlying information source mappings;
calling an intermediate handler to extract metadata and FTSData from the plurality of information sources, the intermediate handler to return the metadata and FTSData from each information source based in part on operating modes including;
a first operating mode to append FTSData of each information source with FTSData of a corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of linked underlying information sources for conflicts of first metadata types;
a second operating mode to append FTSData of each information source with FTSData of the corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of the information sources for conflicts of the first metadata types;
a third operating mode to return FTSData of each information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of the linked underlying information sources for conflicts of the first metadata types and second metadata types; and
,a fourth operating mode to return FTSData of each linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of the information sources for conflicts of the first metadata types and the second metadata types;
invoking a registered handler to extract metadata and other information from each linked underlying information source; and
,using the metadata and FTSData as part of the indexing and searching operations.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments are provided to manage information associated with a number of disparate information sources. In an embodiment, a system includes a data structure having a uniform metadata representation and a metadata handler for interacting with the data structure. The data structure can be associated with an underlying file or some other type of information source. In one embodiment, a number of handlers can be chained together to extract and return information according to a desired mode of operation. The embodiments provide extensibility and can be tailored according to a desired implementation. Other embodiments and implementations are available.
-
Citations
20 Claims
-
1. A computer readable medium including executable instructions which, when executed, manage information by:
-
creating a plurality of information sources, each information source comprising a metadata definition document (MDD) having a uniform representation of metadata and including full-text search data (FTSData), wherein each information source can be linked to an underlying information source having a representative file format and used to provide additional content for use in searching and indexing operations; using a schema to define the MDD for each information source, the schema including a FTSData element to specify FTSData, asset and content elements to identify a linked underlying information source, and a queries element to identify query to underlying information source mappings; calling an intermediate handler to extract metadata and FTSData from the plurality of information sources, the intermediate handler to return the metadata and FTSData from each information source based in part on operating modes including; a first operating mode to append FTSData of each information source with FTSData of a corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of linked underlying information sources for conflicts of first metadata types; a second operating mode to append FTSData of each information source with FTSData of the corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of the information sources for conflicts of the first metadata types; a third operating mode to return FTSData of each information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of the linked underlying information sources for conflicts of the first metadata types and second metadata types; and
,a fourth operating mode to return FTSData of each linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, including ignoring the metadata of the information sources for conflicts of the first metadata types and the second metadata types; invoking a registered handler to extract metadata and other information from each linked underlying information source; and
,using the metadata and FTSData as part of the indexing and searching operations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system to manage information comprising:
-
a plurality of information sources, each information source comprising a MDD that includes a uniform representation of metadata and being defined using a schema that includes a FTSData element to specify FTSData, asset and content elements to identify a linked underlying information source, and a queries element to identify query to underlying information source mappings, wherein the plurality of information sources include FTSData and are linked to underlying information sources; an intermediate handler configured to filter metadata from the plurality of information sources and return the metadata and FTSData from each information source based in part on operating modes that include; a first operating mode to append FTSData of each information source with FTSData of a corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the intermediate handler operates as part of the first operating mode to ignore the metadata of linked underlying information sources for conflicts of first metadata types; a second operating mode to append FTSData of each information source with FTSData of the corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the intermediate handler operates as part of the second operating mode to ignore the metadata of the information sources for conflicts of the first metadata types; a third operating mode to return FTSData of each information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the intermediate handler operates as part of the third operating mode to ignore metadata of the linked underlying information sources for conflicts of the first metadata types and second metadata types; and
,a fourth operating mode to return FTSData of each linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the intermediate handler operates to ignore metadata of the information sources for conflicts of the first metadata types and the second metadata types; a chained handler configured to filter metadata from the underlying information source, wherein the intermediate handler can call on the chained handler to filter the metadata; and
,an indexer to index the extracted metadata and any existing FTSData against the information source. - View Dependent Claims (16, 17)
-
-
18. A method of managing information comprising:
-
defining a data structure including a uniform representation of metadata, wherein the data structure can include an element defining free-form text that can be used to provide additional details about the data structure; creating a plurality of information sources based in part on the data structure, each information source comprising a MDD having the uniform representation of metadata and including full-text search data (FTSData), wherein the each information source can be linked to an underlying information source having a representative file format; extracting the metadata and free-form text from each information source by invoking a metadata handler to extract the metadata and free-form text, the metadata handler to return the metadata and FTSData from each information source based in part on operating modes that include; a first operating mode to append FTSData of each information source with FTSData of a corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the metadata handler operates as part of the first operating mode to ignore the metadata of linked underlying information sources for conflicts of first metadata types; a second operating mode to append FTSData of each information source with FTSData of the corresponding linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the metadata handler operates as part of the second operating mode to ignore the metadata of the information sources for conflicts of the first metadata types; a third operating mode to return FTSData of each information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the metadata handler operates as part of the third operating mode to ignore metadata of the linked underlying information sources for conflicts of the first metadata types and second metadata types; and
,a fourth operating mode to return FTSData of each linked underlying information source and provide a union of metadata from each information source and the corresponding linked underlying information source, wherein the metadata handler operates as part of the fourth operating mode to ignore metadata of the information sources for conflicts of the first metadata types and the second metadata types; chaining a second handler that is configured to extract underlying metadata from a corresponding linked underlying information source; and
,performing an indexing operation by using the extracted metadata and free-form text to index against the data structure. - View Dependent Claims (19, 20)
-
Specification