File system with access and retrieval of XML documents
Abstract
An XML-aware file system exploits attributes encoded in an XML document. The file system presents a dynamic directory structure to the user, and breaks the conventional tight linkage between sets of files and the physical directory structure, thus allowing different users to see files organized in a different fashion. The dynamic structure is based upon content, which is extracted using an inverted index according to attributes and values defined by the XML structure.
51 Claims
1. A computer implemented method of information retrieval, comprising the steps of:
retrieving structural information of memorized documents according to a document type declaration that corresponds to each of said documents;
retrieving elements, attributes and values of said elements and said attributes of said documents;
generating a multilevel inverted index from said structural information, said elements, said attributes and said values;
accepting a specification from a user having members that comprise at least one of said elements, said attributes and said values;
responsive to said specification, extracting data from said index that complies with at least one of said members;
displaying virtual directory paths of corresponding ones of said documents, wherein said directory paths each comprise a sequence of said members, and wherein contents of directories that are identified in said directory paths comprise selected ones of said documents possessing said specification.
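The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the sample documents, the function names `build_index` and `list_virtual_dir`, the nesting order (element, then attribute, then value), and the `#text` pseudo-attribute used for element text are all assumptions made for illustration. The sketch also reads structure directly from the parsed documents rather than from a document type declaration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical in-memory corpus; file names and contents are invented.
DOCS = {
    "a.xml": '<book genre="fiction"><author>Smith</author></book>',
    "b.xml": '<book genre="science"><author>Jones</author></book>',
    "c.xml": '<book genre="fiction"><author>Jones</author></book>',
}


def build_index(docs):
    """Build a nested index: element -> attribute -> value -> set of documents."""
    index = defaultdict(lambda: defaultdict(lambda: defaultdict(set)))
    for name, text in docs.items():
        for elem in ET.fromstring(text).iter():
            for attr, value in elem.attrib.items():
                index[elem.tag][attr][value].add(name)
            # Element text is filed under the pseudo-attribute "#text"
            # (an assumption of this sketch, not taken from the claims).
            if elem.text and elem.text.strip():
                index[elem.tag]["#text"][elem.text.strip()].add(name)
    return index


def list_virtual_dir(index, members):
    """Treat the user's members as a virtual path and list its contents.

    Returns (path, entries). Entries are document names once the path
    reaches a value, and the next-level member names otherwise.
    """
    path = "/" + "/".join(members)
    node = index
    for member in members:
        # Stop if the path runs past a value or names an unknown member.
        if isinstance(node, set) or member not in node:
            return path, []
        node = node[member]
    if isinstance(node, set):
        return path, sorted(node)
    return path, sorted(node.keys())


if __name__ == "__main__":
    idx = build_index(DOCS)
    print(list_virtual_dir(idx, ["book", "genre", "fiction"]))
    print(list_virtual_dir(idx, ["book", "genre"]))
```

In this sketch a directory exists only as a lookup into the index, so the same documents can appear under several virtual paths without being moved or copied.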
9. A computer software product, comprising a computer-readable medium in which computer program instructions are stored, which instructions, when read by a computer, cause the computer to perform the steps of:
retrieving structural information of memorized documents according to a document type declaration that corresponds to each of said documents;
retrieving elements, attributes and values of said elements and said attributes of said documents;
generating a multilevel inverted index from said structural information, said elements, said attributes and said values;
accepting a specification from a user having members that comprise at least one of said elements, said attributes and said values;
responsive to said specification, extracting data from said index that complies with at least one of said members;
associating said data with corresponding ones of said documents;
displaying said corresponding ones of said documents as virtual directory paths, wherein said directory paths each comprise a sequence of said members, and wherein contents of directories that are identified in said directory paths comprise selected ones of said documents possessing said specification.
19. A computer implemented information retrieval system for presenting a semantically dependent directory structure of XML files to a user, comprising:
a file system engine, that receives a file request via a file system application programming interface and issues file system calls to an operating system, wherein said file request specifies a file content of memorized files;
an XML parser, linked to said file system engine, that retrieves structural information of XML documents, said XML parser further retrieving at least one of elements, attributes and respective values thereof from said XML documents;
an indexer, linked to said XML parser, for constructing an inverted index of said elements and said attributes and said respective values thereof, wherein responsive to said file request, said file system engine retrieves postings of said inverted index that satisfy requirements of said file request, and returns directory paths to said file system application programming interface of selected ones of said XML documents corresponding to said postings.
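The division of labor among the three components of claim 19 can be sketched as follows. This is an illustrative arrangement only: the class names `Parser`, `Indexer` and `Engine`, the `(element, attribute, value)` term format, the rule that a request must match every term, and the sample paths are assumptions. The sketch also omits the file system API and the calls to the operating system that the claim recites.

```python
import posixpath
import xml.etree.ElementTree as ET
from collections import defaultdict


class Parser:
    """Extracts (element, attribute, value) terms from one XML document."""

    def terms(self, xml_text):
        for elem in ET.fromstring(xml_text).iter():
            for attr, value in elem.attrib.items():
                yield (elem.tag, attr, value)


class Indexer:
    """Keeps an inverted index mapping each term to its postings (document paths)."""

    def __init__(self, parser):
        self.parser = parser
        self.postings = defaultdict(set)

    def add(self, path, xml_text):
        for term in self.parser.terms(xml_text):
            self.postings[term].add(path)


class Engine:
    """Answers a content-based file request with virtual directory paths."""

    def __init__(self, indexer):
        self.indexer = indexer

    def request(self, *terms):
        if not terms:
            return []
        # A document satisfies the request only if it appears in the
        # postings of every requested term (a design choice of this sketch).
        hits = set.intersection(
            *(self.indexer.postings.get(t, set()) for t in terms)
        )
        prefix = "/".join(f"{e}/{a}/{v}" for e, a, v in terms)
        return [f"/{prefix}/{posixpath.basename(doc)}" for doc in sorted(hits)]


if __name__ == "__main__":
    indexer = Indexer(Parser())
    indexer.add("/data/q1.xml", '<report year="2001" dept="sales"/>')
    indexer.add("/data/q2.xml", '<report year="2001" dept="hr"/>')
    engine = Engine(indexer)
    print(engine.request(("report", "year", "2001")))
```

The physical location of each document (`/data/...` here) never appears in the returned paths, which depend only on the terms of the request.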
27. A computer implemented method of information retrieval, comprising the steps of:
retrieving structural information of memorized documents according to a document type declaration that corresponds to each of said documents, wherein said documents are written in a markup language;
retrieving elements, attributes and values of said elements and said attributes of said documents;
generating a multilevel inverted index from said structural information, said elements, said attributes and said values;
accepting a specification from a user having members that comprise at least one of said elements, said attributes and said values;
responsive to said specification, extracting data from said index that complies with at least one of said members;
displaying virtual directory paths of corresponding ones of said documents, wherein said directory paths each comprise a sequence of said members, and wherein contents of directories that are identified in said directory paths comprise selected ones of said documents possessing said specification.
34. A computer software product, comprising a computer-readable medium in which computer program instructions are stored, which instructions, when read by a computer, cause the computer to perform the steps of:
retrieving structural information of memorized documents according to a document type declaration that corresponds to each of said documents;
wherein said documents are written in a markup language;
retrieving elements, attributes and values of said elements and said attributes of said documents;
generating a multilevel inverted index from said structural information, said elements, said attributes and said values;
accepting a specification from a user having members that comprise at least one of said elements, said attributes and said values;
responsive to said specification, extracting data from said index that complies with at least one of said members;
associating said data with corresponding ones of said documents;
displaying said corresponding ones of said documents as virtual directory paths, wherein said directory paths each comprise a sequence of said members, and wherein contents of directories that are identified in said directory paths comprise selected ones of said documents possessing said specification.
44. A computer implemented information retrieval system for presenting a semantically dependent directory structure of document files to a user, wherein documents of said document files are written in a markup language, comprising:
a file system engine, that receives a file request via a file system application programming interface and issues file system calls to an operating system, wherein said file request specifies a file content of memorized files;
a parser of said markup language, linked to said file system engine, that retrieves structural information of said documents, said parser further retrieving at least one of elements, attributes and respective values thereof from said documents;
an indexer, linked to said parser, for constructing an inverted index of said elements and said attributes and said respective values thereof, wherein responsive to said file request, said file system engine retrieves postings of said inverted index that satisfy requirements of said file request, and returns directory paths to said file system application programming interface of selected ones of said documents corresponding to said postings.
Specification