Method of processing queries in a database system, and database system and software product for implementing such method
First Claim
1. A method of processing a query in a database system, wherein a plurality of row identifiers are defined to designate respective rows of a reference table having columns respectively associated with data attributes, said rows containing groups of related words assigned to said attributes in a collection of data, wherein a plurality of thesauruses each associated with a respective attribute and data representing reference table row identifier lists respectively associated with entries of said thesauruses are stored, wherein each thesaurus associated with one attribute is defined with reference to a partition into subsets of a set of words which can be assigned to said one attribute and has a respective entry for each subset including at least one word assigned to said one attribute in the collection of data, the reference table row identifier list associated with said thesaurus entry including any identifier allocated to a row of the reference table having a word of said subset assigned to said one attribute, the method comprising the steps of:
- determining a matching reference table row identifier list based on a combination of thesaurus entries relevant to the query and on the stored data representing the reference table row identifier lists associated with said relevant thesaurus entries; and
processing said matching row identifier list to output a response, wherein the step of processing the matching row identifier list comprises, for at least one attribute specified in the query, selecting a thesaurus associated with said attribute and detecting entries of the selected thesaurus with which identifier lists having a non-empty intersection with the matching row identifier list are associated.
3 Assignments
0 Petitions
Accused Products
Abstract
A reference table, which may not be stored, has columns associated with data attributes and rows containing related words assigned to those attributes in a collection of data. The stored data include thesauruses associated with the attributes, and reference table row identifier lists respectively associated with thesaurus entries. Each thesaurus is defined with reference to a partition into subsets of the words which can be assigned to the associated attribute, and has a respective entry for each subset including an assigned word, the row identifier list associated with this entry including any identifier allocated to a row of the reference table having a word of the subset assigned to the associated attribute. A matching reference table row identifier list is determined from the data representing the row identifier lists associated with thesaurus entries relevant to the query. To output a response, a thesaurus associated with at least one attribute is selected, and the entries of the selected thesaurus with which identifier lists having a non-empty intersection with the matching row identifier list are associated are detected.
70 Citations
55 Claims
-
1. A method of processing a query in a database system, wherein a plurality of row identifiers are defined to designate respective rows of a reference table having columns respectively associated with data attributes, said rows containing groups of related words assigned to said attributes in a collection of data, wherein a plurality of thesauruses each associated with a respective attribute and data representing reference table row identifier lists respectively associated with entries of said thesauruses are stored, wherein each thesaurus associated with one attribute is defined with reference to a partition into subsets of a set of words which can be assigned to said one attribute and has a respective entry for each subset including at least one word assigned to said one attribute in the collection of data, the reference table row identifier list associated with said thesaurus entry including any identifier allocated to a row of the reference table having a word of said subset assigned to said one attribute, the method comprising the steps of:
-
determining a matching reference table row identifier list based on a combination of thesaurus entries relevant to the query and on the stored data representing the reference table row identifier lists associated with said relevant thesaurus entries; and
processing said matching row identifier list to output a response, wherein the step of processing the matching row identifier list comprises, for at least one attribute specified in the query, selecting a thesaurus associated with said attribute and detecting entries of the selected thesaurus with which identifier lists having a non-empty intersection with the matching row identifier list are associated. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 39)
-
-
38. A database system for managing information from a collection of data, wherein a plurality of row identifiers are defined to designate respective rows of a reference table having columns respectively associated with data attributes, said rows containing groups of related words assigned to said attributes in the collection of data, the database system comprising:
-
means for storing a plurality of thesauruses respectively associated with attributes of said group, wherein each thesaurus associated with an attribute is defined with reference to a partition into subsets of a set of words which can be assigned to said attribute and has a respective entry for each subset including at least one word assigned to said attribute in the collection of data;
means for storing data representing identifier lists respectively associated with the thesaurus entries, wherein the identifier list associated with a entry, relating to a subset, of a thesaurus associated with an attribute includes any row identifier designating a row of the reference table having a word of said subset assigned to said attribute;
means for determining a matching reference table row identifier list based on a combination of thesaurus entries relevant to a query and on the stored data representing the reference table row identifier lists associated with said relevant thesaurus entries; and
means for processing said matching row identifier list to output a response, and wherein the means for processing the matching row identifier list comprise means for selecting a thesaurus associated with an attribute specified in the query and means for detecting entries of the selected thesaurus with which identifier lists having a non-empty intersection with the matching row identifier list are associated.
-
-
40. A computer program product for processing queries in a database system, wherein a plurality of row identifiers are defined to designate respective rows of a reference table having columns respectively associated with data attributes, said rows containing groups of related words assigned to said attributes in a collection of data, the database system comprising:
-
means for storing a plurality of thesauruses respectively associated with attributes of said group, wherein each thesaurus associated with an attribute is defined with reference to a partition into subsets of a set of words which can be assigned to said attribute and has a respective entry for each subset including at least one word assigned to said attribute in the collection of data; and
means for storing data representing identifier lists respectively associated with the thesaurus entries, wherein the identifier list associated with a entry, relating to a subset, of a thesaurus associated with an attribute includes any row identifier designating a row of the reference table having a word of said subset assigned to said attribute, the computer program product comprising;
instructions for determining a matching reference table row identifier list based on a combination of thesaurus entries relevant to a query and on the stored data representing the reference table row identifier lists associated with said relevant thesaurus entries; and
instructions for processing said matching row identifier list to output a response, wherein the instructions for processing the matching row identifier list comprise instructions for selecting a thesaurus associated with an attribute specified in the query and instructions for detecting entries of the selected thesaurus with which identifier lists having a non-empty intersection with the matching row identifier list are associated. - View Dependent Claims (41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55)
-
Specification