Extracting semantics from data
First Claim
Patent Images
1. A computer-implemented method comprising:
- (A) converting a plurality of atomic tuples, representing contents of a plurality of cells stored in a grid, into a plurality of schema tuples, comprising;
(A)(1) assigning a plurality of catalog-defined types to the plurality of atomic tuples;
(A)(2) assigning a role of locator to a first subset of the plurality of cells based on the plurality of catalog-defined logical types; and
(A)(3) determining whether a locator tuple slice in the plurality of cells contains a missing value.
0 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the invention convert data from atomic tuples found in data sources such as spreadsheets (e.g., raw numbers, words, and formatted dates) into semantically enriched schemas and associated tuples. In addition to the data content, visual content, such as font and background color, is also analyzed as a part of the interpretation process. Embodiments of the invention also provide methods of interacting with the raw data via the semantically enriched schema tuples.
-
Citations
52 Claims
-
1. A computer-implemented method comprising:
(A) converting a plurality of atomic tuples, representing contents of a plurality of cells stored in a grid, into a plurality of schema tuples, comprising; (A)(1) assigning a plurality of catalog-defined types to the plurality of atomic tuples; (A)(2) assigning a role of locator to a first subset of the plurality of cells based on the plurality of catalog-defined logical types; and (A)(3) determining whether a locator tuple slice in the plurality of cells contains a missing value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
27. A non-transitory computer-readable medium comprising computer program instructions executable by a computer processor to perform a method, the method comprising:
-
(A) converting a plurality of atomic tuples, representing a plurality of cells stored in a grid, into a plurality of schema tuples, the method comprising; (A) (1) assigning a plurality of logical types to theplurality of atomic tuples; (A) (2) assigning a role of locator to a first subset of the plurality of cells based on the plurality of logical types; (A) (3) extracting from the plurality of atomic tuples a locator tuple slice, wherein the locator tuple slice comprises a plurality of locator tuples corresponding to a plurality of contiguous cells, from the plurality of cells, having the role of locator and forming a 1XN shape within the grid; (A) (4) determining, based on values of at least some of the plurality of contiguous cells, whether any empty cells in the plurality of contiguous cells implicitly contain missing values; (A) (5) if any of the empty cells are determined to implicitly contain missing values, then storing the missing values of the locator tuples of the locator tuple slice corresponding to the cells which are determined to implicitly contain missing values; and (A) (6) converting the plurality of locator tuples into a plurality of logical schema tuples corresponding to the plurality of locator tuples, wherein each of the plurality of logical schema tuples comprises a type of the corresponding locator tuple and a value of the corresponding locator tuple. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52)
-
Specification