Unstructured data in a mining model language
First Claim
1. A data mining tool that facilitates a data mining operation of unstructured data, comprising:
- a receiving component that receives a command in a declarative language that identifies the unstructured data; and
an unstructured data processing component that analyzes the received command and commences a data mining operation of the unstructured data.
2 Assignments
0 Petitions
Accused Products
Abstract
A standard mechanism for directly accessing unstructured data types (e.g., image, audio, video, gene sequencing and text data) in accordance with data mining operations is provided. The subject innovation can enable access to unstructured data directly from within the data mining engine or tool. Accordingly, the innovation enables multiple vendors to provide algorithms for mining unstructured data on a data mining platform (e.g., an SQL-brand server), thereby increasing adoption. As well, the subject innovation allows users to directly mine unstructured data that is not fixed-length, without pre-processing and tokenizing the data external to the data mining engine. In accordance therewith, the innovation can provide a mechanism to expand declarative language content types to include an “unstructured” data type thereby enabling a user and/or application to affirmatively designate mining data as an unstructured type.
-
Citations
20 Claims
-
1. A data mining tool that facilitates a data mining operation of unstructured data, comprising:
-
a receiving component that receives a command in a declarative language that identifies the unstructured data; and
an unstructured data processing component that analyzes the received command and commences a data mining operation of the unstructured data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method that facilitates data mining of unstructured data, comprising:
-
receiving a command in a declarative language that identifies the unstructured data as an unstructured type; and
generating a data mining model based at least in part upon the unstructured data. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A computer-implemented system that facilitates a data mining operation, comprising:
-
means for identifying an unstructured data type;
means for extracting a plurality of key elements from data identified as the unstructured data type;
means for converting the plurality of key elements into a format recognizable to a modeling algorithm; and
means for passing the converted plurality of key elements to the modeling algorithm. - View Dependent Claims (20)
-
Specification