Processing mixed numeric and symbolic data encodings using scaling at one distance of at least one dimension, clustering, and a signpost transformation
First Claim
Patent Images
1. An apparatus for processing mixed data for a selected task, comprising:
- a data collection agent comprising logic encoded in a computer-readable medium that when executed is operable to collect mixed data implemented in a plurality of encoding schemes, the mixed data collected from a plurality of data sources wherein the plurality of data sources comprises at least two unique data sources comprising at least two unique encodings;
a neural network comprising logic encoded in a computer-readable medium that when executed is operable to implement an input transformation module, the input transformation module adapted to;
determine a complexity associated with the mixed data, the complexity based on a number of dimensions associated with the encoded data and a desired functional output for the selected task;
upon determining the complexity is below a first threshold, transform the encoding into a first numerical encoding based on a pattern in the encoding;
upon determining the complexity is above the first threshold;
determine a distance metric for determining a distance between any two data points within a particular dimension of the encoding;
scale at one distance of at least one dimension;
cluster the mixed data based on the determined distance metric;
transform the encoding into a second numerical encoding using a signpost transformation that dynamically adjusts a level of detail based on the desired functional output for the selected task;
a functional link network comprising logic encoded in a computer readable medium that when executed is operable to process the transformed encoding to provide the functional output for the selected task; and
a memory module operable to store the functional output.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method for processing mixed data for a selected task is provided. An input transformation module converts mixed data into converted data. A functional mapping module processes the converted data to provide a functional output for the selected task. The selected task may be one or a combination of a variety of possible tasks, including search, recall, prediction, classification, etc. For example, the selected task may be for data mining, database search, targeted marketing, computer virus detection, etc.
82 Citations
41 Claims
-
1. An apparatus for processing mixed data for a selected task, comprising:
-
a data collection agent comprising logic encoded in a computer-readable medium that when executed is operable to collect mixed data implemented in a plurality of encoding schemes, the mixed data collected from a plurality of data sources wherein the plurality of data sources comprises at least two unique data sources comprising at least two unique encodings; a neural network comprising logic encoded in a computer-readable medium that when executed is operable to implement an input transformation module, the input transformation module adapted to; determine a complexity associated with the mixed data, the complexity based on a number of dimensions associated with the encoded data and a desired functional output for the selected task; upon determining the complexity is below a first threshold, transform the encoding into a first numerical encoding based on a pattern in the encoding; upon determining the complexity is above the first threshold; determine a distance metric for determining a distance between any two data points within a particular dimension of the encoding; scale at one distance of at least one dimension; cluster the mixed data based on the determined distance metric; transform the encoding into a second numerical encoding using a signpost transformation that dynamically adjusts a level of detail based on the desired functional output for the selected task; a functional link network comprising logic encoded in a computer readable medium that when executed is operable to process the transformed encoding to provide the functional output for the selected task; and a memory module operable to store the functional output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 40, 41)
-
-
33. A computer-implemented method of processing mixed data for a selected task, comprising:
-
collecting mixed data implemented in a plurality of encoding schemes, the mixed data collected from a plurality of data sources wherein the plurality of data sources comprises at least two unique data sources comprising at least two unique types; determining a complexity associated with the mixed data, the complexity based on a number of dimensions associated with the encoded data and a desired functional output for the selected task; upon determining the complexity is below a first threshold, transforming the encoding into a first numerical encoding based on a pattern in the encoding; upon determining the complexity is above the first threshold; determining a distance metric for determining a distance between any two data points within a particular dimension of the encoding; scaling at one distance of at least one dimension; clustering the mixed data based on the determined distance metric; transforming the encoding into a second numerical encoding using a signpost transformation that dynamically adjusts a level of detail based on the desired functional output for the selected task; processing the transformed encoding to provide the functional output for the selected task; and storing the functional output. - View Dependent Claims (34, 35, 36, 37)
-
-
38. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine, when executed by the machine the program of instruction operable to:
-
collect mixed data implemented in a plurality of encoding schemes, the mixed data collected from a plurality of data sources wherein the plurality of data sources comprises at least two unique data sources comprising at least two unique encodings; determine a complexity associated with the mixed data, the complexity based on a number of dimensions associated with the encoded data and a desired functional output for the selected task; upon determining the complexity is below a first threshold, transform the encoding into a first numerical encoding based on a pattern in the encoding; upon determining the complexity is above the first threshold; determine a distance metric for determining a distance between any two data points within a particular dimension of the encoding; scale at one distance of at least one dimension; cluster the mixed data based on the determined distance metric; transform the encoding into a second numerical encoding using a signpost transformation that dynamically adjusts a level of detail based on the desired functional output for the selected task; process the transformed encoding to provide the functional output for the selected task; and store the functional output.
-
-
39. A computing system, comprising:
-
a processor; and a program storage device readable by the computer system, tangibly embodying a program of instructions executable by the processor to; collect mixed data implemented in a plurality of encoding schemes, the mixed data collected from a plurality of data sources wherein the plurality of data sources comprises at least two unique data sources comprising at least two unique types of mixed data; determine a complexity associated with the mixed data, the complexity based on a number of dimensions associated with the encoded data and a desired functional output for the selected task; upon determining the complexity is below a first threshold, transform the encoding into a first numerical encoding based on a pattern in the encoding; upon determining the complexity is above the first threshold; determine a distance metric for determining a distance between any two data points within a particular dimension of the encoding; scale at one distance of at least one dimension; cluster the mixed data based on the determined distance metric; transform the encoding into a second numerical encoding using a signpost transformation that dynamically adjusts a level of detail based on the desired functional output for the selected task; process the transformed data to provide the functional output for the selected task; and store the functional output.
-
Specification