×

Modeling sequence and time series data in predictive analytics

  • US 7,747,641 B2
  • Filed: 04/28/2005
  • Issued: 06/29/2010
  • Est. Priority Date: 07/09/2004
  • Status: Active Grant
First Claim
Patent Images

1. A declarative data modeling language system for predicting sequences and time series data automatically, and by identifying patterns without manual pattern identification or validation, comprising:

  • a processor and memory;

    a data modeling language component that automatically generates at least one data mining model to extract predictive information from at least one database, and in a manner that does not require manual identification or validation of a predictive pattern;

    a plurality of language extension components configured in the data modeling language, the plurality of language extension components providing at least;

    a data sequence model in the data modeling language to generate sequence predictions;

    a time series model in the data modeling language and facilitating generating time series predictions of at least one of a casual or discrete subsequent data value in a time series, wherein the sequence model and the time series model are separate models, and in which the data sequence model predicts events based at least in part on historical event data, and the time series model predicts numerical time values based on historical numerical time value data;

    wherein one or both of the data sequence model or the time series model include schema rowsets stores that include contents of a mining model according to a transition matrix for clustering sequences and storing probabilities of transitions between different states;

    wherein the schema rowsets include All, Cluster and Sequence, in which;

    All is a node that is a root and represents a model;

    Cluster is a child of All; and

    Sequence is a child of All that stores a marginal transition matrix, and in which each Cluster has a Sequence child that contains a set of children, each of which is a column in the transition matrix; and

    wherein the memory configured to the processor retains at least one piece of information that pertains to the data modeling language component or the language extension components when directed to the processor.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×