Techniques for real-time generation of temporal comparative and superlative analytics in natural language for real-time dynamic data analytics
First Claim
1. A method of forming a natural language query template, the method comprising:
obtaining a sample of real-time events from a data stream;
identifying, from data in the events in the sample, using processing resources including at least one hardware processor, measures and dimensions associated with that data;
classifying, using the processing resources, the identified measures and dimensions as belonging to one or more distinct measures and one or more distinct dimensions, respectively;
selecting at least one of the one or more distinct measures and/or at least one of the one or more distinct dimensions for inclusion in the natural language query template, the selecting being performed by the processing resources automatically, the natural language query template including natural language expressions and templated fields, at least one of the templated fields enabling user selection of one of a comparative and a superlative; and
building, using the processing resources, the natural language query template by arranging the at least one selected distinct measure and/or the at least one selected distinct dimension in the natural language query template as user-selectable options in at least some of the templated fields;
wherein a natural language query is buildable from the natural language query template and specified user-selectable options, and wherein the natural language query, in turn, is transformable into a formalized query executable on the data stream.
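As a rough illustration of the steps recited above, the following Python sketch takes a small sample of events, treats numeric fields as measures and text fields as dimensions, and arranges them as user-selectable options in a template. The numeric/text heuristic, the event field names, and the helpers `classify_fields` and `build_template` are assumptions made for this example; the claim does not prescribe any of them.

```python
from numbers import Number

# Hypothetical sample of events; field names are invented for illustration.
SAMPLE = [
    {"timestamp": 1, "region": "EMEA", "product": "A", "revenue": 120.0, "units": 3},
    {"timestamp": 2, "region": "APAC", "product": "B", "revenue": 80.5, "units": 2},
    {"timestamp": 3, "region": "EMEA", "product": "B", "revenue": 42.0, "units": 1},
]


def classify_fields(events, time_field="timestamp"):
    """Split field names into distinct measures and distinct dimensions.

    Assumption: numeric values indicate a measure, string values a dimension.
    """
    measures, dimensions = set(), set()
    for event in events:
        for name, value in event.items():
            if name == time_field:
                continue
            # bool is a subclass of int, so exclude it from the numeric test.
            if isinstance(value, Number) and not isinstance(value, bool):
                measures.add(name)
            elif isinstance(value, str):
                dimensions.add(name)
    return sorted(measures), sorted(dimensions)


def build_template(measures, dimensions):
    """Arrange measures and dimensions as options of the templated fields."""
    return {
        "text": "Which {dimension} has the {superlative} {measure}?",
        "fields": {
            "dimension": dimensions,
            "superlative": ["highest", "lowest"],
            "measure": measures,
        },
    }


measures, dimensions = classify_fields(SAMPLE)
template = build_template(measures, dimensions)
print(template["fields"])
```

Sets are used while scanning so that repeated field names collapse into distinct measures and dimensions, mirroring the classification step.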
Abstract
Certain example embodiments relate to techniques for real-time generation of temporal comparative and superlative analytics. For instance, certain example embodiments analyze real-time events to generate building blocks that are assemblable into natural language question templates. These natural language question templates, which may include natural language phrases and templated fields, may be comparative or superlative in nature. User selectable options or parameters may be inserted in place of the templated fields, and complete questions, including user selections and the comparative or superlative, may be translated into a formal query language for execution on the real-time data source. In certain example embodiments, a GUI may enable users to build pre-constructed questions with selectable parameters.
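To make the last step concrete, here is a minimal Python sketch of filling a template with user selections and translating the completed question into a formal query. The abstract does not name the target query language at this point, so the sketch emits a generic SQL-like string; the template text, the option lists, the `SUM` aggregation, and the stream name `events` are all assumptions for illustration.

```python
TEMPLATE = "Which {dimension} has the {superlative} {measure}?"

# Allowed options per templated field (hypothetical values).
OPTIONS = {
    "dimension": ["product", "region"],
    "superlative": ["highest", "lowest"],
    "measure": ["revenue", "units"],
}

# Mapping from the natural language superlative to a sort direction.
SORT_ORDER = {"highest": "DESC", "lowest": "ASC"}


def validate(selections):
    """Reject any selection that is not one of the offered options."""
    for field, allowed in OPTIONS.items():
        if selections.get(field) not in allowed:
            raise ValueError(f"invalid selection for {field!r}")


def build_question(selections):
    """Fill the templated fields with the validated user selections."""
    validate(selections)
    return TEMPLATE.format(**selections)


def to_formal_query(selections, stream="events"):
    """Translate the selections into a generic SQL-like query string."""
    validate(selections)
    dim, measure = selections["dimension"], selections["measure"]
    order = SORT_ORDER[selections["superlative"]]
    return (
        f"SELECT {dim}, SUM({measure}) AS total FROM {stream} "
        f"GROUP BY {dim} ORDER BY total {order} LIMIT 1"
    )


choice = {"dimension": "region", "superlative": "highest", "measure": "revenue"}
print(build_question(choice))
print(to_formal_query(choice))
```

Because every selection is checked against a fixed option list before it is interpolated, free-form user text never reaches the query string in this sketch.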
24 Citations
25 Claims
1. A method of forming a natural language query template, the method comprising:
obtaining a sample of real-time events from a data stream;
identifying, from data in the events in the sample, using processing resources including at least one hardware processor, measures and dimensions associated with that data;
classifying, using the processing resources, the identified measures and dimensions as belonging to one or more distinct measures and one or more distinct dimensions, respectively;
selecting at least one of the one or more distinct measures and/or at least one of the one or more distinct dimensions for inclusion in the natural language query template, the selecting being performed by the processing resources automatically, the natural language query template including natural language expressions and templated fields, at least one of the templated fields enabling user selection of one of a comparative and a superlative; and
building, using the processing resources, the natural language query template by arranging the at least one selected distinct measure and/or the at least one selected distinct dimension in the natural language query template as user-selectable options in at least some of the templated fields;
wherein a natural language query is buildable from the natural language query template and specified user-selectable options, and wherein the natural language query, in turn, is transformable into a formalized query executable on the data stream.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
12. A method of forming a natural language query template, the method comprising:
obtaining a sample of real-time events from a data stream;
identifying, from the events in the sample, using processing resources including at least one hardware processor, measures and dimensions associated therewith;
classifying, using the processing resources, the identified measures and dimensions as belonging to one or more distinct measures and one or more distinct dimensions, respectively;
selecting, using the processing resources, at least one of the one or more distinct measures and/or at least one of the one or more distinct dimensions for inclusion in the natural language query template, the natural language query template including natural language expressions and templated fields, at least one of the templated fields enabling user selection of one of a comparative and a superlative; and
arranging, using the processing resources, the at least one selected distinct measure and/or the at least one selected distinct dimension in the natural language query template as user-selectable options in at least some of the templated fields;
wherein the natural language query template with specified user-selectable options is transformable into a formalized query executable on the data stream; and
wherein the obtaining of the sample comprises:
(a) obtaining an initial sample having a predetermined sample size, each said real-time event in the initial sample having an associated timestamp;
(b) identifying all unique timestamps in the initial sample;
(c) determining whether the number of unique timestamps in the initial sample size divided by the number of real-time events in the initial sample meets or exceeds a predetermined threshold value;
(d) in response to a determination that the number of unique timestamps in the initial sample size divided by the number of real-time events in the initial sample meets or exceeds the predetermined threshold value, treating the initial sample as the sample; and
(e) in response to a determination that the number of unique timestamps in the initial sample size divided by the number of real-time events in the initial sample does not meet or exceed the predetermined threshold value, increasing the predetermined sample size and repeating (a)-(e), as needed.
Dependent Claims (13, 14, 15)
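Steps (a) through (e) describe a loop that grows the sample until the ratio of unique timestamps to events reaches a threshold. A minimal Python sketch of that loop follows. The `read_events` callable, the doubling growth factor, the `max_size` safety cap, and the demo data are assumptions added for this example; the claim only requires that the sample size be increased.

```python
def obtain_sample(read_events, initial_size=100, threshold=0.5,
                  growth=2, max_size=10_000):
    """Grow the sample until unique timestamps / events meets the threshold.

    read_events(n) is a hypothetical callable returning up to n events, each
    a dict with a "timestamp" key. max_size is a cap added for this sketch so
    the loop always terminates.
    """
    size = initial_size
    while True:
        sample = read_events(size)                      # step (a)
        if not sample:
            return sample                               # nothing to measure
        unique = {event["timestamp"] for event in sample}   # step (b)
        ratio = len(unique) / len(sample)               # step (c)
        if ratio >= threshold or size >= max_size:
            return sample                               # step (d), or cap hit
        size *= growth                                  # step (e)


# Demo data: a burst of 8 events sharing one timestamp, then unique ones.
STREAM = [{"timestamp": 0, "value": i} for i in range(8)]
STREAM += [{"timestamp": t, "value": t} for t in range(1, 93)]


def read_events(n):
    """Return the first n events; a live stream would draw fresh events."""
    return STREAM[:n]


sample = obtain_sample(read_events, initial_size=10, threshold=0.5)
print(len(sample))
```

With these demo values, the first sample of 10 events has 3 unique timestamps (ratio 0.3), so the size doubles to 20, where 13 unique timestamps give a ratio of 0.65 and the sample is accepted.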
16. An event processing system, comprising:
an event channel configured to receive real-time events from one or more computing systems;
a non-transitory computer readable storage medium; and
processing resources including at least one processor and a memory configured to control the system to at least:
obtain a sample of real-time events from the event channel;
identify, from data in the events in the sample, measures and dimensions associated with that data;
classify the identified measures and dimensions as belonging to one or more distinct measures and one or more distinct dimensions, respectively;
select at least one of the one or more distinct measures and/or at least one of the one or more distinct dimensions for inclusion in a natural language query template that includes natural language expressions and templated fields, at least one of the templated fields enabling user selection of one of a comparative and a superlative;
arrange the at least one selected distinct measure and/or the at least one selected distinct dimension in the natural language query template as user-selectable options in at least some of the templated fields;
responsive to the arranging, store to the non-transitory computer readable storage medium the natural language query template in association with the arranged at least one selected distinct measure and/or the arranged at least one selected distinct dimension; and
enable the natural language query template with specified user-selectable options to be transformed into a formalized query executable on the event channel.
Dependent Claims (17, 18, 19, 20, 21, 22, 23, 24)
25. A non-transitory computer readable storage medium having a program for forming a natural language query template stored thereon, the program comprising instructions that, when executed, cause a computing system including at least one processor and a memory to at least:
obtain a sample of real-time events from a data stream;
identify, from the events in the sample, measures and dimensions associated therewith;
classify the identified measures and dimensions as belonging to one or more distinct measures and one or more distinct dimensions, respectively;
automatically select at least one of the one or more distinct measures and/or at least one of the one or more distinct dimensions for inclusion in the natural language query template, the at least one selected distinct measure and/or the at least one selected distinct dimension being selected dynamically based on the classification and having a propensity for subsequent user selection that is higher than non-selected distinct measure(s) and/or non-selected distinct dimension(s), the natural language query template including natural language expressions and templated fields, at least one of the templated fields enabling user selection of one of a comparative and a superlative; and
arrange the at least one selected distinct measure and/or the at least one selected distinct dimension in the natural language query template as user-selectable options in at least some of the templated fields;
wherein the natural language query template with specified user-selectable options is transformable into a formalized query executable on the data stream.
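The selection step in this claim favors measures and dimensions with a higher propensity for later user selection. The claim text here does not say how that propensity is estimated, so the Python sketch below uses counts of past user selections purely as a stand-in; the function `select_by_propensity`, the `top_n` cutoff, and the sample counts are hypothetical.

```python
def select_by_propensity(candidates, propensity, top_n=2):
    """Return the top_n candidates ranked by a propensity score.

    Candidates missing from the propensity mapping score 0. Ties are broken
    alphabetically so the result is deterministic.
    """
    ranked = sorted(candidates, key=lambda name: (-propensity.get(name, 0), name))
    return ranked[:top_n]


# Hypothetical distinct measures and counts of how often users picked each.
measures = ["latency", "revenue", "units"]
past_selections = {"revenue": 42, "units": 17, "latency": 3}

selected = select_by_propensity(measures, past_selections)
print(selected)
```

Any other score (for example, one derived from the classification itself) could be passed in the same mapping without changing the ranking logic.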
Specification