Normalizing ingested signals
First Claim
1. A method comprising:
- ingesting a raw signal including a time stamp, an indication of a signal type, an indication of a signal source, and content;
normalizing the raw signal into a normalized signal by reducing the dimensionality of the raw signal, including;
determining a time dimension associated with the raw signal from the time stamp;
inferring location annotations and context annotations from characteristics of the raw signal;
determining a location dimension associated with the raw signal from one or more of;
location information included in the raw signal or the location annotations;
determining a context dimension associated with the raw signal from one or more of;
context information included in the raw signal or the context annotations, including;
calculating a probability of a real-world event type as a ratio of raw signals turning into real-world events; and
calculating probability details indicating a probabilistic model used to calculate the probability and features of the raw signal considered in calculating the probability; and
including the time dimension, the location dimension, the context dimension, the indication of the signal type, the indication of the signal source, and the content in the normalized signal, wherein the context dimension includes the probability and probability details; and
detecting an occurring real-world event of the real-world event type from the time dimension, location dimension, and context dimension included in the normalized signal; and
notifying one or more entities about the real-world event.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention extends to methods, systems, and computer program products for normalizing ingested signals. In general, signal ingestion modules ingest different types of raw structured and/or raw unstructured signals on an ongoing basis. The signal ingestion modules normalize raw signals into normalized signals having a Time, Location, Context (or “TLC”) dimensions. A Time (T) dimension can be a time of origin or alternatively “event time” of a signal. A Location (L) dimension can be anywhere across a geographic area, such as, a country (e.g., the United States), a State, a defined area, an impacted area, an area defined by a geo cell, an address, etc. A Context (C) dimension indicates circumstances surrounding formation/origination of a raw signal in terms that facilitate understanding and assessment of the raw signal. The Context (C) dimension of a raw signal can be derived from express as well as inferred signal features of the raw signal.
70 Citations
19 Claims
-
1. A method comprising:
-
ingesting a raw signal including a time stamp, an indication of a signal type, an indication of a signal source, and content; normalizing the raw signal into a normalized signal by reducing the dimensionality of the raw signal, including; determining a time dimension associated with the raw signal from the time stamp; inferring location annotations and context annotations from characteristics of the raw signal; determining a location dimension associated with the raw signal from one or more of;
location information included in the raw signal or the location annotations;determining a context dimension associated with the raw signal from one or more of;
context information included in the raw signal or the context annotations, including;calculating a probability of a real-world event type as a ratio of raw signals turning into real-world events; and calculating probability details indicating a probabilistic model used to calculate the probability and features of the raw signal considered in calculating the probability; and including the time dimension, the location dimension, the context dimension, the indication of the signal type, the indication of the signal source, and the content in the normalized signal, wherein the context dimension includes the probability and probability details; and detecting an occurring real-world event of the real-world event type from the time dimension, location dimension, and context dimension included in the normalized signal; and notifying one or more entities about the real-world event. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method comprising:
-
ingesting a raw signal including a time stamp, location information, an indication of a signal type, an indication of a signal source, and content; normalizing the raw signal into a normalized signal, including; determining a time dimension from the time stamp; determining a location dimension from the location information; inserting the time dimension and location dimension into a TL signal; storing the TL signal in TL message storage; subsequent to storing the TL signal, accessing the TL signal from the TL message storage; inferring context annotations based on characteristics of the TL signal, including one or more of;
the time dimension, the location dimension, the signal type, the signal source, and the content;appending the context annotations to the TL signal; determining a context dimension associated with the TL signal from the context annotations, including; calculating a probability of a real-world event type as a ratio of raw signals turning into real-world events; and calculating probability details indicating a probabilistic model used to calculate the probability and features of the raw signal considered in calculating the probability; and including the time dimension, the location dimension, the context dimension, the indication of the signal type, the indication of the signal source, and the content in the normalized signal, wherein the context includes the probability and probability details; and storing the normalized signal in TLC message storage; and detecting an occurring real-world event of the real-world event type from the time dimension, location dimension, and context dimension included in the normalized signal notifying one or more entities about the real-world event. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer system comprising:
-
a processor; system memory coupled to the processor and storing instructions configured to cause the processor to; ingest a raw signal including a time stamp, an indication of a signal type, an indication of a signal source, and content; normalize the raw signal into a Time, Location, Context (“
TLC”
) normalized signal by reducing the dimensionality of the raw signal, including;determine a time dimension associated with the raw signal from the time stamp; infer location annotations and context annotations from characteristics of the raw signal; determine a location dimension associated with the raw signal from one or more of;
location information included in the raw signal or the location annotations;determine a context dimension associated with the raw signal from one or more of;
context information included in the raw signal or the context annotations, including;calculate a probability of a real-world event type as a ratio of raw signals turning into real-world events; and calculating probability details indicating a probabilistic model used to calculate the probability and features of the raw signal considered in calculating the probability; and include the time dimension, the location dimension, the context dimension, the indication of the signal type, the indication of the signal source, and the content in the Time, Location, Context (“
TLC”
) normalized signal, wherein the context dimension includes the probability and probability details; anddetect an occurring real-world event of the real-world event type from the time dimension, location dimension, and context dimension included in the normalized signal; and notify one or more entities about the real-world event. - View Dependent Claims (16, 17, 18, 19)
-
Specification