Cognitive information security using a behavioral recognition system
First Claim
1. An apparatus, comprising:
- at least one processor; and
a memory storing processor-executable instructions and in communication with the at least one processor, the instructions in the memory comprising instructions for the at least one processor to;
generate a dictionary of words based on a statistical distribution of combinations of symbols, the symbols being generated based on an ordered stream of normalized data;
dynamically update the dictionary of words based on a subsequent stream of the ordered stream of normalized data;
determine an unusualness score for at least one word in the dictionary of words based on a frequency of occurrence of the at least one word; and
publish an alert based on the unusualness score, the alert indicating an anomaly in the normalized data.
5 Assignments
0 Petitions
Accused Products
Abstract
Embodiments presented herein describe techniques for generating a linguistic model of input data obtained from a data source (e.g., a video camera). According to one embodiment of the present disclosure, a sequence of symbols is generated based on an ordered stream of normalized vectors generated from the input data. A dictionary of words is generated from combinations of the ordered sequence of symbols based on a frequency at which combinations of symbols appear in the ordered sequence of symbols. A plurality of phrases is generated based an ordered sequence of words from the dictionary observed in the ordered sequence of symbols based on a frequency by which combinations of words in ordered sequence of words appear relative to one another.
-
Citations
19 Claims
-
1. An apparatus, comprising:
-
at least one processor; and a memory storing processor-executable instructions and in communication with the at least one processor, the instructions in the memory comprising instructions for the at least one processor to; generate a dictionary of words based on a statistical distribution of combinations of symbols, the symbols being generated based on an ordered stream of normalized data; dynamically update the dictionary of words based on a subsequent stream of the ordered stream of normalized data; determine an unusualness score for at least one word in the dictionary of words based on a frequency of occurrence of the at least one word; and publish an alert based on the unusualness score, the alert indicating an anomaly in the normalized data. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
2. A computer-implemented method, comprising:
-
generating, via at least one processor, a dictionary of words based on a statistical distribution of combinations of symbols, the symbols being generated based on an ordered stream of normalized data; determining, via the at least one processor, an unusualness score for at least one word in the dictionary of words based on a frequency of occurrence of the at least one word; and publishing, via the at least one processor, an alert based on the unusualness score, the alert indicating an anomaly in the normalized data. - View Dependent Claims (3, 4, 5, 6, 7, 8)
-
-
16. A non-transitory processor-readable storage medium storing processor-executable instructions to:
-
generate a dictionary of words based on a statistical distribution of combinations of symbols, the symbols being generated from an ordered stream of normalized data; update the dictionary of words based on a subsequent stream of the ordered stream of normalized data; determine an unusualness score for at least one word in the dictionary of words based on a frequency of occurrence of the at least one word; and issue an alert based on the unusualness score, the alert indicating an anomaly in the normalized data. - View Dependent Claims (17, 18, 19)
-
Specification