Detection of polymorphic script language viruses by data driven lexical analysis
First Claim
1. A method of detecting script language viruses in data streams comprising:
- using a processor for;
preparing language description data corresponding to at least one script language;
preparing detection data for viral code corresponding to a script language virus;
lexically analyzing a data stream to identify the at least one script language;
lexically analyzing the data stream using the language description data to generate a stream of tokens;
generating viral code detection data by analyzing a plurality of samples of polymorphic script language viral code; and
lexically analyzing the stream of tokens using the detection data and the language description data to identify the script language virus.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for detecting script language viruses is provided. The apparatus includes a script language processor, a detection data processor and a detection engine. The script language processor prepares language description data corresponding to at least one script language. The detection data processor prepares detection data for viral code corresponding to the script language virus. The detection engine lexically analyzes a data stream using the language description data and the detection data to detect the viral code. The language description data may correspond to language definition rules and language check rules. The data stream may be converted to a stream of tokens, wherein the lexical analysis is performed on the token stream. The script language virus detection apparatus may be a computer program stored on a computer readable medium and/or transmitted via a computer network or other transmission medium.
51 Citations
22 Claims
-
1. A method of detecting script language viruses in data streams comprising:
-
using a processor for; preparing language description data corresponding to at least one script language;
preparing detection data for viral code corresponding to a script language virus;
lexically analyzing a data stream to identify the at least one script language;lexically analyzing the data stream using the language description data to generate a stream of tokens; generating viral code detection data by analyzing a plurality of samples of polymorphic script language viral code; and lexically analyzing the stream of tokens using the detection data and the language description data to identify the script language virus. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for detecting script language viruses, the method steps comprising:
-
preparing language description data corresponding to at least one script language;
preparing detection data for viral code corresponding to a script language virus;
lexically analyzing a data stream to identify the at least one script language;lexically analyzing the data stream using the language description data to generate a stream of tokens; generating viral code detection data by analyzing a plurality of samples of polymorphic script language viral code; and lexically analyzing the stream of tokens using the detection data and the language description data to identify the script language virus.
-
-
14. A computer system, comprising:
- a processor; and
a program storage device readable by the computer system, tangibly embodying a program of instructions executable by the processor to perform method steps for detecting script language viruses, the method steps comprising; preparing language description data corresponding to at least one script language;
preparing detection data for viral code corresponding to a script language virus; and
lexically analyzing a data stream to identify the at least one script language;lexically analyzing the data stream using the language description data to generate a stream of tokens; generating viral code detection data by analyzing a plurality of samples of polymorphic script language viral code; and lexically analyzing the stream of tokens using the detection data and the language description data to identify the script language virus.
- a processor; and
-
15. An apparatus for detecting script language viruses, comprising:
-
a script language processor, wherein the script language processor prepares language description data corresponding to at least one script language; a detection data processor, wherein the detection data processor prepares detection data for viral code corresponding to a script language virus and wherein the detection data processor generates viral code detection data by analyzing a plurality of samples of polymorphic script language viral code; and a detection engine, wherein the detection engine converts a data stream to a stream of tokens using lexical analysis, wherein the tokens correspond to respective language constructs, wherein the detection engine lexically analyzes the stream of tokens using the language description data and the detection data to identify the script language virus. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
-
22. A method, comprising:
-
using a processor for; receiving a data stream; lexically analyzing the data stream to identify a script language;
receiving language description data for the script language;lexically analyzing the data stream using the language description data to generate a stream of tokens; generating viral code detection data by analyzing a plurality of samples of polymorphic script language viral code; and lexically analyzing the stream of tokens using the viral code detection data and the language description data to identify at least one script language virus.
-
Specification