Data detection of a character sequence having multiple possible data types
First Claim
Patent Images
1. A method to process a sequence of characters, the method comprising:
- receiving the sequence of characters;
converting, by a lexer embodied in a processor, the sequence of characters into a sequence of tokens, each token comprising a lexeme and a token type,wherein each lexeme comprises one or more of the sequence of characters,wherein each token type describes at least one of a plurality of predetermined classes ascribed to lexemes by the lexer, andwherein the conversion of characters into tokens includes ascribing multiple, different predetermined classes to a single token using one token type, wherein the lexeme for the single token may function as any one of the multiple, different predetermined classes ascribed to the single token; and
analyzing, by a parser embodied in the processor, the sequence of tokens to detect data types within the sequence of characters based upon the token types of the sequence of tokens, including the token type ascribing the multiple predetermined classes to the single token.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of processing a sequence of characters, the method comprising converting the sequence of characters into a sequence of tokens so that each token comprises a lexeme and one of a plurality of token types. Each of the plurality of token types relates to at least one of a plurality of predetermined functions, wherein at least one said token type relates to multiple functions of the plurality of predetermined functions.
-
Citations
16 Claims
-
1. A method to process a sequence of characters, the method comprising:
-
receiving the sequence of characters; converting, by a lexer embodied in a processor, the sequence of characters into a sequence of tokens, each token comprising a lexeme and a token type, wherein each lexeme comprises one or more of the sequence of characters, wherein each token type describes at least one of a plurality of predetermined classes ascribed to lexemes by the lexer, and wherein the conversion of characters into tokens includes ascribing multiple, different predetermined classes to a single token using one token type, wherein the lexeme for the single token may function as any one of the multiple, different predetermined classes ascribed to the single token; and analyzing, by a parser embodied in the processor, the sequence of tokens to detect data types within the sequence of characters based upon the token types of the sequence of tokens, including the token type ascribing the multiple predetermined classes to the single token. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of processing data, comprising:
-
receiving, by a parser embodied in the processor, a sequence of tokens, each token converted by a lexer from a sequence of characters and comprising a lexeme and a token type, wherein each lexeme comprises one or more of the sequence of characters, wherein each token type describes at least one of a plurality of predetermined classes ascribed to lexemes by the lexer, and, wherein the conversion of characters into tokens includes ascribing multiple, different predetermined classes to a single token using one token type, wherein the lexeme for the single token may function as any one of the multiple, different predetermined classes ascribed to the single token; and analyzing, by the parser, the sequence of tokens to detect the data types within the sequence of characters based upon the token types of the sequence of tokens, including the token type ascribing the multiple predetermined classes to the single token. - View Dependent Claims (8)
-
-
9. An apparatus to detect predetermined data in a sequence of characters, the apparatus comprising:
-
an interface to receive the sequence of characters; and a processing device coupled to the interface, the processing device implementing a lexer and a parser, the lexer to convert the sequence of characters into a sequence of tokens, each token comprising a lexeme and a token type, wherein each lexeme comprises one or more of the sequence of characters, wherein each token type describes at least one of a plurality of predetermined classes that correspond to data types, wherein the conversion of characters into tokens includes ascribing multiple, different predetermined classes to a single token using one token, wherein the lexeme for the single token may function as any one of the multiple, different predetermined classes ascribed to the single token; and the parser to analyze the sequence of tokens to detect data types within the sequence of characters based upon the token types of the sequence of tokens, including the token type ascribing the multiple predetermined classes to the single token. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A non-transitory machine-readable storage medium storing executable program instructions, which, when executed, cause a data processing system to process a sequence of characters by:
-
receiving the sequence of characters; converting, by a lexer, the sequence of characters into a sequence of tokens, each token comprising a lexeme and a token type, wherein each lexeme comprises one or more of the sequence of characters, wherein each token type describes at least one of a plurality of predetermined classes ascribed to lexemes by the lexer, and wherein the conversion of characters into tokens includes ascribing multiple, different predetermined classes to a single token using one token type, wherein the lexeme for the single token may function as any one of the multiple, different predetermined classes ascribed to the single token; and analyzing, by a parser embodied in the processor, the sequence of tokens to detect data types within the sequence of characters based upon the token types of the sequence of tokens, including the token type ascribing the multiple predetermined classes to the single token.
-
-
15. An apparatus to detect predetermined data in a sequence of characters, the apparatus comprising:
-
an interface to receive the sequence of characters; and a processing device coupled to the interface, the processing device implementing a lexer and a parser, the parser to receive a sequence of tokens, each token converted by the lexer from the sequence of characters and comprising a lexeme and a token type, wherein each lexeme comprises one or more of the sequence of characters, wherein each token type describes at least one of a plurality of predetermined classes ascribed to lexemes by the lexer, and, wherein the conversion of characters into tokens includes ascribing multiple, different predetermined classes to a single token using one token type, wherein the lexeme for the single token may function as any one of the multiple, different predetermined classes ascribed to the single token; and analyze the sequence of tokens to detect data types within the sequence of characters based upon the token types of the sequence of tokens, including the token type ascribing the multiple predetermined classes to the single token.
-
-
16. A non-transitory machine-readable storage medium storing executable program instructions, which, when executed cause a data processing system to detect data in a sequence of characters by:
-
receiving, by a parser, a sequence of tokens, each token converted by a lexer from the sequence of characters and comprising a lexeme and a token type, wherein each lexeme comprises one or more of the sequence of characters, wherein each token type describes at least one of a plurality of predetermined classes ascribed to lexemes by the lexer, and, wherein the conversion of characters into tokens includes ascribing multiple, different predetermined classes to a single token using one token type, wherein the lexeme for the single token may function as any one of the multiple, different predetermined classes ascribed to the single token; and analyzing, by the parser, the sequence of tokens to detect data types within the sequence of characters based upon the token types of the sequence of tokens, including the token type ascribing the multiple predetermined classes to the single token.
-
Specification