Method and system for filtering text messages
Abstract
A system and method is provided for filtering text messages to determine which of the messages, if any, contains undesirable or unsuitable content. The method includes receiving a message containing at least one text character. The text is manipulated to generate a plurality of tokens that are compared with data contained in a plurality of databases. As a result, a plurality of probabilities are generated related to the suitability of the message. The probabilities are weighted to generate an overall probability related to the suitability of the message. The overall probability is then compared to a threshold value to rate the suitability of the message. Further, the rating associated with the message is displayed on a computer display screen or the like.
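The steps in the abstract (tokenize, look up, weigh, compare to a threshold, rate) can be sketched in Python. This is a minimal illustration under stated assumptions, not the patented implementation: the token statistics, the weights, the threshold, and every function name below are hypothetical, and a weighted average is only one assumed way of weighing the probabilities.

```python
import re

# Hypothetical per-token probabilities that a message is unsuitable.
# The abstract reads these from databases; the values here are invented.
WORD_STATS = {"gr8": 0.05, "h8": 0.90, "u": 0.10}
CHAR_BIGRAM_STATS = {"h8": 0.85, "gr": 0.10}

DEFAULT_PROBABILITY = 0.5  # assumed score for tokens with no statistics


def word_tokens(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def char_bigrams(text):
    """Produce overlapping two-character tokens from each word."""
    return [w[i:i + 2] for w in word_tokens(text) for i in range(len(w) - 1)]


def mean_probability(tokens, stats):
    """Average the looked-up probabilities for one set of tokens."""
    if not tokens:
        return DEFAULT_PROBABILITY
    return sum(stats.get(t, DEFAULT_PROBABILITY) for t in tokens) / len(tokens)


def rate_message(text, weights=(0.7, 0.3), threshold=0.5):
    """Return (overall probability of unsuitability, rating)."""
    probabilities = (
        mean_probability(word_tokens(text), WORD_STATS),
        mean_probability(char_bigrams(text), CHAR_BIGRAM_STATS),
    )
    # Weighted average of the per-strategy probabilities (an assumption).
    overall = sum(w * p for w, p in zip(weights, probabilities)) / sum(weights)
    rating = "unsuitable" if overall >= threshold else "suitable"
    return overall, rating


if __name__ == "__main__":
    for message in ("u r gr8", "i h8 u"):
        print(message, rate_message(message))
```

Unknown tokens fall back to a neutral 0.5 here so that they neither raise nor lower the score; a real system would presumably derive that default from its historical data.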
54 Claims
1. A method comprising the steps of:
receiving a text message comprising content entered by a user, the content having abbreviations or misspellings within a sentence or sentence fragment, the abbreviations or misspellings comprising text having at least one character;
generating a plurality of tokens through manipulation of the text, wherein each of the plurality of tokens is related to the content of the message;
comparing the plurality of tokens with data contained in at least one database to generate a plurality of probabilities related to the suitability of publicly displaying the message, the data comprising at least one trademark and statistics derived, at least in part, from historical message data comprising text abbreviations or misspellings;
weighing the probabilities to generate an overall probability related to the suitability of publicly displaying the message;
comparing the overall probability to a threshold value to derive a rating of the suitability of publicly displaying the message; and
displaying the rating associated with the message.
Dependent claims: 2-11
12. A method comprising the steps of:
manipulating a text message comprising content entered by a user, the content having abbreviations or misspellings within a sentence or sentence fragment, the text message having less than 161 characters;
generating a plurality of tokens, each of the plurality of tokens being related to the content of the message;
generating at least two probabilities related to the suitability of publicly displaying the message based, at least in part, on a comparison between the plurality of tokens and data contained in at least one database, the data comprising at least one trademark and statistics derived, at least in part, from historical message data comprising text abbreviations and misspellings;
generating an overall probability related to the suitability of publicly displaying the message based, at least in part, on a weighing of the at least two probabilities; and
displaying the message if the overall probability is within a defined range.
Dependent claims: 13-22
23. A method comprising the steps of:
manipulating text within a text message comprising content entered by a user, the content having abbreviations or misspellings within a sentence or sentence fragment, the text message having 160 characters or less;
generating a plurality of tokens, each of the plurality of tokens being related to the content of the message;
comparing the plurality of tokens with data contained in at least one database to generate a plurality of probabilities related to the suitability of publicly displaying the message, the data comprising at least one trademark and statistics derived, at least in part, from historical message data comprising text abbreviations or misspellings;
comparing historical data based on a selected tokenization strategy to generate a plurality of weights;
generating with the plurality of probabilities and the plurality of weights an overall probability related to the suitability of publicly displaying the message; and
comparing the overall probability to a threshold to rate the suitability of publicly displaying the message.
Dependent claims: 24-32
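Claim 23 derives a weight per tokenization strategy from historical data but does not say how. One possible reading, sketched below purely as an assumption, weights each strategy by how often its probability agreed with the recorded outcome for past messages. The scorers, the history records, and the function names are all hypothetical.

```python
# Hypothetical scorers: each maps a message to a probability of unsuitability
# using one tokenization strategy. The rules are toy stand-ins for a database.
SCORERS = {
    "words": lambda text: 0.9 if "h8" in text.lower().split() else 0.1,
    "characters": lambda text: 0.9 if "8" in text else 0.1,
}

# Hypothetical historical data: (message, was it judged unsuitable?).
HISTORY = [
    ("i h8 u", True),
    ("c u l8r", False),
    ("gr8 game 2nite every1", False),
    ("h8 this", True),
]


def strategy_weights(history, scorers, threshold=0.5):
    """Weight each strategy by its accuracy on the historical messages."""
    accuracy = {}
    for name, score in scorers.items():
        correct = sum((score(text) >= threshold) == label for text, label in history)
        accuracy[name] = correct / len(history)
    total = sum(accuracy.values())
    if total == 0:
        # No strategy was ever right: fall back to equal weights.
        return {name: 1 / len(scorers) for name in scorers}
    # Normalize so that the weights sum to 1.
    return {name: value / total for name, value in accuracy.items()}


def overall_probability(text, scorers, weights):
    """Combine the per-strategy probabilities using the derived weights."""
    return sum(weights[name] * score(text) for name, score in scorers.items())


if __name__ == "__main__":
    weights = strategy_weights(HISTORY, SCORERS)
    print(weights)
    print(overall_probability("gr8", SCORERS, weights))
```

With this toy history the word-level scorer is right every time and the character-level scorer only half the time, so the word-level probability counts twice as much in the overall figure.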
33. A method comprising the steps of:
selecting a tokenization strategy from a plurality of generated tokens comprising words, phrases, n-grams, numbers and characters;
manipulating text within a text message comprising content entered by a user, the content having abbreviations or misspellings within a sentence or sentence fragment, to generate data for each of the plurality of tokens selected wherein the data is related to the content of the message;
comparing the token data with data contained in at least one database to generate a plurality of probabilities related to the suitability of publicly displaying the message, the data comprising at least one trademark and statistics derived, at least in part, from historical message data comprising text abbreviations or misspellings;
weighing the probabilities to generate an overall probability related to the suitability of publicly displaying the message;
comparing the overall probability to a threshold to rate the suitability of publicly displaying the message; and
displaying the rating associated with the message.
Dependent claims: 34-43
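The token types named in claim 33 (words, phrases, n-grams, numbers and characters) can each be illustrated with a small tokenizer. The sketch below is an assumption about what such strategies might look like: the claim does not define them, and the function names, the two-word phrase length, and the character trigram size are hypothetical choices.

```python
import re


def words(text):
    """Lowercased alphanumeric words, so 'texting' forms like '2nite' survive."""
    return re.findall(r"[a-z0-9']+", text.lower())


def phrases(text, size=2):
    """Runs of `size` consecutive words joined by a single space."""
    w = words(text)
    return [" ".join(w[i:i + size]) for i in range(len(w) - size + 1)]


def ngrams(text, n=3):
    """Overlapping character n-grams over the whitespace-normalized text."""
    s = re.sub(r"\s+", " ", text.lower().strip())
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def numbers(text):
    """Every run of digits, including digits embedded in words."""
    return re.findall(r"\d+", text)


def characters(text):
    """Individual non-whitespace characters."""
    return [c for c in text.lower() if not c.isspace()]


STRATEGIES = {
    "words": words,
    "phrases": phrases,
    "ngrams": ngrams,
    "numbers": numbers,
    "characters": characters,
}


def tokenize(text, selected):
    """Apply each selected strategy and return its tokens keyed by name."""
    return {name: STRATEGIES[name](text) for name in selected}


if __name__ == "__main__":
    print(tokenize("C u at 8 2nite", ["words", "phrases", "numbers"]))
```

Keeping the strategies in a dictionary keyed by name makes the selection step a plain lookup, so adding a new strategy does not change the rest of the pipeline.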
44. A method comprising the steps of:
receiving a text message comprising content entered by a user, the content having abbreviations or misspellings within a sentence or sentence fragment, the abbreviations or misspellings comprising text having at least one character;
generating a plurality of tokens, each of the plurality of tokens being related to the content of the message;
comparing the plurality of tokens with data contained in at least one database to generate a plurality of probabilities related to the suitability of publicly displaying the message, the data comprising at least one trademark and statistics derived, at least in part, from historical message data comprising text abbreviations or misspellings;
weighing the probabilities to generate an overall probability related to the suitability of publicly displaying the message;
comparing the overall probability to a threshold to rate the suitability of the message; and
displaying the rating associated with the message.
Dependent claims: 45-54
Specification