MULTILEVEL INTENT ANALYSIS METHOD FOR EMAIL FILTRATION
First Claim
Patent Images
1. A method, tangibly embodied as a program product encoded on computer readable media, for analyzing a document to determine if the document fits in a category, the method comprising the steps of:
- extracting at least one uniform resource identifiers (uri) from a first document;
following a uri from a first document to a second document or a redirect;
extracting a uri from the redirect or the second document;
matching said extracted uri with a member of a database; and
determining said first document fits in the category associated with the uri found in said database.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for filtering email which contains links to uniform resource identifiers which disguise the content and identity of spam sites by multiple serial redirection.
32 Citations
32 Claims
-
1. A method, tangibly embodied as a program product encoded on computer readable media, for analyzing a document to determine if the document fits in a category, the method comprising the steps of:
-
extracting at least one uniform resource identifiers (uri) from a first document; following a uri from a first document to a second document or a redirect; extracting a uri from the redirect or the second document; matching said extracted uri with a member of a database; and determining said first document fits in the category associated with the uri found in said database. - View Dependent Claims (2, 3, 4)
-
-
5. A method, tangibly embodied as a program product encoded on computer readable media, for analyzing a document to determine if the document fits in a category, the method comprising the steps of
extracting at least one link from a document; -
following a link from the document to a second document or redirect; extracting a text string from a redirect on the second document; matching a text string extracted from a redirect with a member of a database; and determining said first document fits in the category associated with the text string found in the database.
-
-
6. A method for analyzing a document comprising the steps of
selecting links, following links, matching links in a database, and operating on a document wherein operating on a document comprises the process of causing the document to be blocked, deleted, diverted to a spam mailbox, marked with warning messages, tagged with a string, sterilized, quarantined, or modified, depending on the grade value; - or notifying user of category.
- View Dependent Claims (7, 8, 9, 10)
-
11. A method for matching links comprising the steps of:
extracting a domain name from a uri received with a redirection instruction and matching the domain name with one of a first category of websites in a database. - View Dependent Claims (12, 13, 14, 15, 16)
-
17. A method for email client multilevel content filtering of electronic documents, comprising the following processes:
-
analyzing at least one electronic document to extract at least one embedded uniform resource identifier (uri); extracting a website from the uri; operating on the electronic document if at least one website embedded in the document matches with a database. - View Dependent Claims (18)
-
-
19. A method comprising the steps following:
-
scanning an electronic document for at least one embedded uniform resource identifier; and querying a database of categorized uniform resource identifiers to determine if the embedded uniform resource identifier matches. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. An article of manufacture comprising computer readable media on which is encoded instructions adapted to control a processor in matching a pattern expression of categorized uniform resource identifiers.
-
32. A computing system for multilevel domain redirection analysis comprising a processor adapted to perform the methods following coupled to a storage in which is tangibly encoded computer readable instructions which adapt the processor to access a database of categorized websites and analyze an electronic document to extract an embedded uniform resource identifier;
-
extract a website from the uri; match the website with a database of categorized websites; fetch data at the uri location; if there is redirection, extract another website, if there is a match, take action on the electronic document, and exhausting all the uri'"'"'s embedded in a document.
-
Specification