System For and Method Of Identifying Closely Matching Textual Identifiers, Such As Domain Names
First Claim
1. A computer-implemented method of identifying a set of textual identifiers comprising:
- maintaining a log of requests to resolve unresolvable textual identifiers;
identifying a set of unique unresolvable textual identifiers in the log;
parsing the log for each identifier within the identifier set and counting a number of unique requests for each textual identifier to populate a first mapping of unique identifiers to their corresponding unique requests;
tokenizing to create a list including tokens for each of the unique identifiers contained within the first mapping;
mapping the tokens to corresponding unique unresolvable textual identifiers to populate a second mapping of tokens to a list of unique unresolvable textual identifiers with their corresponding number of unique requests and aggregated geolocation information; and
sorting the second mapping according to the number of unique requests associated with each of the tokens.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems provide tracking or logging requests to resolve non-existent domain (NXDomains) and organizing the NXDomains to support searching of the domain names including ranking the NXDomains based on popularity, e.g, number of hits or potential traffic based on the number of requests made for the NXDomain. NXDomain logs may be organized so that it supports searching by creating an inverted index including n-grams of the NXDomains. Searching includes identifying a target substring in one or more of the indexes, selecting those matching NXDomains satisfying some threshold criteria, and displaying the NXDomains in a selected order such as by demand or popularity associated with, for example, a selected geographical location from which resolution requests targeting respective NXDomains originate.
-
Citations
22 Claims
-
1. A computer-implemented method of identifying a set of textual identifiers comprising:
-
maintaining a log of requests to resolve unresolvable textual identifiers; identifying a set of unique unresolvable textual identifiers in the log; parsing the log for each identifier within the identifier set and counting a number of unique requests for each textual identifier to populate a first mapping of unique identifiers to their corresponding unique requests; tokenizing to create a list including tokens for each of the unique identifiers contained within the first mapping; mapping the tokens to corresponding unique unresolvable textual identifiers to populate a second mapping of tokens to a list of unique unresolvable textual identifiers with their corresponding number of unique requests and aggregated geolocation information; and sorting the second mapping according to the number of unique requests associated with each of the tokens. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-implemented method of identifying a set of textual identifiers comprising:
-
receiving a keyword; for a date range, selecting corresponding indices of tokens; searching the selected indices for tokens matching the keyword to populate a first mapping of dates within the date range to matching identifiers present in the selected corresponding indices; inverting the first mapping to populate a second mapping of unique identifiers to corresponding unique requests and aggregated geolocation information; returning a display list including the second mapping. - View Dependent Claims (11, 12)
-
-
13. A computer-implemented method of identifying a set of textual identifiers comprising:
-
receiving a keyword; searching selected indices for tokens matching the keyword to populate a first mapping of dates to of matching identifiers present in the selected corresponding indices; inverting the first mapping to populate a second mapping of unique identifiers to corresponding unique requests and aggregated geolocation information; summing a number of unique requests for each identifier and aggregate geolocation information into a third mapping of unique identifiers and their associated number of requests and aggregated geolocation data; sorting the third mapping according to the number of requests for each identifier and aggregated geolocation data; and returning a display list including the sorted third mapping. - View Dependent Claims (14, 15, 16, 17, 19, 20)
-
-
18. A data processing system for identifying a set of textual identifiers, the data processing system comprising a storage device including a storage medium, wherein the storage device stores computer usable program code;
- and a processor, wherein the processor executes the computer usable program code, and wherein the computer usable program code comprises;
computer usable code for maintaining a log of requests to resolve unresolvable textual identifiers; computer usable code for identifying a unique identifier set of unique unresolvable textual identifiers in the log; computer usable code for parsing the log for each identifier within the identifier set and counting a number of unique requests for each textual identifier to populate a first mapping of unique identifiers to their corresponding unique requests; computer usable code for tokenizing to create a list including tokens for each of the unique identifiers contained within the first mapping; computer usable code for mapping the tokens to corresponding unique unresolvable textual identifiers to populate a second mapping of tokens to a list of unique unresolvable textual identifiers with their corresponding number of unique requests and aggregated geolocation information; and computer usable code for sorting the second mapping according to the number of unique requests associated with each of the tokens.
- and a processor, wherein the processor executes the computer usable program code, and wherein the computer usable program code comprises;
-
21. A data processing system for identifying a set of textual identifiers, the data processing system comprising a storage device including a storage medium, wherein the storage device stores computer usable program code;
- and a processor, wherein the processor executes the computer usable program code, and wherein the computer usable program code comprises;
computer usable code for receiving a keyword; for the date range, selecting corresponding indexes of tokens; computer usable code for searching the selected indices for tokens matching the keyword to populate a first mapping of dates within the date range to matching identifiers present in the selected corresponding indices; computer usable code for inverting the first mapping to populate a second mapping of unique identifiers to corresponding unique requests and aggregated geolocation information; and computer usable code for returning a display list including the second mapping. - View Dependent Claims (22)
- and a processor, wherein the processor executes the computer usable program code, and wherein the computer usable program code comprises;
Specification