Method and system for providing a user agent string database
First Claim
1. A method, implemented on at least one computing device each of which has at least one processor, storage, and a communication platform connected to a network for determining a keyword from user agent strings, the method comprising:
receiving a plurality of user agent strings;
grouping the plurality of user agent strings into one or more clusters, wherein the one or more clusters comprise a first cluster that includes two or more user agent strings;
comparing the two or more user agent strings in the first cluster;
extracting a longest common subsequence among the two or more user agent strings;
removing the longest common subsequence from each user agent string to obtain a remaining subsequence; and
determining a keyword from the first cluster based on at least one of the longest common subsequence and the remaining subsequence, wherein the keyword represents a type of user agent information.
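The comparing, extracting, and removing steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that a "subsequence" is taken over tokens rather than characters, that tokens are split on whitespace, semicolons, and parentheses, and that the common subsequence for more than two strings is approximated by folding a pairwise longest common subsequence across the cluster. The function names and the sample strings are hypothetical.

```python
import re
from functools import reduce


def tokenize(ua):
    # Assumed tokenization: split on whitespace, semicolons, and parentheses.
    return re.findall(r"[^\s;()]+", ua)


def longest_common_subsequence(a, b):
    # Standard dynamic-programming LCS over two token lists.
    m, n = len(a), len(b)
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m - 1, -1, -1):
        for j in range(n - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = 1 + dp[i + 1][j + 1]
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    # Walk the table forward to recover one longest common subsequence.
    out, i, j = [], 0, 0
    while i < m and j < n:
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return out


def common_subsequence_of_cluster(token_lists):
    # Pairwise fold: always a common subsequence of every member,
    # but only an approximation of the true multi-string LCS.
    return reduce(longest_common_subsequence, token_lists)


def remove_subsequence(tokens, common):
    # Drop the tokens of `common` in order and keep everything else.
    rest, k = [], 0
    for tok in tokens:
        if k < len(common) and tok == common[k]:
            k += 1
        else:
            rest.append(tok)
    return rest


# Hypothetical cluster of two similar user agent strings.
cluster = [
    "Mozilla/5.0 (Linux; Android 4.4.2; Nexus 5) AppleWebKit/537.36",
    "Mozilla/5.0 (Linux; Android 4.4.2; SM-G900F) AppleWebKit/537.36",
]
token_lists = [tokenize(ua) for ua in cluster]
common = common_subsequence_of_cluster(token_lists)
remaining = [remove_subsequence(t, common) for t in token_lists]
print(common)
print(remaining)
```

In this sketch the shared tokens describe what the cluster has in common (here, platform and engine), while the leftover tokens differ per string and could serve as candidate keywords for another type of information, such as a device model. How a keyword is actually chosen from either part is left open here.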
Abstract
Method, system, and programs for determining a keyword from user agent strings are disclosed. In one example, a plurality of user agent strings is received. The plurality of user agent strings is grouped into one or more clusters. The one or more clusters comprise a first cluster that includes two or more user agent strings. The two or more user agent strings in the first cluster are compared. Based on the comparing, a keyword is determined from the first cluster. The keyword represents a type of user agent information.
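The abstract does not say how the strings are grouped. As one possible illustration only, the sketch below clusters strings greedily by character-level similarity using Python's standard `difflib.SequenceMatcher`. The function name `cluster_user_agents`, the 0.8 threshold, and the choice of the first member as cluster representative are all assumptions, not details from the disclosure.

```python
from difflib import SequenceMatcher


def cluster_user_agents(user_agents, threshold=0.8):
    # Greedy single-pass grouping: each string joins the first cluster
    # whose representative (its first member) is similar enough,
    # otherwise it starts a new cluster.
    clusters = []
    for ua in user_agents:
        for members in clusters:
            if SequenceMatcher(None, ua, members[0]).ratio() >= threshold:
                members.append(ua)
                break
        else:
            clusters.append([ua])
    return clusters


# Hypothetical input strings.
received = [
    "Mozilla/5.0 (Linux; Android 4.4.2; Nexus 5) AppleWebKit/537.36",
    "curl/7.35.0",
    "Mozilla/5.0 (Linux; Android 4.4.2; SM-G900F) AppleWebKit/537.36",
]
clusters = cluster_user_agents(received)
for members in clusters:
    print(len(members), members[0])
```

With these inputs the two Android strings land in one cluster of two or more members, which is the kind of "first cluster" the abstract then compares; the result of a greedy pass like this depends on input order and on the threshold.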
17 Claims
1. A method, implemented on at least one computing device each of which has at least one processor, storage, and a communication platform connected to a network for determining a keyword from user agent strings, the method comprising:
receiving a plurality of user agent strings;
grouping the plurality of user agent strings into one or more clusters, wherein the one or more clusters comprise a first cluster that includes two or more user agent strings;
comparing the two or more user agent strings in the first cluster;
extracting a longest common subsequence among the two or more user agent strings;
removing the longest common subsequence from each user agent string to obtain a remaining subsequence; and
determining a keyword from the first cluster based on at least one of the longest common subsequence and the remaining subsequence, wherein the keyword represents a type of user agent information.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system having at least one processor, storage, and a communication platform for determining a keyword from user agent strings, the system comprising:
a user agent receiver configured for receiving a plurality of user agent strings;
a user agent clustering unit configured for grouping the plurality of user agent strings into one or more clusters, wherein the one or more clusters comprise a first cluster that includes two or more user agent strings;
a user agent comparing unit configured for comparing the two or more user agent strings in the first cluster;
a subsequence extractor configured for extracting a longest common subsequence among the two or more user agent strings;
a subsequence removing unit configured for removing the longest common subsequence from each user agent string to obtain a remaining subsequence; and
a keyword determiner configured for determining a keyword from the first cluster based on at least one of the longest common subsequence and the remaining subsequence, wherein the keyword represents a type of user agent information.
Dependent claims: 9, 10, 11, 12, 13
14. A non-transitory machine-readable medium having information recorded thereon for determining a keyword from user agent strings, wherein the information, when read by the machine, causes the machine to perform the following:
receiving a plurality of user agent strings;
grouping the plurality of user agent strings into one or more clusters, wherein the one or more clusters comprise a first cluster that includes two or more user agent strings;
comparing the two or more user agent strings in the first cluster;
extracting a longest common subsequence among the two or more user agent strings;
removing the longest common subsequence from each user agent string to obtain a remaining subsequence; and
determining a keyword from the first cluster based on at least one of the longest common subsequence and the remaining subsequence, wherein the keyword represents a type of user agent information.
Dependent claims: 15, 16, 17
Specification