APPARATUS, METHOD AND ARTICLE TO INTERACT WITH SOURCE FILES IN NETWORKED ENVIRONMENT
First Claim
Patent Images
1. A computer-implemented method of interacting with data source files stored on computer-readable media and identified by logical addresses, the method comprising:
- computationally identifying a number of keys from a content of a data source file to be considered in determining a subject of the data source file based at least in part on at least one of;
a frequency of appearance of the key in the content of the data source file, a component type associated with the key in the content of the data source file, an indication indicative of at least a presence of a combination of alpha characters and non-alpha characters in the key, a formatting characteristic assigned to the key in the content of the data source file, a tuple size associated with the key, or a lack of inclusion of the key in a dictionary;
computationally identifying a number of keys from the content of the data source file to be ignored in determining the subject of the data source file based at least in part on at least one of an inclusion of the key in set of core language words, an inclusion of the key in a domain specification portion of a uniform resource locator, an association of the key in the content of the data source file with a first subset of component types and not with a second subset of component types, or a formatting characteristic assigned to the key in the content of the data source file;
computationally applying weights to at least some of the keys that are computationally identified to be considered and not computationally identified to be ignored; and
computationally determining the subject of the data source file based at least in part on a result of computationally applying the weights.
1 Assignment
0 Petitions
Accused Products
Abstract
A subject of a file'"'"'s content which is stored on a physical storage medium at a logical address in a networked system may be determined and used to identify the file to facilitate retrieval and/or the provision of information about the file, for example via forums or messages.
-
Citations
42 Claims
-
1. A computer-implemented method of interacting with data source files stored on computer-readable media and identified by logical addresses, the method comprising:
-
computationally identifying a number of keys from a content of a data source file to be considered in determining a subject of the data source file based at least in part on at least one of;
a frequency of appearance of the key in the content of the data source file, a component type associated with the key in the content of the data source file, an indication indicative of at least a presence of a combination of alpha characters and non-alpha characters in the key, a formatting characteristic assigned to the key in the content of the data source file, a tuple size associated with the key, or a lack of inclusion of the key in a dictionary;computationally identifying a number of keys from the content of the data source file to be ignored in determining the subject of the data source file based at least in part on at least one of an inclusion of the key in set of core language words, an inclusion of the key in a domain specification portion of a uniform resource locator, an association of the key in the content of the data source file with a first subset of component types and not with a second subset of component types, or a formatting characteristic assigned to the key in the content of the data source file; computationally applying weights to at least some of the keys that are computationally identified to be considered and not computationally identified to be ignored; and computationally determining the subject of the data source file based at least in part on a result of computationally applying the weights. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A networked computing system, comprising:
-
at least one networked computer, including at least one processor and at least one processor-readable storage medium that stores instructions that when executed by the at least one processor causes the at least one processor to associate a number of logical network addresses of a plurality of data source files with a number of logical network addresses of a number of forums based on subject, where some of the data source files are identified by multiple logical network addresses and where a content of some of the data source files identified by a single logical network address differs based on an active content component of the source data file, by; for each of a number of data source files, computationally identifying a number of keys from the content of the data source file to be considered in determining a subject of the data source file based on a raw content of the data source file, a formatting of the content of the data source file and an organizational aspect of a presentation of the content of the data source file; computationally identifying a number of keys from the content of the data source file to be ignored in determining the subject of the data source file based on a raw content of the data source file, a formatting of the content of the data source file and an organizational aspect of a presentation of the data source file; computationally applying weights to at least some of the keys that are computationally identified to be considered and not computationally identified to be ignored; and computationally determining the subject of the data source file based at least in part on a result of computationally applying the weights. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31)
-
-
32. At least one computer-readable medium that stores instructions that when executed by at least one computer system cause the at least one computer system to associate a number of logical network addresses of a plurality of data source files with a number of logical network addresses of a number of forums based on subject matter, where some of the data source files are identified by multiple logical network addresses and where a content of some of the data source files identified by a single logical network address differs based on an active content component of the source data file, by:
-
for each of a number of data source files, computationally identifying a number of keys from the content of the data source file to be considered in determining a subject of the data source file based on a raw content of the data source file, a formatting of the content of the data source file and an organizational aspect of a presentation of the content of the data source file; computationally identifying a number of keys from the content of the data source file to be ignored in determining the subject of the data source file based on a raw content of the data source file, a formatting of the content of the data source file and an organizational aspect of a presentation of the data source file; computationally applying weights to at least some of the keys that are computationally identified to be considered and not computationally identified to be ignored; and computationally determining the subject of the data source file based at least in part on a result of computationally applying the weights. - View Dependent Claims (33, 34, 35, 36, 37, 38, 39, 40, 41, 42)
-
Specification