Internet and computer information retrieval and mining with intelligent conceptual filtering, visualization and automation
First Claim
1. A method for searching information comprising obtaining one or more information elements extracted from a first set of one or more files or parts thereof;
- ranking the one or more information elements based on one or more of the following ranking parameters;
a function of a link-based popularity rankings of the files from which an information element is extracted;
a function of a relevancy rankings of the files from which an information element is extracted;
a function of a date-based rankings of the files from which an information element is extracted;
ranking an information element higher if it can be extracted from more number of files, ranking an information element higher if it can be extracted from less number of files;
format of an information element;
relation of one or more information elements relative to one or more information elements in a second set of information elements;
location or roles of one or more information elements in the text;
context in which one or more information elements appear; and
the semantics of one or more information elements.
0 Assignments
0 Petitions
Accused Products
Abstract
The present invention presents embodiments of methods, systems, and computer-readable media for the retrieval, mining, filtering and visualization of information stored on a plural of computers connected to the Internet and on a local computer. Embodiments of this invention generate a conceptual search query using a description provided by a user, perform user selectable conceptual filtering of search results, concept following and link following to expand search results, search for files that may or may not contain certain information, rank concepts contained in search results or one or more files, compute relevancy rank of a file in search results, use conceptual path maps to display logic or statistical relationships among search results, monitor changes in information in a search or a file, and protect files or searches based on information contents.
-
Citations
20 Claims
-
1. A method for searching information comprising
obtaining one or more information elements extracted from a first set of one or more files or parts thereof; ranking the one or more information elements based on one or more of the following ranking parameters;
a function of a link-based popularity rankings of the files from which an information element is extracted;
a function of a relevancy rankings of the files from which an information element is extracted;
a function of a date-based rankings of the files from which an information element is extracted;
ranking an information element higher if it can be extracted from more number of files, ranking an information element higher if it can be extracted from less number of files;
format of an information element;
relation of one or more information elements relative to one or more information elements in a second set of information elements;
location or roles of one or more information elements in the text;
context in which one or more information elements appear; and
the semantics of one or more information elements.
- 2. The method of claim 2, wherein the first set is the results of a first search that is defined by one or more descriptions of the first search.
-
5. A method for displaying or organizing files into a structure comprising
organizing two or more files into two or more sets along a first dimension where the set membership is based on one or more information elements about or contained in the files, connecting two sets along the first dimension if there exists a first relationship between the two sets; -
organizing two or more files into two or more sets along a second dimension where the set membership is based on one or more information elements about or contained in the files; and
,connecting two sets along the second dimension if there exists a second relationship between the two sets. - View Dependent Claims (6, 7, 8, 9)
-
-
10. A method to compute a rank of a file in the results of a search comprising
identifying in the file one or more matching elements that are considered identical, equivalent or similar to part or all the description that defines the search as entered by a user; -
computing a relevancy ranking factor based on one or more of the following in the file;
the degree of identicalness, equivalence or similarity of the one or more matching elements to their counterparts in the description that defines the search;
the order of appearance of two or more matching elements compared with the order of appearance of their counterparts in the description that defines the search;
the relative position of two or more matching elements in a sentence or text structure;
the presence or absence of punctuation marks or other symbols between two or more matching elements;
the format in which one or more matching elements appear;
the role of one or more matching elements in the file;
the location or part of the file in which one or more matching elements appear; and
,the presence or absence of information that is similar to information that is specific to a user and the degree of the similarity. - View Dependent Claims (11)
-
-
12. A method for information monitoring comprising
providing an option in a browsing application window for monitoring changes in the content of a URL or in the results of a search that is being accessed in the window; -
when a user selects the option, checking for changes in the content of the URL or in the results of the search over a period of time; and
,alerting the user of the change if a change is detected. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A method to protect information comprising
maintaining a first set of one or more characteristics or information elements of one or more files or parts thereof or descriptions of contents that are to be protected; requiring a user to pass one or more security measures before allowing the user access to a second set of one or more files or parts thereof that match or contain some or all the information in the first set. - View Dependent Claims (18, 19, 20)
Specification