Method for extracting information utilizing a user-context-based search engine
First Claim
1. A method for extracting information from the internet, the method programmed in a computer readable-medium to be executed by a processor operably connected thereto, the method comprising:
mining to gather and organize information from the Internet to form a database having a hierarchical schema;
acquiring from a user a textual query comprising a collection of words each having multiple meanings depending on the context of use, at least one of which meanings is descriptive of information sought by the user;
deriving a micro-context comprising a plurality of words corresponding to the textual query, each word of the plurality of words assigned a relative weighting derived from the patterns of occurrence thereof in web pages accessed by the user during the user's navigation through the Internet;
operating independently from the hierarchical schema to locate a subset of the information in the database, the subset corresponding to the micro-context; and
presenting the subset to the user.
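The deriving and locating steps of the claim can be illustrated with a minimal sketch. The claim does not state a weighting formula, so the normalized co-occurrence counts used here are an assumption, as are the stopword list, the record fields ("path", "url", "text"), and the function names derive_micro_context and locate_subset. The search scores records by their text only and never consults "path", which is one plausible reading of operating independently from the hierarchical schema.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption; not taken from the claim).
STOPWORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic words, dropping stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def derive_micro_context(query, visited_pages, top_n=10):
    """Map context words to relative weights that sum to 1.

    Hypothetical weighting: count how often each word occurs in visited
    pages that share at least one word with the query, then normalize.
    """
    query_words = set(tokenize(query))
    counts = Counter(query_words)  # each query word is counted at least once
    for page_text in visited_pages:
        words = tokenize(page_text)
        if query_words.intersection(words):
            counts.update(words)
    top = counts.most_common(top_n)
    total = sum(count for _, count in top)
    return {word: count / total for word, count in top}


def locate_subset(database, micro_context, limit=5):
    """Rank records by the summed weight of the context words they contain.

    The hierarchical position stored in record["path"] is deliberately ignored.
    """
    scored = []
    for record in database:
        words = set(tokenize(record["text"]))
        score = sum(w for word, w in micro_context.items() if word in words)
        if score > 0:
            scored.append((score, record))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [record for _, record in scored[:limit]]
```

With a query such as "jaguar" and a browsing history made up of car-related pages, words like "car" and "engine" receive weight, so an automotive record outranks a wildlife record even though both contain the query word.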
Abstract
A data extraction tool is provided for cataloging information in an information source for searching by a user. The tool mines information from the information source and organizes the information, or the locations of that information, within a database. A user may then query the tool for a desired type of information. The tool filters the database to provide a set of pinpoint site locations with information of the type requested in the query. These pinpoint site locations are presented to a user and indexed for future reference. The index of site locations may be updated automatically by the tool. A context system is provided for manually or automatically determining the proper context for a user's query. Thus, the data extraction tool provides information with a high probability of relevance to the user. The user obtains the information without expending much effort to refine the search.
4 Claims
Claim 1 is the independent claim quoted above; claims 2, 3, and 4 depend on it.
Specification