Search system for providing fulltext search over web pages of world wide web servers
Abstract
The present invention provides a search system for fulltext search over web pages of world wide web servers. The system saves memory by storing only the text, path and hyperlink data of a web page and excluding extraneous data. It comprises a server connected to an internet, a plurality of data groups containing web page data, and a management program. A user inputs search parameters, such as keywords, into the search system. The management program then uses the search parameters to find matching web pages through an index file within the data groups, generates path data for the matched web pages, and outputs the path and text data in a standard HTTP format. Because the search system retrieves only the text and path data of each web page and leaves out extraneous data, memory space on the server is saved.
12 Claims
1. A search system for providing fulltext search of web pages of world wide web servers connected to an internet, the search system comprising:
an internet server connected to the internet;
a plurality of data groups stored in the server, each of the data groups comprising data from web pages of one world wide web server connected to the internet; and
a management program stored in the server for managing operations of the server and providing users with the fulltext search service over the data groups, the management program comprising a data group creating module for creating the data group of each of the world wide web servers for fulltext search;
wherein each of the data groups in the server comprises:
a text file for recording the text data contained in each of the web pages of the corresponding world wide web server;
a path file for recording path data of each of the web pages contained in the text file of the same data group; and
an index file for providing fulltext search for text data contained in the text file of the same data group;
wherein according to at least one user specified search parameter, the management program uses the index file of each data group to search the text file of the same data group to find web pages of the corresponding world wide web server which fit the specified search parameter, uses the text file of the same data group to retrieve text data of each web page which fits the search parameter, uses the path file of the same data group to find the path data of each of the web pages of the corresponding world wide web server which fit the specified search parameter, and then outputs the result in a predetermined format, and when creating one data group for a world wide web server, the data group creating module first connects to the world wide web server through the internet, retrieves text and path data stored in the web pages of the world wide web server, creates one text file and one path file using the retrieved data, and then creates one index file using the text file for fulltext search of the text data contained in the text file, and after retrieving the text data and path data contained in each of the web pages, the management program abandons all the other data to save memory space.
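As an illustration only, the search flow recited in claim 1 can be sketched in Python. This is a minimal sketch under stated assumptions, not the patented implementation: the three files of a data group are modeled as in-memory structures, the index is a simple word-to-page inverted index, a page "fits" a query when it contains every query word, and the names `DataGroup`, `tokenize`, `build_index` and `search` are hypothetical.

```python
import re
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class DataGroup:
    """Hypothetical in-memory stand-in for the data group of one web server."""
    texts: list = field(default_factory=list)  # "text file": text of each page
    paths: list = field(default_factory=list)  # "path file": path of each page, same order
    index: dict = field(default_factory=dict)  # "index file": word -> set of page ids


def tokenize(text):
    """Split text into lowercase alphanumeric words (an assumed tokenization)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(group):
    """Build the index from the text file only, as the claim describes."""
    index = defaultdict(set)
    for page_id, text in enumerate(group.texts):
        for word in tokenize(text):
            index[word].add(page_id)
    group.index = dict(index)


def search(groups, query):
    """Return (path, text) pairs for pages that contain every query word."""
    words = tokenize(query)
    results = []
    if not words:
        return results
    for group in groups:
        # Use the index to find matching page ids within this data group.
        hits = set.intersection(*(group.index.get(w, set()) for w in words))
        for page_id in sorted(hits):
            # Use the path file and text file to assemble the output.
            results.append((group.paths[page_id], group.texts[page_id]))
    return results
```

Each data group is searched through its own index, and the page id links the index entry to the matching entries in the text and path structures. The output format here is a list of tuples; the claim only requires "a predetermined format".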
6. A method for creating a data group for a world wide web server connected to an internet in a full text search system, the search system comprising:
an internet server connected to the internet for storing the data group of the world wide web server; and
a management program stored in the server for managing operations of the server and creating the data group of the world wide web server;
the data group of the world wide web server comprising:
a text file for recording the text data contained in web pages of the world wide web server;
a path file for recording path data of each of the web pages in the text file of the data group; and
an index file for providing fulltext search for the text data contained in the text file of the data group;
the method of creating the data group comprising:
connecting the server with the world wide web server through the internet;
retrieving text data from each of the web pages of the world wide web server to create the text file;
retrieving path data from each of the web pages of the world wide web server to create the path file;
using text data contained in each of the web pages of the world wide web server to create the index file for providing fulltext search over the text data of the web pages in the world wide web server; and
after retrieving the text data and path data contained in each of the web pages, the management program abandoning all the other data to save memory space.
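The creation steps of claim 6 can be illustrated with a short Python sketch. It rests on several assumptions that are not in the claim: the pages are taken as already fetched (the connection step is omitted so the example runs offline), "text data" is taken to mean visible text outside `script` and `style` elements, the index is a plain word-to-page mapping, and the names `TextExtractor`, `extract_text` and `create_data_group` are hypothetical.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text and ignores tags, scripts, and styles."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0  # greater than zero while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())


def extract_text(html):
    """Return the visible text of one page as a single string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def create_data_group(pages):
    """Build (texts, paths, index) from a mapping of path -> raw HTML.

    The mapping is assumed to hold pages already retrieved from one server.
    """
    texts, paths, index = [], [], {}
    for path, html in pages.items():
        text = extract_text(html)
        page_id = len(texts)
        texts.append(text)   # entry in the "text file"
        paths.append(path)   # entry in the "path file"
        for word in text.lower().split():
            index.setdefault(word, set()).add(page_id)  # entry in the "index file"
        # The raw HTML is not stored: markup, scripts and images are dropped.
    return texts, paths, index
```

Only the extracted text and the path survive each loop iteration, which corresponds to the claimed step of abandoning all other data to save memory space.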
8. A search system for providing full text search of web pages of world wide web servers connected to an internet, the search system comprising:
an internet server connected to the internet;
a text file for recording the text data contained in each of the web pages;
a path file for recording path data of each of the web pages contained in the text file;
an index file for providing fulltext search for the text data contained in the text file; and
a management program stored on the server for managing operations of the server and using the path file and the index file to provide users with the fulltext search service, the management program comprising a data creation module for creating the path file, the index file and the text file for fulltext search;
wherein according to at least one user specified search parameter, the management program uses the index file of each data group to search the text file to find web pages of a world wide web server which fit the specified search parameter, uses the path file of each data group to find the path data of each of the web pages of the corresponding world wide web server which fit the specified search parameter, and then outputs the result in a predetermined format, and when creating data for a world wide web server, the data creation module first connects to the world wide web server through the internet, retrieves text and path data stored in the web pages of the world wide web server, creates one text file and one path file using the retrieved data, and then creates one index file using the text file for fulltext search of the text data contained in the text file, and after retrieving the text data and path data contained in each of the web pages, the management program abandons all the other data to save memory space.
Specification