System and method for integrating and displaying travel advices gathered from a plurality of reliable sources
First Claim
1. A computerized travel system for providing advices and warnings per travel destination to end-users of the travel system, comprising:
- a client service module for receiving a query from an end user and delivering a merged report of travel advices and warnings for a queried travel destination;
a download and parsing module for parsing individual travel advice documents downloaded from a plurality of online reliable travel sources into cleaned documents sharing a common formatting structure including sections, sections headers and paragraphs and wherein, for each individual travel advice, tags are defined and text portions are extracted between the tags;
an algorithmic module for integrating the cleaned documents into a merged report of travel advices and warnings per travel destination wherein the integration of the cleaned documents into the merged report performed by;
i) choosing, among all the clean documents, a base document from which integration is performed;
ii) creating sections in the merged report using the section headers of the base document as section headers of the merged report;
iii) for each section of the base document extracting paragraphs from the base document and inserting said paragraphs extracted from the base document in the merged report under the section header corresponding to the section header of the section of the base document;
iv) comparing section headers of the cleaned documents other than the base document with the section headers of the base document to determine similarity values;
v) defining a similarity value threshold and defining pairs of comparable sections each made of a section of a cleaned document other than the base document and of a section of the base document and for which the similarity value is above the similarity value threshold;
vi) for each pair of comparable sections;
comparing the paragraphs of the section of the cleaned document other than the base document with the paragraphs of the section of the base document to determine similarity scores;
inserting the paragraphs of the section of the cleaned document other than the base document in the merged report, next to the paragraphs of the section of the base document returning the best similarity score;
a relational database for storing and retrieving all travel information and controls to operate the travel system including the cleaned documents and the merged report;
an administrative module to manage and control the building and delivery of the merged report to the end-user of the travel system.
1 Assignment
0 Petitions
Accused Products
Abstract
A computerized travel system and a method for providing advices and warnings per travel destination to end-users. The system has a client service module for receiving queries from the end-users, and delivering to them merged reports of travel advices and warnings for the queried travel destinations. A download and parsing module parses travel raw documents, downloaded from a plurality of online reliable travel sources, into cleaned raw documents sharing a common formatting structure including sections, sections headers and paragraphs. An algorithmic module integrates the cleaned raw documents into the merged reports of travel advices and warnings per travel destination. The integration is performed from a base document chosen among all the relevant clean raw documents for each queried travel destination. Comparable sections are determined on the basis of contents of their section headers and semantically close paragraphs are placed next to each other in corresponding sections of merged reports.
17 Citations
20 Claims
-
1. A computerized travel system for providing advices and warnings per travel destination to end-users of the travel system, comprising:
-
a client service module for receiving a query from an end user and delivering a merged report of travel advices and warnings for a queried travel destination; a download and parsing module for parsing individual travel advice documents downloaded from a plurality of online reliable travel sources into cleaned documents sharing a common formatting structure including sections, sections headers and paragraphs and wherein, for each individual travel advice, tags are defined and text portions are extracted between the tags; an algorithmic module for integrating the cleaned documents into a merged report of travel advices and warnings per travel destination wherein the integration of the cleaned documents into the merged report performed by; i) choosing, among all the clean documents, a base document from which integration is performed; ii) creating sections in the merged report using the section headers of the base document as section headers of the merged report; iii) for each section of the base document extracting paragraphs from the base document and inserting said paragraphs extracted from the base document in the merged report under the section header corresponding to the section header of the section of the base document; iv) comparing section headers of the cleaned documents other than the base document with the section headers of the base document to determine similarity values; v) defining a similarity value threshold and defining pairs of comparable sections each made of a section of a cleaned document other than the base document and of a section of the base document and for which the similarity value is above the similarity value threshold; vi) for each pair of comparable sections; comparing the paragraphs of the section of the cleaned document other than the base document with the paragraphs of the section of the base document to determine similarity scores; inserting the paragraphs of the section of the cleaned document other than the base document in the merged report, next to the paragraphs of the section of the base document returning the best similarity score; a relational database for storing and retrieving all travel information and controls to operate the travel system including the cleaned documents and the merged report; an administrative module to manage and control the building and delivery of the merged report to the end-user of the travel system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 14, 15, 16, 19)
-
-
8. A method in a travel system of integrating advices and warnings from a plurality of individual travel advice documents, comprising:
-
receiving, via a computer, a query for advices and warnings for a travel destination from and end-user; getting, via a computer, individual travel advice documents including text portions from a database of the travel system; providing a common formatting structure including sections, sections headers and paragraphs; parsing, via a computer, the individual travel advice documents, to obtain cleaned documents sharing the common formatting structure, said parsing step comprising for each individual travel advice document, the definition of tags and the extraction of text portions between the tags; integrating, via a computer, the cleaned documents into a merged report of travel advices and warnings for the travel destination, the integrating step including the further step of; i) choosing, among all the cleaned documents, a base document from which integration is performed; ii) creating sections in the merged report using the section headers of the base document as section headers of the merged report; iii) for each section of the base document, extracting paragraphs from the base document and inserting said paragraphs extracted from the base document in the merged report under the section header corresponding to the section header of the section of the base document; iv) comparing section headers of the cleaned documents other than the base document with the section headers of the base document to determine similarity values; v) defining a similarity value threshold and defining pairs of comparable sections each made of a section of a cleaned document other than the base document and of a section of the base document and for which the similarity value is above the similarity value threshold; vi) for each pair of comparable sections; comparing the paragraphs of the section of the cleaned document other than the base document with the paragraphs of the section of the base document to determine similarity scores; inserting the paragraphs of the section of the cleaned document other than the base document in the merged report, next to the paragraphs of the section of the base document returning the best similarity score; delivering, via a computer, the merged report of travel advices and warnings for the travel destination to the end-user of the travel system. - View Dependent Claims (9, 10, 11, 12, 13, 17, 18, 20)
-
Specification