×

Search engine

  • US 20030187835A1
  • Filed: 03/24/2003
  • Published: 10/02/2003
  • Est. Priority Date: 09/27/2000
  • Status: Active Grant
First Claim
Patent Images

1. A search engine intended for extracting a list of data (12) from a database (3) in response to a request (9, 20, 24, 26), said database (3) having a matrix structure and comprising a plurality of rows Pi(i=1 to N) of recorded data Dij, said rows Pi being called profiles, and a plurality of columns Cj(j=1 to M) called fields, each field Cj defining a predetermined category of data Dij which are associated therewith, each profile Pi comprising an identifier Ei for identifying it uniquely and data each associated with a field Cj of said base, each profile Pi of the database comprising at least one data item Dij associated with at least one of said fields Cj, said search engine containing a first set (J1) of instruction codes which can be read or stored on a support, characterized in that the codes of said first set (J1) are executable by a first computer terminal, the so-called search terminal (TR), said search terminal (TR) comprising storage means (2) for storing said database and being linked to a telecommunication network (R), so as to perform the successive operations comprising the steps consisting in:

  • 1) sending, by way of said telecommunication network (R) and bound for a second computer terminal, the so-called interrogation terminal (TI), an interrogation software module (8) containing a second set (J2) of instruction codes which can be read or stored on a support and are executable by said interrogation terminal (TI) so as to;

    1a) construct a reference profile Pr by associating reference data Drj with at least one field Cj among a collection of fields of said base, a so-called collection of accessible fields, the construction of said reference profile Pr being performed directly or indirectly by a user (U) of said interrogation terminal (TI);

    1b) send to said search terminal (TR) a request comprising said reference profile Pr and designating an interrogation domain, said interrogation domain comprising at least one field Cm of said database (3), each field Cm of said interrogation domain being preselected or having to be selected by a user (U) of said interrogation terminal (TI);

    2) reading said request received by way of said telecommunication network (R);

    3) extracting from said database (3) a sub-base (SB) of profiles Pk to be compared with said reference profile Pr, said sub-base (SB) comprising all or some of the profiles of said database (3);

    4) calculating occurrence counters, a distinct occurrence counter Qn being calculated for each field Cn of a comparison domain (DC), said comparison domain (DC) consisting of all said database (3) fields Cj with which a data item Drj of said reference profile Pr is associated, the calculation of the occurrence counters Qn comprising the steps consisting successively in;

    4a) selecting a field Cn of said comparison domain (DC), 4b) once for each profile Pk of said sub-base (SB), performing a test of correspondence between the reference profile Pr data item Drn associated with said field Cn, the so-called reference data item, and the data item Dkn of the profile Pk of the sub-base (SB) associated with said field Cn, the so-called data item to be compared, the result of said test of correspondence being a boolean number, the result of said test being true whenever said data item to be compared Dkn corresponds through a predetermined logic law with said reference data item Drn, the result of said test being in particular false when said data item to be compared is empty or non-existent, 4c) rendering said occurrence counter Qn equal to the number of said sub-base (SB) profiles Pk for which the result of said test of correspondence performed in step 4b) is true, 4d) repeating steps 4a) to 4c) by selecting another field Cn of the comparison domain (DC) which has not yet been selected in step 4a), if one exists;

    5) calculating partial coefficients, a distinct partial coefficient an being calculated for each field Cn of said comparison domain (DC) whose occurrence counter Qn calculated in step 4c) is nonzero, each partial coefficient an being rendered equal to the value F(Qn) taken by a predetermined real weighting function F(x) of at least one variable x, when said variable x takes the value of said occurrence counter Qn;

    6) calculating weighting coefficients CPk, a distinct weighting coefficient being calculated for each profile Pk of said sub-base (SB), each weighting coefficient CPk calculated being equal to a sum of distinct partial coefficients α

    n calculated in step

         5), said sum pertaining to the partial coefficients α

    n of all those of the fields Cn of the comparison domain (DC) for which the result of the test of correspondence performed in step

         4) between said reference profile Pr and the profile Pk to which said weighting coefficient CPk refers is true, and pertaining only to these fields;

    7) calculating weighted frequencies FPqm, a weighted frequency FPqm being calculated for each data item Dim associated with a field Cm of the interrogation domain in said sub-base (SB) of profiles, the calculation of the weighted frequencies FPqm comprising the steps consisting, for each field Cm of the interrogation domain, in 7a) selecting a data item Dqm associated with said field Cm, said selected data item arising from a profile Pq among the collection of profiles Pk of said sub-base (SB) which comprise a nonempty data item Dkm associated with said field Cm, 7b) assigning said selected data item Dqm a weighted frequency FPqm equal to the sum of the weighting coefficients CPk of the profiles Pk of said collection of profiles whose data item Dkm associated with said field Cm is substantially identical to said selected data item Dqm, 7c) repeating steps 7a) and 7b) by selecting another data item which has not yet been selected in step 7a), if one exists;

    8) generating a response (12) to said request, said response comprising a list of data Dqm arising from said sub-base (SB), the so-called response data, said response data Dqm being those which possess the highest weighted frequencies FPqm among the data associated with the fields Cm of the interrogation domain, the maximum number NR of response data Dqm associated with each field Cm of the interrogation domain being predetermined and greater than or equal to one;

    9) sending said response to a response recipient by way of the telecommunication network (R).

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×