
THEMATIC WEB CORPUS

  • US 20170140055A1
  • Filed: 11/17/2016
  • Published: 05/18/2017
  • Est. Priority Date: 11/17/2015
  • Status: Active Grant
First Claim

1. Computer-implemented method for building a Web corpus that relates to a theme, the method comprising, by a server storing an index of a search engine, sending, to a client, the URLs of pages of a Web corpus that relates to the theme, including:

  • receiving, from the client, a structured query that corresponds to the theme, the structured query consisting of a disjunction of at least one keyword;

    determining in the index the group that consists of the URLs of all pages that match the query, wherein the determining consists in:

    reading the keywords of the disjunction of the query on the index, thereby retrieving at least one set of URLs from the index, then performing on the retrieved at least one set of URLs a scheme of set operations that corresponds to the disjunction of the query, thereby leading to the group of URLs; and

    sending to the client the URLs of the group as a stream.
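
The steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: it assumes a toy in-memory inverted index, and the names `INDEX`, `read_keywords`, `match_disjunction`, and `stream_urls`, as well as the example URLs, are hypothetical. A disjunction of keywords is taken to correspond to a set union, and a Python generator stands in for the stream sent to the client.

```python
from typing import Dict, Iterable, Iterator, List, Set

# Hypothetical toy inverted index: keyword -> set of URLs of matching pages.
INDEX: Dict[str, Set[str]] = {
    "jazz": {"http://a.example/1", "http://b.example/2"},
    "blues": {"http://b.example/2", "http://c.example/3"},
    "rock": {"http://d.example/4"},
}


def read_keywords(index: Dict[str, Set[str]], keywords: Iterable[str]) -> List[Set[str]]:
    """Read each keyword of the disjunction on the index, one URL set per keyword."""
    return [index.get(keyword, set()) for keyword in keywords]


def match_disjunction(index: Dict[str, Set[str]], keywords: Iterable[str]) -> Set[str]:
    """Apply the set operation that corresponds to a disjunction (a union)."""
    group: Set[str] = set()
    for urls in read_keywords(index, keywords):
        group |= urls
    return group


def stream_urls(index: Dict[str, Set[str]], keywords: Iterable[str]) -> Iterator[str]:
    """Yield the URLs of the group one at a time, standing in for a stream.

    Sorting is an assumption made here only to give a deterministic order.
    """
    for url in sorted(match_disjunction(index, keywords)):
        yield url


if __name__ == "__main__":
    for url in stream_urls(INDEX, ["jazz", "blues"]):
        print(url)
```

In this sketch the structured query is simply the list `["jazz", "blues"]`; a page indexed under both keywords appears once in the resulting group because the union removes duplicates.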
