×

Software web crowler and method therefor

  • US 7,693,872 B2
  • Filed: 08/16/2007
  • Issued: 04/06/2010
  • Est. Priority Date: 08/17/2006
  • Status: Active Grant
First Claim
Patent Images

1. A computerized system comprising:

  • a crawler obtaining multimedia files from a network, the crawler comprising;

    a multi-threaded downloader that downloads web pages;

    a queue storing links corresponding to the download web pages;

    a scheduler obtaining the stored links from the queue and passing the obtained links to the multi-threaded downloader, wherein the multi-threaded downloader downloads multiple multimedia files concurrently from said links;

    a multimedia processor receiving said multimedia files from the crawler and processing said multimedia files by translating speech in the multimedia files into a textual representation,wherein said multimedia processor determines sound effects in said multimedia files by comparing said sound effects in said multimedia files against a predetermined set of sounds, wherein generated metadata is determined by the comparison, and wherein said metadata comprises keywords identifying a type of said sound effects;

    a data mining module that extracts text information from the textual representation; and

    an indexer that indexes the multimedia files based on said keywords and said text information.

View all claims
  • 20 Assignments
Timeline View
Assignment View
    ×
    ×