×

SYSTEM AND METHOD OF STRUCTURING DATA FOR SEARCH USING LATENT SEMANTIC ANALYSIS TECHNIQUES

  • US 20110225159A1
  • Filed: 01/27/2011
  • Published: 09/15/2011
  • Est. Priority Date: 01/27/2010
  • Status: Active Grant
First Claim
Patent Images

1. A computer-based method of organizing data for search, the method comprising the steps of:

  • accessing a domain corpus;

    parsing the domain corpus into a plurality of documents;

    parsing each document into at least one term that corresponds to the document;

    generating a term-to-document matrix that correlates each document with the at least one term that corresponds to the document, the at least one term defining a document node for the document;

    performing a singular value decomposition and a dimension reduction on the term-to-document matrix to form a reformed term-to-document matrix having document nodes with fewer dimensions than the document nodes of the term-to-document matrix;

    comparing at least one document node of the reformed term-to-document matrix against another document node of the reformed term-to-document matrix; and

    combining at least one document node of the term-to-document matrix with another document node of the term-to-document matrix, based on the comparison of the at least one document node of the reformed term-to-document matrix against the another document node of the reformed term-to-document matrix, thereby clustering at least two document nodes of the term-to-document matrix.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×