×

System and method of structuring data for search using latent semantic analysis techniques

  • US 9,183,288 B2
  • Filed: 01/27/2011
  • Issued: 11/10/2015
  • Est. Priority Date: 01/27/2010
  • Status: Active Grant
First Claim
Patent Images

1. A computer-based method of organizing data for search, the method comprising the steps of:

  • accessing a domain corpus;

    parsing the domain corpus into a plurality of documents;

    parsing each document into at least one term that corresponds to the document;

    generating a term-to-document matrix that correlates each document with the at least one term that corresponds to the document, the at least one term defining a document node for the document;

    performing a singular value decomposition and a dimension reduction on the term-to-document matrix to form a reformed term-to-document matrix having document nodes with fewer dimensions than the document nodes of the term-to-document matrix;

    comparing at least one document node of the reformed term-to-document matrix against another document node of the reformed term-to-document matrix; and

    combining at least one document node of the term-to-document matrix with another document node of the term-to-document matrix, based on the comparison of the at least one document node of the reformed tem-to-document matrix against the another document node of the reformed term-to-document matrix, to form a combined document node representing the combination of the at least one document node of the term-to-document matrix with the another document node of the term-to-document matrix, thereby clustering at least two document nodes of the term-to-document matrix.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×