TERM EXTRACTION FROM SERVICE DESCRIPTION DOCUMENTS
First Claim
1. A computer readable medium including code that when executed by a computer system performs a method of selecting most relevant terms from service description documents, the method comprising:
receiving a set of service description documents;
determining a matrix X of terms and documents from the set of service description documents;
decomposing the matrix X using singular value decomposition (SVD) into three matrices T, S, and Dt, wherein S is a diagonal matrix of singular values ordered in descending order, and T and Dt are matrices of eigen vectors;
selecting k singular values from S, where k is an integer greater than or equal to 1;
determining rankings of the terms in the matrix X using the k singular values; and
selecting a set of terms from the matrix X based on the rankings.
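The steps recited above can be sketched end to end in Python. This is a minimal illustration under stated assumptions, not the patented implementation: whitespace tokenization, raw term counts as the entries of X, power iteration with deflation standing in for a full SVD routine, and the row norm of the weighted vectors as the ranking score are all choices made here for illustration. The function names are hypothetical, and k is assumed not to exceed the rank of X.

```python
import math
import random
from collections import Counter


def term_document_matrix(documents):
    """Build X as a list of rows: X[i][j] is the count of term i in document j."""
    tokenized = [doc.lower().split() for doc in documents]  # assumed tokenizer
    terms = sorted({tok for toks in tokenized for tok in toks})
    counts = [Counter(toks) for toks in tokenized]
    return terms, [[c[term] for c in counts] for term in terms]


def top_k_left_singular(X, k, iterations=500, seed=0):
    """Approximate the k largest singular values of X and the matching
    columns of T by power iteration with deflation on X X^T."""
    t = len(X)
    gram = [[sum(a * b for a, b in zip(X[i], X[j])) for j in range(t)]
            for i in range(t)]
    rng = random.Random(seed)
    values, vectors = [], []
    for _ in range(k):
        v = [rng.random() for _ in range(t)]
        for _ in range(iterations):
            w = [sum(g * x for g, x in zip(row, v)) for row in gram]
            # Deflation: remove components along vectors already found.
            for u in vectors:
                dot = sum(a * b for a, b in zip(w, u))
                w = [a - dot * b for a, b in zip(w, u)]
            norm = math.sqrt(sum(a * a for a in w))
            if norm < 1e-12:
                break
            v = [a / norm for a in w]
        # Rayleigh quotient gives the eigenvalue of X X^T; its square root
        # is the corresponding singular value of X.
        gv = [sum(g * x for g, x in zip(row, v)) for row in gram]
        eigenvalue = sum(a * b for a, b in zip(v, gv)) / sum(a * a for a in v)
        values.append(math.sqrt(max(eigenvalue, 0.0)))
        vectors.append(v)
    return values, vectors


def rank_terms(terms, values, vectors):
    """Score each term by the norm of its row in T_k S_k (assumed score)."""
    scored = []
    for i, term in enumerate(terms):
        score = math.sqrt(sum((s * u[i]) ** 2 for s, u in zip(values, vectors)))
        scored.append((term, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


documents = [
    "print brochures print flyers",
    "print brochures",
    "deliver brochures",
]
terms, X = term_document_matrix(documents)
values, vectors = top_k_left_singular(X, k=1)
ranking = rank_terms(terms, values, vectors)
selected = [term for term, _ in ranking[:2]]  # keep the two best-ranked terms
```

In practice a library SVD routine would replace `top_k_left_singular`; the hand-rolled version is used here only to keep the sketch free of dependencies.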
Abstract
Terms are extracted from service description documents describing desired services, and the terms are used to search for candidate service providers operable to provide the desired services. A matrix X of terms and documents is determined from the service description documents. Singular value decomposition (SVD) is used to decompose the matrix. Eigen vectors from the decomposition are weighted by the top k singular values in a diagonal matrix S of singular values, which is also generated by the decomposition. The weighted eigen vectors are used to rank terms from the matrix X based on relevance of the terms to the requirements for desired services described in the service description documents.
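The decomposition and weighting described in the abstract can be written out as follows. The factorization and its link to eigenvectors are standard properties of the SVD. The final scoring formula is only one plausible reading of "weighted eigen vectors are used to rank terms" and is an assumption, as are the symbols r, \sigma_j, and r_i, which do not appear in the source.

```latex
\[
X = T\,S\,D^{\mathsf{T}}, \qquad
X \in \mathbb{R}^{t \times d}, \qquad
S = \operatorname{diag}(\sigma_1, \dots, \sigma_r), \quad
\sigma_1 \ge \sigma_2 \ge \dots \ge \sigma_r > 0
\]
\[
X X^{\mathsf{T}} = T\,S^{2}\,T^{\mathsf{T}}, \qquad
X^{\mathsf{T}} X = D\,S^{2}\,D^{\mathsf{T}}
\]
\[
r_i = \Bigl( \sum_{j=1}^{k} \bigl( \sigma_j\, T_{ij} \bigr)^{2} \Bigr)^{1/2},
\qquad i = 1, \dots, t \quad \text{(assumed ranking score for term } i\text{)}
\]
```

The second line shows why T and Dt are called matrices of eigen vectors: the columns of T are eigenvectors of X X^T and the columns of D are eigenvectors of X^T X, with the squared singular values as eigenvalues.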
20 Claims
10. A computer readable medium including code that when executed by a computer system performs a method comprising:
receiving a set of service description documents describing requirements for one or more desired services;
determining a term-by-document matrix X from the set of documents, wherein a dimension of the matrix X is t number of terms by d number of documents;
using SVD to determine matrices T, S, and Dt, wherein S is a diagonal matrix of singular values ordered in descending order, and T and Dt are matrices of eigenvectors;
selecting top k singular values from S;
determining eigen vectors from T weighted by the top k singular values in S; and
using the weighted eigen vectors to rank terms from the matrix X based on relevance of the terms to the requirements for one or more desired services in the set of documents.
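The weighting step in claim 10 can be illustrated in isolation, given T and the singular values from an SVD computed elsewhere. This is a sketch under assumptions: T is stored as a list of rows with one column per singular value, the singular values are already in descending order as the claim requires, and the ranking score (the Euclidean norm of each weighted row) is a choice made here rather than something the claim specifies. The function name and the sample numbers are hypothetical.

```python
import math


def weight_and_rank(terms, T, singular_values, k):
    """Scale the first k columns of T by the top k singular values,
    then rank terms by the norm of their weighted row."""
    top = singular_values[:k]  # S is sorted descending, so these are the top k
    weighted = [[T[i][j] * top[j] for j in range(k)] for i in range(len(terms))]
    scores = [math.sqrt(sum(x * x for x in row)) for row in weighted]
    order = sorted(range(len(terms)), key=lambda i: scores[i], reverse=True)
    return weighted, [(terms[i], scores[i]) for i in order]


# Hypothetical factors: three terms, two singular values, orthonormal columns.
terms = ["print", "ship", "bind"]
T = [
    [0.8, 0.0],
    [0.6, 0.0],
    [0.0, 1.0],
]
singular_values = [5.0, 1.0]

weighted, ranking = weight_and_rank(terms, T, singular_values, k=2)
```

Because each column is multiplied by its singular value, terms that load on the dominant direction outrank terms that load only on weaker ones, which is the effect the weighting is meant to achieve.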
17. A computer system comprising:
an interface receiving a set of service description documents; and
one or more processors executing code to perform a method including
determining a term-by-document matrix X from the set of service description documents, wherein a dimension of the matrix X is t number of terms by d number of documents;
decomposing the matrix X using singular value decomposition (SVD) into three matrices T, S, and Dt, wherein S is a diagonal matrix of singular values ordered in descending order, and T and Dt are matrices of eigen vectors;
selecting top k singular values in S, where k is less than a total number of non-zero singular values in S;
determining eigen vectors from T weighted by the top k singular values in S; and
using the weighted eigen vectors to rank terms from the matrix X based on relevance of the terms to the requirements for one or more desired services in the set of documents.
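Claim 17 adds the constraint that k is less than the total number of non-zero singular values. One way to pick such a k is sketched below. The claim states only the upper bound; the cumulative-energy threshold used to choose within that bound, the tolerance for treating a value as zero, and the function name are all assumptions made for illustration.

```python
def choose_k(singular_values, energy=0.9, tol=1e-10):
    """Pick the smallest k whose squared singular values reach the given
    share of the total, capped so that k < number of non-zero values."""
    nonzero = [s for s in singular_values if s > tol]
    if len(nonzero) < 2:
        raise ValueError("need at least two non-zero singular values")
    total = sum(s * s for s in nonzero)
    running = 0.0
    k = 0
    for s in nonzero:
        running += s * s
        k += 1
        if running / total >= energy:
            break
    # Enforce the claimed bound: k strictly below the non-zero count.
    return min(k, len(nonzero) - 1)


k = choose_k([5.0, 2.0, 1.0, 0.0])
```

The cap on the last line is what ties the sketch to the claim: whatever heuristic proposes k, at least one non-zero singular value is always left out.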
Specification