Learning apparatus and learning method
First Claim
Patent Images
1. A learning apparatus comprising:
- a memory; and
a processor coupled to the memory and the processor configured to;
acquire a plurality of words from a plurality of documents;
generate a plurality of contexts represented in a vector for each word of the plurality of words;
perform clustering of the plurality of contexts for each word of the plurality of words;
when a plurality of clusters are generated for a first word among the plurality of words by the clustering, perform assignment, to the first word, different labels corresponding to each cluster of the plurality of clusters;
generate first contexts for each first word distinguished by the assigned different labels; and
perform re-clustering of the plurality of contexts including the first contexts for each first word with the assigned different labels.
2 Assignments
0 Petitions
Accused Products
Abstract
A learning apparatus includes a memory and a processor configured to acquire a plurality of documents, perform clustering of the plurality of documents for each of a plurality of words included in the plurality of document, when a plurality of clusters are generated for a first word among the plurality of words by the clustering, perform assignment of different labels corresponding to the plurality of clusters to the first word included in the plurality of documents, and perform re-clustering of the plurality of documents including the first word with the assigned different labels, for other words among the plurality of words.
53 Citations
13 Claims
-
1. A learning apparatus comprising:
-
a memory; and a processor coupled to the memory and the processor configured to; acquire a plurality of words from a plurality of documents; generate a plurality of contexts represented in a vector for each word of the plurality of words; perform clustering of the plurality of contexts for each word of the plurality of words; when a plurality of clusters are generated for a first word among the plurality of words by the clustering, perform assignment, to the first word, different labels corresponding to each cluster of the plurality of clusters; generate first contexts for each first word distinguished by the assigned different labels; and perform re-clustering of the plurality of contexts including the first contexts for each first word with the assigned different labels. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A learning method executed by a computer, the method comprising:
-
acquiring a plurality of words from a plurality of documents; generating a plurality of contexts represented in a vector for each word of the plurality of words; performing clustering of the plurality of contexts for each word of the plurality of words; when a plurality of clusters are generated for a first word among the plurality of words by the clustering, performing assignment, to the first word, different labels corresponding to each cluster of the plurality of clusters; generating first contexts for each first word distinguished by the assigned different labels; and performing re-clustering of the plurality of contexts including the first contexts for each first word with the assigned different labels. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A non-transitory computer-readable medium storing a learning program that causes a computer to execute a process comprising:
-
acquiring a plurality of words from a plurality of documents; generate a plurality of contexts represented in a vector for each word of the plurality of words; performing clustering of the plurality of contexts for each word of the plurality of words; when a plurality of clusters are generated for a first word among the plurality of words by the clustering, performing assignment, to the first word, different labels corresponding to each cluster of the plurality of clusters; generate first contexts for each first word distinguished by the assigned different labels; and performing re-clustering of the plurality of contexts including the first contexts for each first word with the assigned different labels.
-
Specification