Identifying word-senses based on linguistic variations
Abstract
One or more words are received. A set of frequency of occurrence values of the received word(s) within a set of domain tables is determined. A domain table in the set of domain tables is associated with the received word(s), based on the set of frequency of occurrence values meeting a threshold value. A word-sense of the received word(s) is determined based on a corresponding word-sense in the associated domain table and/or a corresponding domain dictionary.
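The flow described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the table layout, the names `DOMAIN_TABLES`, `DOMAIN_DICTIONARIES`, `associate_domains`, and `identify_senses`, the sample frequency values, and the 0.001 threshold are all hypothetical, and "meeting a threshold" is read here as "greater than or equal to".

```python
from typing import Dict, List

# Hypothetical domain tables: domain -> word -> frequency value and word-sense.
# The frequency values are invented for illustration; they do not come from
# any n-gram viewer.
DOMAIN_TABLES: Dict[str, Dict[str, dict]] = {
    "finance": {"bank": {"frequency": 0.004, "sense": "financial institution"}},
    "geography": {"bank": {"frequency": 0.002, "sense": "land beside a river"}},
    "aviation": {"bank": {"frequency": 0.0002, "sense": "tilt of an aircraft in a turn"}},
}

# Hypothetical domain dictionaries: domain -> word -> list of word-senses.
DOMAIN_DICTIONARIES: Dict[str, Dict[str, List[str]]] = {
    "finance": {"bank": ["financial institution", "reserve of funds"]},
}


def associate_domains(word: str, threshold: float) -> List[str]:
    """Return the domains whose frequency value for `word` meets the threshold."""
    return [
        domain
        for domain, table in DOMAIN_TABLES.items()
        if word in table and table[word]["frequency"] >= threshold
    ]


def identify_senses(word: str, threshold: float = 0.001) -> Dict[str, List[str]]:
    """Collect word-senses from each associated domain table and its dictionary."""
    senses: Dict[str, List[str]] = {}
    for domain in associate_domains(word, threshold):
        found = [DOMAIN_TABLES[domain][word]["sense"]]
        # Add dictionary senses for the same domain, skipping duplicates.
        for sense in DOMAIN_DICTIONARIES.get(domain, {}).get(word, []):
            if sense not in found:
                found.append(sense)
        senses[domain] = found
    return senses
```

With these sample values, `identify_senses("bank")` associates the word with the finance and geography tables and skips aviation, whose frequency value falls below the assumed threshold.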
20 Claims
1. A computer program product for identifying word-senses, the computer program product comprising:
- one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising:
program instructions to generate a set of domain tables each comprising one or more arrays of aggregated statistical information corresponding to a plurality of words, one or more word-senses corresponding to the plurality of words, and temporal properties corresponding to the plurality of words, wherein the aggregated statistical information comprises a temporal frequency of occurrence value determined using an n-gram viewer;
program instructions to receive a word;
program instructions to identify the temporal frequency of occurrence value corresponding to the received word from each domain table in the set of domain tables;
program instructions to associate the received word with one or more domain tables in the set of domain tables based on the temporal frequency of occurrence value corresponding to the received word in each of the one or more domain tables meeting a threshold value; and
program instructions to identify one or more word-senses corresponding to the received word based on one or more corresponding word-senses in the associated one or more domain tables and based on one or more corresponding word-senses in a corresponding domain dictionary.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer program product for identifying word-senses, the computer program product comprising:
- one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising:
program instructions to identify a frequency of occurrence value of a received word from each of a plurality of domain tables, wherein each of the plurality of domain tables comprises a frequency of occurrence value corresponding to the received word, a word-sense corresponding to the received word, and temporal properties corresponding to the received word, wherein the frequency of occurrence value is determined using an n-gram viewer;
program instructions to associate the received word with a domain table from the plurality of domain tables based on the frequency of occurrence value corresponding to the received word meeting a threshold value; and
program instructions to identify a word-sense of the received word based on the corresponding word-sense from the associated domain table.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer program product for identifying word-senses, the computer program product comprising:
- one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising:
program instructions to generate a set of domain tables each comprising one or more arrays of aggregated statistical information corresponding to a plurality of words, one or more word-senses corresponding to the plurality of words, and temporal properties corresponding to the plurality of words, wherein the aggregated statistical information of words comprises temporal frequency of occurrence of words, wherein the temporal frequency of occurrence of words comprises a frequency of usage of words and corresponding word-senses during a specific time period;
program instructions to receive a word;
program instructions to identify the temporal frequency of occurrence corresponding to the received word from each domain table in the set of domain tables;
program instructions to associate the received word with a domain table in the set of domain tables, based on the temporal frequency of occurrence of the received word corresponding to the received word in the domain table meeting a threshold value; and
program instructions to identify one or more word-senses corresponding to the received word based on one or more corresponding word-senses in the associated domain table based on the specific time period and one or more corresponding word-senses in a corresponding domain dictionary.
Dependent claims: 16, 17, 18, 19, 20
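The time-period element of claim 15 can be illustrated with a small sketch in which each domain table stores a frequency of usage and a word-sense per time period. Everything here is an assumption made for illustration: the `TEMPORAL_TABLES` layout, the `senses_for_year` helper, the year ranges, the frequency values, and the 0.001 threshold are hypothetical, and the domain dictionary lookup recited in the claim is omitted for brevity.

```python
from typing import Dict, List, Tuple

# One table entry: ((start_year, end_year) inclusive, frequency of usage, word-sense).
Entry = Tuple[Tuple[int, int], float, str]

# Hypothetical temporal domain tables: domain -> word -> per-period entries.
# All values are invented for illustration.
TEMPORAL_TABLES: Dict[str, Dict[str, List[Entry]]] = {
    "computing": {
        "cloud": [
            ((1950, 1999), 0.0001, "network drawn as a cloud in diagrams"),
            ((2000, 2020), 0.0030, "remote computing services"),
        ]
    },
    "meteorology": {
        "cloud": [
            ((1950, 1999), 0.0040, "visible mass of water droplets"),
            ((2000, 2020), 0.0035, "visible mass of water droplets"),
        ]
    },
}


def senses_for_year(word: str, year: int, threshold: float = 0.001) -> Dict[str, str]:
    """Map each domain to the word-sense in use during the period containing
    `year`, keeping only domains whose frequency for that period meets the
    threshold (read here as greater than or equal to)."""
    result: Dict[str, str] = {}
    for domain, table in TEMPORAL_TABLES.items():
        for (start, end), frequency, sense in table.get(word, []):
            if start <= year <= end and frequency >= threshold:
                result[domain] = sense
    return result
```

With these sample values, a text dated 1980 associates "cloud" only with the meteorology table, while a text dated 2010 also picks up the computing sense, because that domain's frequency for the later period meets the assumed threshold.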
Specification