System for compiling word usage frequencies
First Claim
1. A computer system for managing word usage frequencies, the computer system comprising:
one or more processors, one or more computer-readable memories and one or more computer-readable tangible storage devices;
program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to receive:
an identifier of a location;
an identifier of a number of document levels at the location, the document levels comprising at least one document; and
an identifier of a minimum number of words;
program instructions, stored on the at least one of the one or more storage devices for execution by the at least one of the one or more processors via the at least one of the one or more memories, to determine whether the minimum number of words are present in the at least one document;
program instructions, stored on the at least one of the one or more storage devices for execution by the at least one of the one or more processors via the at least one of the one or more memories, responsive to determining that the minimum number of words are present in the at least one document, to analyze all of the at least one document to determine whether the at least one document comprises Latin based words or Sino-Tibetan based words;
program instructions, stored on the at least one of the one or more storage devices for execution by the at least one of the one or more processors via the at least one of the one or more memories, responsive to determining that the at least one document comprises the Latin based words, to populate a Latin based word list with unique Latin based words in the at least one document and to determine a frequency of each of the unique Latin based words in the at least one document;
program instructions, stored on the at least one of the one or more storage devices for execution by the at least one of the one or more processors via the at least one of the one or more memories, responsive to determining that the at least one document comprises the Sino-Tibetan based words, to populate a Sino-Tibetan based word list with unique Sino-Tibetan based words in the at least one document and to determine a frequency of the unique Sino-Tibetan based words in the at least one document; and
program instructions, stored on the at least one of the one or more storage devices for execution by the at least one of the one or more processors via the at least one of the one or more memories, to generate results comprising the unique Latin based words, the frequency of the unique Latin based words, the unique Sino-Tibetan based words, and the frequency of the unique Sino-Tibetan based words.
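The steps recited in claim 1 can be illustrated with a minimal Python sketch. This is not the patented implementation: the claim does not say how words are detected or classified, so the Unicode ranges, the choice to treat each Han character as one "Sino-Tibetan based" word, and the names `LATIN_WORD`, `HAN_CHAR`, and `analyze_document` are all assumptions made for illustration.

```python
import re
from collections import Counter

# Assumption: "Latin based" words are runs of Latin-script letters
# (basic Latin plus accented letters, excluding the × and ÷ signs).
LATIN_WORD = re.compile(r"[A-Za-z\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u024F]+")
# Assumption: each Han character counts as one "Sino-Tibetan based" word.
HAN_CHAR = re.compile(r"[\u4E00-\u9FFF]")


def analyze_document(text, min_words):
    """Return word lists with frequencies, or None if the document is too short."""
    latin = [w.lower() for w in LATIN_WORD.findall(text)]
    han = HAN_CHAR.findall(text)
    # Minimum-word check: skip documents with fewer words than requested.
    if len(latin) + len(han) < min_words:
        return None
    # A Counter holds each unique word once, mapped to its frequency.
    return {"latin": Counter(latin), "sino_tibetan": Counter(han)}


if __name__ == "__main__":
    result = analyze_document("The cat and the hat 你好你", min_words=3)
    print(result["latin"].most_common())
    print(result["sino_tibetan"].most_common())
```

Real Chinese text generally needs a word segmenter rather than per-character counting; the per-character rule is used here only to keep the sketch self-contained.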
Abstract
A system for assisting a user who is learning a language to prioritize words to be learned in order of usage frequency is disclosed. A frequency determination program running on a computer determines the frequency of usage of each word at a list of locations provided by the user. Different algorithms to identify what constitutes a word are employed depending upon the language of the source data. The total number of words at each location and their usage frequency found during the user session, along with a total number of words and their usage frequency for all user sessions performed regardless of location, are calculated and made available to the user. The user can view usage frequencies for words from a single location, a group of locations, or all user sessions performed.
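The per-location and all-session views described in the abstract could be kept as follows. This is a sketch under stated assumptions, not the disclosed design: the class `FrequencyStore` and its methods `record` and `ranked` are hypothetical names, and storing counts in memory keyed by a location string is an illustrative choice.

```python
from collections import Counter


class FrequencyStore:
    """Hypothetical store of word counts per location and across all sessions."""

    def __init__(self):
        self.by_location = {}          # location identifier -> Counter of words
        self.all_sessions = Counter()  # cumulative counts regardless of location

    def record(self, location, counts):
        """Add one session's word counts for a location."""
        self.by_location.setdefault(location, Counter()).update(counts)
        self.all_sessions.update(counts)

    def ranked(self, locations=None):
        """Words in descending frequency for the given locations, or for all sessions."""
        if locations is None:
            total = self.all_sessions
        else:
            total = Counter()
            for loc in locations:
                total.update(self.by_location.get(loc, Counter()))
        # Sort by frequency (highest first), then alphabetically for stable output.
        return sorted(total.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    store = FrequencyStore()
    store.record("site-a", {"the": 3, "cat": 1})
    store.record("site-b", {"the": 1, "dog": 2})
    print(store.ranked())            # all sessions
    print(store.ranked(["site-a"]))  # a single location
```

Sorting by descending frequency gives a learner the most commonly used words first, which matches the prioritization goal stated in the abstract.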
26 Claims
11. A computer program product for managing word usage frequencies, the computer program product comprising:
one or more computer-readable tangible storage devices;
program instructions, stored on at least one of the one or more storage devices, to receive:
an identifier of a location comprising at least one document,
an identifier of a number of document levels at the location, the document levels comprising the at least one document, and
an identifier of a minimum number of words;
program instructions, stored on at least one of the one or more storage devices, to determine whether the minimum number of words are present in the at least one document;
program instructions, stored on at least one of the one or more storage devices, responsive to determining that the minimum number of words are present in the at least one document, to analyze all of the at least one document to determine whether the at least one document comprises Latin based words or Sino-Tibetan based words;
program instructions, stored on at least one of the one or more storage devices, responsive to determining that the at least one document comprises the Latin based words, to populate a Latin based word list with unique Latin based words in the at least one document and to determine a frequency of each of the unique Latin based words in the at least one document;
program instructions, stored on at least one of the one or more storage devices, responsive to determining that the at least one document comprises the Sino-Tibetan based words, to populate a Sino-Tibetan based word list with unique Sino-Tibetan based words in the at least one document and to determine a frequency of the unique Sino-Tibetan based words in the at least one document; and
program instructions, stored on at least one of the one or more storage devices, to generate results comprising the unique Latin based words, the frequency of the unique Latin based words, the unique Sino-Tibetan based words, and the frequency of the unique Sino-Tibetan based words.
19. A method for managing word usage frequencies, the method comprising:
a computer receiving:
an identifier of a location;
an identifier of a number of document levels at the location, the document levels comprising at least one document; and
an identifier of a minimum number of words;
the computer determining whether the minimum number of words are present in the at least one document;
responsive to the computer determining that the minimum number of words are present in the at least one document, the computer analyzing all of the at least one document to determine whether the at least one document comprises Latin based words or Sino-Tibetan based words;
responsive to the computer determining that the at least one document comprises the Latin based words, the computer populating a Latin based word list with the Latin based words and the computer determining a frequency of the Latin based words; and
responsive to the computer determining that the at least one document comprises the Sino-Tibetan based words, the computer populating a Sino-Tibetan based word list with the Sino-Tibetan based words and to determine a frequency of the Sino-Tibetan based words; and
the computer generating results comprising the Latin based words, the frequency of the Latin based words, the Sino-Tibetan based words, and the frequency of the Sino-Tibetan based words.
Specification