Construction of a lexicon for a selected context
Abstract
Various technologies pertaining to constructing a lexicon for a defined context are set forth herein. Social media text is acquired, where the social media text has contextual data that corresponds thereto. The social media text is encoded to form encoded text (in Unicode), and the contextual data is assigned to the encoded text. A text corpus for a defined context is formed by filtering the encoded text based upon contextual data, such as location. Frequency of occurrence of words or phrases in the text corpus is used to identify words or phrases that are to be included in the lexicon.
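The pipeline summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the post dictionaries, the `location` field, the function names, and the `min_count` cutoff are all hypothetical, and Unicode NFC normalization merely stands in for the encoding step.

```python
import unicodedata
from collections import Counter

def build_corpus(posts, location):
    """Keep the text of posts whose contextual data matches the location."""
    corpus = []
    for post in posts:
        if post["location"] == location:
            # Assumption: NFC normalization stands in for "encoded text (in Unicode)".
            corpus.append(unicodedata.normalize("NFC", post["text"]))
    return corpus

def frequent_words(corpus, min_count):
    """Return the words that occur at least min_count times in the corpus."""
    counts = Counter(word for text in corpus for word in text.lower().split())
    return {word for word, n in counts.items() if n >= min_count}

# Hypothetical sample data, invented for illustration only.
posts = [
    {"text": "hella good tacos", "location": "Oakland"},
    {"text": "hella traffic today", "location": "Oakland"},
    {"text": "wicked good chowder", "location": "Boston"},
]
lexicon = frequent_words(build_corpus(posts, "Oakland"), min_count=2)
```

Here the corpus is filtered by location before any counting, so a word that is common in one context does not inflate the lexicon of another.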
20 Claims
1. A computing system that generates a lexicon of a social network, the computing system comprising:
- a processor; and
- memory storing instructions that, when executed by the processor, provide:
  - a lexicon generator that:
    - for respective messages of the social network:
      - identifies a context of the message,
      - scans the message to identify a set of word sequences, and
      - counts an occurrence of the respective word sequences among the messages within the context;
    - for respective contexts, generates a lexicon for the context that comprises the word sequences identified in the messages with a higher count than a word sequence count threshold;
    - identifies a selected word sequence by which at least one user of the social network who is associated with the selected word sequence is identifiable; and
    - refrains from including the selected word sequence from the lexicon; and
  - a lexicon presenter that, responsive to a selection by a user of a selected context, presents to the user the lexicon of word sequences for the selected context.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. A method of generating a lexicon for a context of a social network, the method involving a device and comprising:
- executing, by the processor, instructions that cause the device to:
  - among messages of the social network, identify selected messages that are within the context;
  - scan the respective selected message to identify a set of word sequences;
  - count an occurrence of the respective word sequences among the selected messages within the context;
  - generate a lexicon for the context that comprises the word sequences identified in the selected messages with a higher count than a word sequence count threshold;
  - identify a selected word sequence by which at least one user of the social network who is associated with the selected word sequence is identifiable; and
  - refrain from including the selected word sequence in the lexicon; and
  - responsive to a request by the user to present word sequences that frequently occur in the context, present to the user the lexicon of word sequences for the context.
Dependent claims: 18, 19
20. A method of generating a lexicon from users of a social network who are respectively identified by a user profile, the method comprising:
- for respective users of the social network, performing an evaluation of the user profile of the user to identify a context of the user and respective messages associated with the user, while refraining from using information that distinctively identifies the selected user;
- among the messages of the social network, identifying selected messages that are within the context according to the context of at least one user who is associated with the message;
- scanning the respective selected message to identify a set of word sequences;
- counting an occurrence of the respective word sequences among the selected messages within the context;
- generating a lexicon for the context that comprises the word sequences identified in the selected messages with a higher count than a word sequence count threshold; and
- responsive to a request by the user to present word sequences that frequently occur in the context, presenting to the user the lexicon of word sequences for the context.
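The steps recited in the independent claims (per-context counting of word sequences, a count threshold, and exclusion of sequences that could identify a user) can be sketched as follows. This is only an illustration under stated assumptions: the claims do not say how an identifying sequence is detected, so the sketch takes a hypothetical, precomputed `identifying` set and matches it exactly; the `(context, text)` message tuples, the function names, and the two-word limit on sequences are likewise invented.

```python
from collections import Counter, defaultdict

def word_sequences(text, max_len=2):
    """Return every run of 1..max_len consecutive words in the text."""
    words = text.lower().split()
    return [
        " ".join(words[i:i + n])
        for n in range(1, max_len + 1)
        for i in range(len(words) - n + 1)
    ]

def build_lexicons(messages, threshold, identifying):
    """Map each context to the sequences counted more than threshold times."""
    counts = defaultdict(Counter)
    for context, text in messages:
        counts[context].update(word_sequences(text))
    return {
        context: {
            seq for seq, n in counter.items()
            # Strictly greater than the threshold, and never an identifying sequence.
            if n > threshold and seq not in identifying
        }
        for context, counter in counts.items()
    }

# Hypothetical messages as (context, text) pairs, invented for illustration.
messages = [
    ("seattle", "jane doe loves rain"),
    ("seattle", "jane doe hates rain"),
    ("austin", "breakfast tacos forever"),
]
# Assumption: the identifying sequences are supplied by some other component.
lexicons = build_lexicons(messages, threshold=1, identifying={"jane doe", "jane", "doe"})
```

Presenting the lexicon for a context the user selects then reduces to a lookup such as `lexicons["seattle"]`. Filtering on the identifying set at generation time, rather than at presentation time, means the excluded sequences never enter a stored lexicon.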
Specification