×

Identifying word-senses based on linguistic variations

  • US 9,619,460 B2
  • Filed: 09/23/2016
  • Issued: 04/11/2017
  • Est. Priority Date: 02/13/2015
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer system for identifying word-senses, the computer system comprising:

  • one or more computer processors;

    one or more computer-readable storage media;

    program instructions stored on the computer-readable storage media for execution by at least one of the one or more processors, the program instructions comprising;

    program instructions to generate, by a computer, a plurality of arrays of aggregated statistical information of words, their corresponding word-senses, and temporal properties within different professional fields using an n-gram viewer, wherein the aggregated statistical information comprises frequency of usage of words, frequency of occurrence of words, frequency of co-occurrence of words with other words, and their respective corresponding word-senses;

    program instructions to generate, by the computer, a set of domain tables based on the generated plurality of arrays of aggregated statistical information, wherein each of the domain tables within the set of domain tables corresponds to a different professional field comprising medical, veterinary, legal, and engineering;

    program instructions to receive, from a remote server through a network, a digital text stream comprising metadata and one or more words from a doctor, using the computer, the network being an internet connection;

    program instructions to select, using the metadata, a medical frequency domain table, veterinary frequency domain table, and a word-sense domain table from the set of domain tables;

    program instructions to determine a frequency of occurrence value for the received digital text stream within each of the selected domain tables;

    program instructions to receive a threshold from the doctor;

    program instructions to associate the medical frequency domain table with the received digital text stream in response to the frequency of occurrence value satisfying the received threshold;

    program instructions to determine a word-sense of the received digital text stream, by determining a corresponding word sense to the received digital text stream within the medical frequency domain table;

    program instructions to assign a confidence value to the word-sense based on a degree of frequency of occurrence of the received digital text stream within the medical domain, wherein the word-sense has a higher confidence value, when the frequency of occurrence of the received digital text stream is higher within the medical domain table; and

    program instructions to present the word-sense and the confidence value to the doctor.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×