×

Creating, rendering and interacting with a multi-faceted audio cloud

  • US 10,007,724 B2
  • Filed: 06/29/2012
  • Issued: 06/26/2018
  • Est. Priority Date: 06/29/2012
  • Status: Active Grant
First Claim
Patent Images

1. An apparatus comprising:

  • at least one processor; and

    a non-transitory computer readable storage medium having computer readable program code embodied therewith and executable by the at least one processor, the computer readable program code comprising;

    computer readable program code configured to segment audio provided in a first language not having available automatic speech recognition capabilities into speech units, wherein to segment comprises employing a language sub-word recognition technique selected from the group consisting of;

    a statistical system for sub-word unit recognition;

    a voice-activity-detection technique; and

    a syllable segmentation technique, wherein the language sub-word recognition technique comprises utilizing a sub-word recognition technique of a second language having available automatic speech recognition capabilities and different from the first language of the audio;

    computer readable program code configured to identify prominent speech units, wherein to identify comprises detecting a repeated speech unit by identifying speech patterns within the audio and using a language agnostic speech unit comparison technique, wherein the language agnostic speech unit comparison technique comprises a technique where a language associated with the speech unit is disregarded;

    wherein to identify further comprises determining a frequency of occurrence of a speech unit and wherein a prominent speech unit comprises a speech unit that exceeds a predetermined frequency of occurrence threshold;

    computer readable program code configured to create an audio cloud comprising audio signals of the prominent speech units, wherein each of the audio signals comprise a playable audio unit that when played provides an audible output from the audio of the corresponding prominent speech unit;

    computer readable program code configured to render the audio cloud, wherein the audio cloud comprises a visual representation of the audio signals, wherein the audio signals are arranged in order of decreasing frequency of occurrence and wherein a volume of the audio signals is based upon the frequency of occurrence; and

    computer readable program code configured to afford user interaction with at least a clip portion of the audio cloud.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×