VOICE SEARCH DEVICE, VOICE SEARCH METHOD, AND NON-TRANSITORY RECORDING MEDIUM
First Claim
1. A voice search device comprising:
- a search string acquirer acquiring a search string;
a converter converting the search string acquired by the search string acquirer into a phoneme sequence;
a time length deriver acquiring a duration of each phoneme included in the phoneme sequence converted by the converter, and deriving a spoken time length of voice corresponding to the search string based on the acquired durations;
a zone designator designating a likelihood acquisition zone that is a zone of the time length derived by the time length deriver in a target voice signal;
a likelihood acquirer acquiring a likelihood indicating how likely the likelihood acquisition zone designated by the zone designator is a zone in which voice corresponding to the search string is spoken;
a repeater changing the likelihood acquisition zone designated by the zone designator, and repeating a process of the zone designator and the likelihood acquirer; and
an identifier identifying, on the basis of the likelihood acquired by the likelihood acquirer for each likelihood acquisition zone designated by the zone designator, from the target voice signal an estimated zone for which the voice corresponding to the search string is estimated to be spoken.
1 Assignment
0 Petitions
Accused Products
Abstract
A search string acquiring unit acquires a search string. A converting unit converts the search string into a phoneme sequence. A time length deriving unit derives the spoken time length of the voice corresponding to the search string. A zone designating unit designates a likelihood acquisition zone in a target voice signal. A likelihood acquiring device acquires a likelihood indicating how likely the likelihood acquisition interval is an interval in which voice corresponding to the search string is spoken. A repeating unit changes the likelihood acquisition zone designated by the zone designating unit, and repeats the process of the zone designating unit and the likelihood acquiring device. An identifying unit identifies, from the target voice signal, estimated intervals for which the voice corresponding to the search string is estimated to be spoken, on the basis of the likelihoods acquired for each of the likelihood acquisition zones.
13 Citations
17 Claims
-
1. A voice search device comprising:
-
a search string acquirer acquiring a search string; a converter converting the search string acquired by the search string acquirer into a phoneme sequence; a time length deriver acquiring a duration of each phoneme included in the phoneme sequence converted by the converter, and deriving a spoken time length of voice corresponding to the search string based on the acquired durations; a zone designator designating a likelihood acquisition zone that is a zone of the time length derived by the time length deriver in a target voice signal; a likelihood acquirer acquiring a likelihood indicating how likely the likelihood acquisition zone designated by the zone designator is a zone in which voice corresponding to the search string is spoken; a repeater changing the likelihood acquisition zone designated by the zone designator, and repeating a process of the zone designator and the likelihood acquirer; and an identifier identifying, on the basis of the likelihood acquired by the likelihood acquirer for each likelihood acquisition zone designated by the zone designator, from the target voice signal an estimated zone for which the voice corresponding to the search string is estimated to be spoken. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A voice search method comprising:
-
a search string acquiring step that acquires a search string; a converting step that converts the search string acquired by the search string acquiring step into a phoneme sequence; a time length deriving step that acquires a duration of each phoneme included in the phoneme sequence converted by the converting step, and derives a spoken time length of voice corresponding to the search string based on the acquired durations; a zone designating step that designates a likelihood acquisition zone that is a zone of the time length derived by the time length deriving step in a target voice signal; a likelihood acquiring step that acquires a likelihood indicating how likely the likelihood acquisition zone designated by the zone designating step is a zone in which voice corresponding to the search string is spoken; a repeating step that changes the likelihood acquisition zone designated by the zone designating step, and repeats a process of the zone designating step and the likelihood acquiring step; and an identifying step that identifies, on the basis of the likelihood acquired by the likelihood acquiring step for each likelihood acquisition zone designated by the zone designating step, from the target voice signal an estimated zone for which the voice corresponding to the search string is estimated to be spoken. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A non-transitory recording medium storing a program causing a computer to function as:
-
a search string acquirer acquiring a search string; a converter converting the search string acquired by the search string acquirer into a phoneme sequence; a time length deriver acquiring a duration of each phoneme included in the phoneme sequence converted by the converter, and deriving a spoken time length of voice corresponding to the search string based on the acquired durations; a zone designator designating a likelihood acquisition zone that is a zone of the time length derived by the time length deriver in a target voice signal; a likelihood acquirer acquiring a likelihood indicating how likely the likelihood acquisition zone designated by the zone designator is a zone in which voice corresponding to the search string is spoken; a repeater changing the likelihood acquisition zone designated by the zone designator, and repeating a process of the zone designator and the likelihood acquirer; and an identifier identifying, on the basis of the likelihood acquired by the likelihood acquirer for each likelihood acquisition zone designated by the zone designator, from the target voice signal an estimated zone for which the voice corresponding to the search string is estimated to be spoken.
-
Specification