User voice mixing device, virtual space sharing system, computer control method, and information storage medium
Abstract
The sense of presence in voice chat within a virtual space is enhanced. A user speech synthesizer is used in a virtual space sharing system in which information processing devices share a virtual space. The user speech synthesizer comprises a speech data acquiring section (60) for acquiring speech data representing speech uttered by the user of one of the information processing devices, an environment sound storage section (66) for storing environment sounds in association with one or more regions defined in the virtual space, a region specifying section (64) for specifying a region corresponding to the user in the virtual space, and an environment sound synthesizing section (68) for acquiring the environment sound associated with the specified region from the environment sound storage section (66) and combining the acquired environment sound with the speech data to produce synthesized speech data.
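As an illustrative sketch only (not taken from the specification), the sections named in the abstract can be modeled as a region lookup, a per-region sound table, and a sample-wise mixer. The rectangular regions, the region names, the `ambient_gain` parameter, and the use of float samples in [-1, 1] are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Sequence


@dataclass(frozen=True)
class Region:
    """Hypothetical axis-aligned rectangular region of the virtual space."""
    name: str
    x_min: float
    x_max: float
    y_min: float
    y_max: float

    def contains(self, x: float, y: float) -> bool:
        return self.x_min <= x < self.x_max and self.y_min <= y < self.y_max


def specify_region(regions: Sequence[Region], x: float, y: float) -> Optional[str]:
    """Role of the region specifying section: map a position to a region name."""
    for region in regions:
        if region.contains(x, y):
            return region.name
    return None  # the position lies outside every defined region


def synthesize(voice: Sequence[float], ambient: Sequence[float],
               ambient_gain: float = 0.5) -> List[float]:
    """Role of the environment sound synthesizing section.

    Adds the (looped) environment sound to the speech samples and clips
    the result to [-1, 1]. The gain value is an assumption.
    """
    if not ambient:
        return list(voice)
    mixed = []
    for i, sample in enumerate(voice):
        value = sample + ambient_gain * ambient[i % len(ambient)]
        mixed.append(max(-1.0, min(1.0, value)))
    return mixed


# Hypothetical layout and environment sound storage (tiny sample buffers).
REGIONS = [Region("beach", 0, 10, 0, 10), Region("cave", 10, 20, 0, 10)]
ENVIRONMENT_SOUNDS: Dict[str, List[float]] = {
    "beach": [0.2, 0.4],
    "cave": [0.1, -0.1],
}

region = specify_region(REGIONS, 3.0, 4.0)
speech = [0.5, -0.5, 0.9]
synthesized = synthesize(speech, ENVIRONMENT_SOUNDS[region])
```

A real system would stream audio frames rather than short lists, but the lookup-then-mix order is the same as the order described in the abstract.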
19 Claims
1. A user voice mixing device used in a virtual space sharing system in which a plurality of information processing devices share a virtual space, said user voice mixing device comprising:
means for receiving voice data representing a voice uttered by a first user of one of the plurality of information processing devices and a first region corresponding to a first position coordinate of the first user in the virtual space;
means for storing predetermined environmental sounds in association with each of one or more regions defined in the virtual space;
means for acquiring a second position corresponding to a second user in the virtual space, who receives the voice data of the first user at the user voice mixing device;
means for specifying a second region corresponding to the second user according to the second acquired position coordinate, the second region being different from the first region;
means for creating mixed voice data by acquiring a first predetermined environmental sound associated with the first region received by the means for receiving from the means for storing, and mixing the acquired first predetermined environmental sound and the voice data, and acquiring a second predetermined environmental sound associated with the second region corresponding to the second user, and mixing the voice data with the first predetermined environmental sound associated with the first region corresponding to the user and the second predetermined environmental sound associated with the second region corresponding to the second user; and
means for outputting the mixed voice data,
wherein said acquired first environmental sound is independent of the voice data and not directly associated with the acquired position coordinate of the user, and
wherein, when said voice data, said first environmental sound and said second environmental sound occur at a same time, said voice data is mixed with said first and second environmental sounds to generate a modified output indicative of said first and second environmental sounds reducing the comprehensibility of said voice data.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
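The final wherein clause of claim 1 says the environmental sounds reduce the comprehensibility of the voice data but does not say how that reduction is measured. One possible proxy, introduced here purely as an assumption, is the ratio of voice level to combined environmental sound level in decibels: adding a second environmental sound lowers the ratio. The function names and sample values below are hypothetical.

```python
import math
from typing import Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of a sample buffer."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def voice_to_ambient_db(voice: Sequence[float],
                        *ambients: Sequence[float]) -> float:
    """Voice level relative to the summed environmental sounds, in dB.

    Each environmental sound is looped to the length of the voice buffer.
    This ratio is only an assumed stand-in for "comprehensibility".
    """
    combined = [sum(a[i % len(a)] for a in ambients) for i in range(len(voice))]
    noise_level = rms(combined)
    voice_level = rms(voice)
    if noise_level == 0:
        return math.inf   # no environmental sound at all
    if voice_level == 0:
        return -math.inf  # silence cannot be understood
    return 20 * math.log10(voice_level / noise_level)


voice = [0.5, -0.5, 0.5, -0.5]
first_env = [0.1, -0.1]    # hypothetical sound for the first region
second_env = [0.1, -0.1]   # hypothetical sound for the second region

one_sound = voice_to_ambient_db(voice, first_env)
two_sounds = voice_to_ambient_db(voice, first_env, second_env)
```

With these particular buffers the two environmental sounds are in phase, so the ratio drops by about 6 dB when the second one is added; uncorrelated sounds would typically lower it by less.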
8. A control method for a computer used in a virtual space sharing system in which a plurality of information processing devices share a virtual space, the control method for a computer comprising:
acquiring voice data representing a voice uttered by a user of one of the plurality of information processing devices;
acquiring a first position coordinate corresponding to the user in the virtual space and acquiring a second position coordinate corresponding to a receiver of the voice data in the virtual space;
specifying a first region corresponding to the user in the virtual space according to the first acquired position coordinate, and specifying a second region corresponding to the receiver of the voice data according to the second acquired position coordinate, second region being different from the first region;
transmitting the first region and the voice data to the receiver;
creating mixed voice data at the receiver by acquiring, with reference to storing predetermined environmental sounds in association with each of one or more regions defined in the virtual space, a first predetermined environmental sound associated with the first region specified in the specifying, and mixing the acquired first predetermined environmental sound and the voice data, and acquiring a second predetermined environmental sound associated with the second region corresponding to the receiver, and mixing the voice data with the first predetermined environmental sound associated with the first region corresponding to the user and the second predetermined environmental sound associated with the second region corresponding to the receiver; and
outputting the mixed voice data at the receiver,
wherein said acquired first environmental sound is independent of the voice data and not directly associated with the acquired position coordinate of the user, and
wherein, when said voice data, said first environmental sound and said second environmental sound occur at a same time, said voice data is mixed with said first and second environmental sounds to generate a modified output indicative of said first and second environmental sounds reducing the comprehensibility of said voice data.
- View Dependent Claims (9, 10, 11)
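The steps of claim 8 can be sketched as a sender that transmits the first region together with the voice data, and a receiver that looks up both environmental sounds and mixes them with the voice. This is a minimal sketch under stated assumptions: the packet layout, the region names, the `gain` parameter, and the contents of `ENV_SOUNDS` are hypothetical and do not come from the claims.

```python
from typing import Dict, List, Sequence

# Hypothetical store of predetermined environmental sounds, keyed by region.
ENV_SOUNDS: Dict[str, List[float]] = {
    "plaza": [0.10, -0.10, 0.10, -0.10],
    "harbor": [0.05, 0.05, -0.05, -0.05],
}


def send(voice: Sequence[float], sender_region: str) -> dict:
    """Sender side: bundle the first region with the voice data."""
    return {"region": sender_region, "voice": list(voice)}


def receive(packet: dict, receiver_region: str,
            store: Dict[str, List[float]] = ENV_SOUNDS,
            gain: float = 1.0) -> List[float]:
    """Receiver side: mix the voice with both regions' environmental sounds.

    The first sound is chosen by the transmitted region, the second by the
    receiver's own region. Sounds are looped and the sum is clipped to [-1, 1].
    """
    first = store[packet["region"]]
    second = store[receiver_region]
    mixed = []
    for i, sample in enumerate(packet["voice"]):
        ambient = first[i % len(first)] + second[i % len(second)]
        mixed.append(max(-1.0, min(1.0, sample + gain * ambient)))
    return mixed


packet = send([0.5, 0.5, -0.5, -0.5], "plaza")
mixed_voice = receive(packet, "harbor")
```

Note that the first environmental sound is selected by region only, not by the exact coordinate, which matches the claim's statement that it is not directly associated with the acquired position coordinate.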
12. A non-transitory information storage medium storing a program for causing a computer to function, the computer being used in a virtual space sharing system in which a plurality of information processing devices share a virtual space, the program causing the computer to function as:
means for acquiring voice data representing a voice uttered by a user of one of the plurality of information processing devices;
means for storing predetermined environmental sounds in association with each of one or more regions defined in the virtual space;
means for acquiring a first position coordinate corresponding to the user in the virtual space and acquiring a second position coordinate corresponding to a receiver of the voice data in the virtual space;
means for specifying a first region corresponding to the user in the virtual space according to the first acquired position coordinate, and specifying a second region corresponding to the receiver of the voice data according to the second acquired position coordinate, second region being different from the first region;
transmitting the first region and the voice data to the receiver;
means for creating mixed voice data at the receiver by acquiring a first predetermined environmental sound associated with the first region specified by the means for specifying from the environmental sound storing means, and mixing the acquired first predetermined environmental sound and the voice data, and acquiring a second predetermined environmental sound associated with the second region corresponding to the receiver, and mixing the voice data with the first predetermined environmental sound associated with the first region corresponding to the user and the second predetermined environmental sound associated with the second region corresponding to the receiver; and
means for outputting the mixed voice data at the receiver,
wherein said acquired first environmental sound is independent of the voice data and not directly associated with the acquired position coordinate of the user, and
wherein, when said voice data, said first environmental sound and said second environmental sound occur at a same time, said voice data is mixed with said first and second environmental sounds to generate a modified output indicative of said first and second environmental sounds reducing the comprehensibility of said voice data.
- View Dependent Claims (13, 14, 15)
16. A virtual space sharing system in which a plurality of information processing devices share a virtual space, comprising:
means for acquiring voice data representing a voice uttered by a user of one of the plurality of information processing devices;
means for storing predetermined environmental sounds in association with each of one or more regions defined in the virtual space;
means for acquiring a first position coordinate corresponding to the user in the virtual space and acquiring a second position coordinate corresponding to a receiver of the voice data in the virtual space;
means for specifying a first region corresponding to the user in the virtual space according to the first acquired position coordinate, and specifying a second region corresponding to the receiver of the voice data according to the second acquired position coordinate, the second region being different from the first region;
means for transmitting the first region and the voice data to the receiver;
means for creating mixed voice data at the receiver by acquiring a first predetermined environmental sound associated with the first region specified by the means for specifying from the means for storing, and mixing the acquired first predetermined environmental sound and the voice data, and acquiring a second predetermined environmental sound associated with the second region corresponding to the receiver, and mixing the voice data with the first predetermined environmental sound associated with the first region corresponding to the user and the second predetermined environmental sound associated with the second region corresponding to the receiver, and
means for outputting the mixed voice data at the receiver;
wherein said acquired first environmental sound is independent of the voice data and not directly associated with the acquired position coordinate of the user, and
wherein, when said voice data, said first environmental sound and said second environmental sound occur at a same time, said voice data is mixed with said first and second environmental sounds to generate a modified output indicative of said first and second environmental sounds reducing the comprehensibility of said voice data.
- View Dependent Claims (17, 18, 19)
Specification