Media presentation system controlled by voice to text commands
First Claim
1. A system utilizing a computer for enabling a user to vocally assemble, manipulate, and display a plurality of media searched and retrieved from variable external data media sources based on preferences of said user, comprising:
- a voice recognition module working in conjunction with said computer for converting an inputted voice utterance into computer-readable text in the form of a plurality of search commands, manipulation commands, and navigation commands;
filtering means for taking said search commands after such conversion by said voice-recognition means and identifying one of said external data media sources for committing and retrieving said media;
juxtapositioning means for preparing said media after such retrieving by said filtering means for local display;
platform means whereon said media is vocally manipulated and organized based on each of said manipulation commands performed by said user after such preparing from said juxtapositioning means; and
, means for providing a mirror image of said media after such manipulation and organization as a preliminary view before an epic view projection of said media wherein said epic view projection utilizes each of said navigation commands as part of a presentation program.
0 Assignments
0 Petitions
Accused Products
Abstract
A system and method for searching, assembling, and manipulating a variety of multi-media using voice converted to text commands. Digital images, movies, audio, or text is verbally searched and retrieved from a variety of video and audio databases using a combination of directional commands and a means for juxtaposing and assembling search results. The desired media is then placed onto a platform means for manipulating and editing the media files. Any retrieved media files and/or images can be manipulated and assembled on-screen using commands such as “zoom” or “move left” by having corners and borders read by the grid of the platform means. The image(s) are also capable of being stacked, or overlay one another to define re-proportioned backgrounds. The image(s) from the platform means are displayed without the grid using an image platter as a means of providing a preliminary view of the presentation prior to projection. The system allows for the hand-free assembly and editing of music and movies, and provides a means for verbally assembling pre-planned or impromptu presentations comprising video or audio clips, digital images, or text retrieved from multiple local and remote databases, such as a DVD movie-base or the World Wide Web.
167 Citations
34 Claims
-
1. A system utilizing a computer for enabling a user to vocally assemble, manipulate, and display a plurality of media searched and retrieved from variable external data media sources based on preferences of said user, comprising:
-
a voice recognition module working in conjunction with said computer for converting an inputted voice utterance into computer-readable text in the form of a plurality of search commands, manipulation commands, and navigation commands;
filtering means for taking said search commands after such conversion by said voice-recognition means and identifying one of said external data media sources for committing and retrieving said media;
juxtapositioning means for preparing said media after such retrieving by said filtering means for local display;
platform means whereon said media is vocally manipulated and organized based on each of said manipulation commands performed by said user after such preparing from said juxtapositioning means; and
,means for providing a mirror image of said media after such manipulation and organization as a preliminary view before an epic view projection of said media wherein said epic view projection utilizes each of said navigation commands as part of a presentation program. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A system utilizing a computer for enabling a user to vocally assemble and display a plurality of media searched and retrieved from variable external data media sources and manipulated based on preferences of said user, comprising:
-
a voice recognition module working in conjunction with said computer for converting an inputted voice utterance into computer-readable text in the form of a plurality of search commands, manipulation commands, and navigation commands;
filtering means for taking each of said search commands after such conversion by said voice-recognition means and identifying one of said external data media sources for committing and retrieving said media in the form of search results, said filtering means further comprising a first directional means for activating each of a plane search, a media capture, and a second directional means;
a results file having a results counter;
means for providing that said results file calculate an amount of said search results produced by said search command performed by said user for said media;
means for creating a table for said search results;
platform means for allowing said user to see a layout of said media after said search results are retrieved from said external data media sources;
means for moving each of said search results independently out of said table onto said platform means;
means for activating a juxtaposition process to allow each of said manipulation commands to be performed by said user on said search results after such removal out of said table onto said platform means; and
,means for providing a mirror image of said media after said manipulation commands are performed as a preliminary view before an epic view projection of said media, wherein said epic view projection utilizes said navigation commands as part of a presentation program. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A method in a computer for vocally assembling, manipulating, and displaying a plurality of media searched and retrieved from variable external data media sources based on preferences of said user, comprising the steps of:
-
categorizing a command converted from voice to text as a search command, manipulation command, or navigation command;
filtering said search command to allow access to each of said external data media sources based on said preferences of said user;
identifying each of said media relevant to said search command;
retrieving said media;
preparing said media for a vocal transformation based on said manipulation command on a platform having implemented therein a plurality of hooks on a grid whereon said media is hung having active corners and borders;
simulcasting said media on a local display;
outputting said media onto an epic display; and
assembling said media based on each of said navigation commands. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34)
organizing said media in a table created by a mini-processor; and
,moving said media from said table onto said platform using a hanging command, wherein said hanging command calculates a placement of said media onto each of said hooks.
-
-
33. The method of claim 26, wherein said vocal transformation from said manipulation command includes at least one of the following performed on said media by said user:
- moving said media up, down, left, or right;
placing a transparent foreground over a re-proportioned background;
clipping said media;
pausing said media;
opening up a dual screen; and
zooming in or out.
- moving said media up, down, left, or right;
-
34. The method of claim 26, wherein after the step of assembling said media, said user can record said outputting of said media.
Specification