Media presentation system controlled by voice to text commands

US 6,718,308 B1
Filed: 07/07/2000
Issued: 04/06/2004
Est. Priority Date: 02/22/2000
Status: Expired due to Fees

First Claim

Patent Images

1. A system utilizing a computer for enabling a user to vocally assemble, manipulate, and display a plurality of media searched and retrieved from variable external data media sources based on preferences of said user, comprising:

a voice recognition module working in conjunction with said computer for converting an inputted voice utterance into computer-readable text in the form of a plurality of search commands, manipulation commands, and navigation commands;

filtering means for taking said search commands after such conversion by said voice-recognition means and identifying one of said external data media sources for committing and retrieving said media;

juxtapositioning means for preparing said media after such retrieving by said filtering means for local display;

platform means whereon said media is vocally manipulated and organized based on each of said manipulation commands performed by said user after such preparing from said juxtapositioning means; and

, means for providing a mirror image of said media after such manipulation and organization as a preliminary view before an epic view projection of said media wherein said epic view projection utilizes each of said navigation commands as part of a presentation program.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and method for searching, assembling, and manipulating a variety of multi-media using voice converted to text commands. Digital images, movies, audio, or text is verbally searched and retrieved from a variety of video and audio databases using a combination of directional commands and a means for juxtaposing and assembling search results. The desired media is then placed onto a platform means for manipulating and editing the media files. Any retrieved media files and/or images can be manipulated and assembled on-screen using commands such as “zoom” or “move left” by having corners and borders read by the grid of the platform means. The image(s) are also capable of being stacked, or overlay one another to define re-proportioned backgrounds. The image(s) from the platform means are displayed without the grid using an image platter as a means of providing a preliminary view of the presentation prior to projection. The system allows for the hand-free assembly and editing of music and movies, and provides a means for verbally assembling pre-planned or impromptu presentations comprising video or audio clips, digital images, or text retrieved from multiple local and remote databases, such as a DVD movie-base or the World Wide Web.

167 Citations

34 Claims

1. A system utilizing a computer for enabling a user to vocally assemble, manipulate, and display a plurality of media searched and retrieved from variable external data media sources based on preferences of said user, comprising:
- a voice recognition module working in conjunction with said computer for converting an inputted voice utterance into computer-readable text in the form of a plurality of search commands, manipulation commands, and navigation commands;
  
  filtering means for taking said search commands after such conversion by said voice-recognition means and identifying one of said external data media sources for committing and retrieving said media;
  
  juxtapositioning means for preparing said media after such retrieving by said filtering means for local display;
  
  platform means whereon said media is vocally manipulated and organized based on each of said manipulation commands performed by said user after such preparing from said juxtapositioning means; and
  
  , means for providing a mirror image of said media after such manipulation and organization as a preliminary view before an epic view projection of said media wherein said epic view projection utilizes each of said navigation commands as part of a presentation program.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
- - 2. The system of claim 1, wherein said media includes text, audio, still images, movies, and on-line resources.
  - 3. The system of claim 2, wherein said text is closed captioned text from each of said movies.
  - 4. The system of claim 2, wherein said text is text from scanned music lyric sheets.
  - 5. The system of claim 1, wherein said external data media sources include a plurality of both local and remote databases.
  - 6. The system of claim 5, wherein said local database is a DVD tower.
  - 7. The system of claim 5, wherein said local database is a CD tower.
  - 8. The system of claim 5, wherein said local database is a jukebox tower.
  - 9. The system of claim 5, wherein said remote database is a virtual movie provider.
  - 10. The system of claim 5, wherein said remote database is a music provider.
  - 11. The system of claim 10, wherein said music provider is an MP3 archive.
  - 12. The system of claim 1, wherein said platform means further comprises hooks dispersed around a grid whereon said media is hung by having corners and borders of said media read by said hooks and said grid.

13. A system utilizing a computer for enabling a user to vocally assemble and display a plurality of media searched and retrieved from variable external data media sources and manipulated based on preferences of said user, comprising:
- a voice recognition module working in conjunction with said computer for converting an inputted voice utterance into computer-readable text in the form of a plurality of search commands, manipulation commands, and navigation commands;
  
  filtering means for taking each of said search commands after such conversion by said voice-recognition means and identifying one of said external data media sources for committing and retrieving said media in the form of search results, said filtering means further comprising a first directional means for activating each of a plane search, a media capture, and a second directional means;
  
  a results file having a results counter;
  
  means for providing that said results file calculate an amount of said search results produced by said search command performed by said user for said media;
  
  means for creating a table for said search results;
  
  platform means for allowing said user to see a layout of said media after said search results are retrieved from said external data media sources;
  
  means for moving each of said search results independently out of said table onto said platform means;
  
  means for activating a juxtaposition process to allow each of said manipulation commands to be performed by said user on said search results after such removal out of said table onto said platform means; and
  
  , means for providing a mirror image of said media after said manipulation commands are performed as a preliminary view before an epic view projection of said media, wherein said epic view projection utilizes said navigation commands as part of a presentation program.
- View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
- - 14. The system of claim 13, wherein said plane search identifies how close each of said media are relative to each of said search commands based on word proximity and image size inputted by said user.
  - 15. The system of claim 13, wherein said media capture is a downloader for each of said result files.
  - 16. The system of claim 13, wherein said second directional means prepares and activates a portal display window stream for a display of text with images.
  - 17. The system of claim 16, wherein said display of text with images is a web page.
  - 18. The system of claim 13, wherein said second directional means prepares and activates a feature-length link-up stream for a display of long-play audio and video.
  - 19. The system of claim 18, wherein said long-play audio and video is a soundtrack and movie.
  - 20. The system of claim 13, wherein said second directional means prepares and activates a latching means for performing a strip-down of extraneous data sent with each of said search results to said results file.
  - 21. The system of claim 13, wherein one of said manipulation commands is a shrouder command wherein a background image retrieved as said media based on said preferences is copied and re-sized to fit approximate dimensions of a retrieved foreground image.
  - 22. The system of claim 13, wherein one of said manipulation commands is a basic movement command.
  - 23. The system of claim 13, wherein one of said manipulation commands allows said user to clip videos.
  - 24. The system of claim 13, wherein one of said manipulation commands is a screen splitter command.
  - 25. The system of claim 13, wherein one of said manipulation commands is a zoom command.

26. A method in a computer for vocally assembling, manipulating, and displaying a plurality of media searched and retrieved from variable external data media sources based on preferences of said user, comprising the steps of:
- categorizing a command converted from voice to text as a search command, manipulation command, or navigation command;
  
  filtering said search command to allow access to each of said external data media sources based on said preferences of said user;
  
  identifying each of said media relevant to said search command;
  
  retrieving said media;
  
  preparing said media for a vocal transformation based on said manipulation command on a platform having implemented therein a plurality of hooks on a grid whereon said media is hung having active corners and borders;
  
  simulcasting said media on a local display;
  
  outputting said media onto an epic display; and
  
  assembling said media based on each of said navigation commands.
- View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34)
- - 27. The method of claim 26, wherein for the step of filtering said search command, multiple directionals are simultaneously performing a series of pre-set scripts for a desired characteristic of said media.
  - 28. The method of claim 26, wherein after the step of retrieving said media, said media is stored in a results file.
  - 29. The method of claim 28, wherein said results file is accessed by a latcher command, said latcher command strips down any extraneous data included within or around said media based on said preferences of said user.
  - 30. The method of claim 29, wherein said extraneous data is an advertisement.
  - 31. The method of claim 29, wherein said extraneous data is text.
  - 32. The method of claim 26, wherein the step of preparing said media further comprises the steps of:
33. The method of claim 26, wherein said vocal transformation from said manipulation command includes at least one of the following performed on said media by said user:
- moving said media up, down, left, or right;
  
  placing a transparent foreground over a re-proportioned background;
  
  clipping said media;
  
  pausing said media;
  
  opening up a dual screen; and
  
  zooming in or out.
34. The method of claim 26, wherein after the step of assembling said media, said user can record said outputting of said media.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Daniel L. Nolting
Original Assignee
Daniel L. Nolting
Inventors
Nolting, Daniel L.
Primary Examiner(s)
Chawan, Vijay
Assistant Examiner(s)
Storm, Donald L.

Application Number

US09/611,782
Time in Patent Office

1,369 Days
Field of Search

704/275, 704/270, 704/270.1, 345/731, 345/728
US Class Current

704/275
CPC Class Codes

G10L 15/26 Speech to text systems G10L...

Media presentation system controlled by voice to text commands

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

167 Citations

34 Claims

Specification

Solutions

Use Cases

Quick Links

Media presentation system controlled by voice to text commands

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

167 Citations

34 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links