PROCESS FOR INCREASING THE QUALITY OF EXPERIENCE FOR USERS THAT WATCH ON THEIR TERMINALS A HIGH DEFINITION VIDEO STREAM
First Claim
1. Process for increasing the Quality of Experience for users that watch on their terminals a high definition video stream captured by at least one video capturing device and provided by a server to which said users are connected through their terminals in a network, said process providing for:
- collecting, for each user of a sample of the whole audience of said video stream, at least information about the position of the gaze of said user on said video stream;
aggregating all of said collected information and analysing said aggregated information to identify the main regions of interest for said video stream according to the number of users'"'"' gazes positioned on said regions of interest;
selecting at least a region of interest of said video stream to be displayed on some terminals of said users;
said process wherein the video stream comprises several synchronised video views, the main regions of interest of said video stream being identified from processing said video views, said process providing for creating for each video view a 2D saliency map localising the regions of interests of said video view, and thus for creating from all of said 2D saliency maps a global 3D saliency map so as to identify the main regions of interest of the video stream.
8 Assignments
0 Petitions
Accused Products
Abstract
Process for increasing the Quality of Experience for users that watch on their terminals (1) a high definition video stream (2, I, V) captured by at least one video capturing device (3) and provided by a server (4) to which said users are connected through their terminals (1) in a network, said process providing for: —collecting, for each user of a sample of the whole audience of said video stream, at least information about the position of the gaze of said user on said video stream; —aggregating all of said collected information and analysing said aggregated information to identify the main regions of interest (R1, R2, R3, R4) for said video stream according to the number of users'"'"' gazes positioned on said regions of interest; —selecting at least a region of interest (R1, R2, R3) of said video stream to be displayed on some terminals (1) of said users.
-
Citations
16 Claims
-
1. Process for increasing the Quality of Experience for users that watch on their terminals a high definition video stream captured by at least one video capturing device and provided by a server to which said users are connected through their terminals in a network, said process providing for:
-
collecting, for each user of a sample of the whole audience of said video stream, at least information about the position of the gaze of said user on said video stream; aggregating all of said collected information and analysing said aggregated information to identify the main regions of interest for said video stream according to the number of users'"'"' gazes positioned on said regions of interest; selecting at least a region of interest of said video stream to be displayed on some terminals of said users; said process wherein the video stream comprises several synchronised video views, the main regions of interest of said video stream being identified from processing said video views, said process providing for creating for each video view a 2D saliency map localising the regions of interests of said video view, and thus for creating from all of said 2D saliency maps a global 3D saliency map so as to identify the main regions of interest of the video stream. - View Dependent Claims (3, 4, 6, 7, 16)
-
-
2. (canceled)
-
5. (canceled)
-
8. Engine for increasing the Quality of Experience for users that watch on their terminals a high definition video stream captured by at least one video capturing device and provided by a server to which said users are connected through their terminals in a network, said engine comprising:
-
at least a collector module for collecting, for each user of at least a sample of the whole audience of said video stream, at least information about the position of the gaze of said user on said video stream; at least an estimator module that comprises means for aggregating all of said collected information and means for analysing said aggregated information to identify the main regions of interest for said video stream according to the number of users'"'"' gazes positioned on said regions of interest; at least a selector module adapted for selecting at least a region of interest and for interacting with said server so that said selected region of interest will be displayed on some terminals of said users; said engine wherein the video stream comprises several synchronised video views and is provided from several video capturing devices, the collector module or the estimator module comprising means for creating for each video view a 2D saliency map localising the regions of interests of said video view, and the estimator module comprising means for creating from all of said 2D saliency maps a global 3D saliency map so as to identify the main regions of interest of the video stream. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. Architecture for a network for providing to users connected through their terminals a high definition video stream to be watched by said users on said terminals, said video stream being captured by at least one video capturing device, said architecture comprising:
-
an engine for increasing the Quality of Experience for users, comprising; at least a collector module for collecting, for each user of at least a sample of the whole audience of said video stream, at least information about the position of the gaze of said user on said video stream; at least an estimator module that comprises means for aggregating all of said collected information and means for analysing said aggregated information to identify the main regions of interest for said video stream according to the number of users'"'"' gazes positioned on said regions of interest; a selector module adapted for selecting at least a region of interest to be displayed on some terminals of said users; a server to which users are connected through their terminals, said server providing said high definition video stream to said users, said server further comprising; a focus module comprising means for interacting with the selector module of said engine to build at least one ROI video stream comprising a region of interest selected by said selector module; a streamer module comprising means for providing the ROI video stream to some of said users; said architecture wherein the video stream comprises several synchronised video views and is provided from several video capturing devices, the collector module or the estimator module comprising means for creating for each video view a 2D saliency map localising the regions of interests of said video view, and the estimator module comprising means for creating from all of said 2D saliency maps a global 3D saliency map so as to identify the main regions of interest of the video stream.
-
Specification