US-9271048

Systems and methods for immersive viewing experience

Published: February 23, 2016
Assignee: not available in the USPTO data we have
Inventors: not available in the USPTO data we have
Technical Abstract

Described herein are methods and systems that may help to provide selectable viewing options for a television program. An exemplary method involves: (i) receiving a television video transport stream comprising video content associated with a particular television program, wherein the television video transport stream comprises focal-point metadata regarding at least one focus point, wherein the at least one focus point corresponds to a sub-frame within at least one frame of the video content, (ii) receiving focal-point input data indicating a zoom request, (iii) processing video content in response to the focal-point input data, and (iv) generating a television video output signal comprising video content that is zoomed to the sub-frame, wherein the television video output signal is configured to be displayable on a graphic display.
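As a rough illustration of step (iv), zooming to a sub-frame can be treated as cropping the frame to the sub-frame's rectangle and scaling the crop to the output size. The sketch below is not taken from the patent: the function name zoom_to_subframe, the (x0, y0, x1, y1) box layout, and the nearest-neighbour scaling are all assumptions chosen for brevity.

```python
def zoom_to_subframe(frame, box, out_w, out_h):
    """Crop `frame` to `box` and scale the crop to out_w x out_h.

    frame: list of rows, each row a list of pixel values.
    box:   (x0, y0, x1, y1); top-left inclusive, bottom-right exclusive
           (a hypothetical layout, not defined by the patent).
    Scaling uses nearest-neighbour sampling, the simplest possible choice.
    """
    x0, y0, x1, y1 = box
    crop_w, crop_h = x1 - x0, y1 - y0
    if crop_w <= 0 or crop_h <= 0:
        raise ValueError("sub-frame must have positive width and height")
    out = []
    for oy in range(out_h):
        # Map each output row back to a source row inside the sub-frame.
        sy = y0 + oy * crop_h // out_h
        row = [frame[sy][x0 + ox * crop_w // out_w] for ox in range(out_w)]
        out.append(row)
    return out


# A 4x4 test frame whose pixel value encodes its position (10*row + column).
frame = [[10 * r + c for c in range(4)] for r in range(4)]
# Zoom to the central 2x2 sub-frame and fill the full 4x4 output.
zoomed = zoom_to_subframe(frame, (1, 1, 3, 3), 4, 4)
```

A real receiver would use a hardware scaler or a filtered resampler rather than nearest-neighbour sampling; the point here is only the crop-then-scale structure.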

Patent Claims
25 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method, comprising: receiving a television video transport stream comprising video content associated with a particular television program, wherein the television video transport stream comprises focal-point metadata indicating at least one dynamic focus point for a zoom function in the video content, wherein the at least one dynamic focus point corresponds to a first sub-frame within a first frame of the video content and a second sub-frame within a second frame of the video content that is subsequent to the first frame; receiving focal-point input data indicating a zoom request for a particular dynamic focus point; and in response to receiving the focal-point input data: processing the video content, based on the focal-point metadata and the movement metadata, to generate a television video output signal, wherein the movement metadata comprises a motion vector indicating movement of the at least one focus point between the first frame and the second frame, and wherein processing the video content comprises: (a) generating, based on a comparison of the first sub-frame to the second sub-frame, the motion vector indicating movement of the at least one focus point between the first frame and the second frame, wherein the motion vector comprises both a directional component and a magnitude component; and (b) determining, based on the motion vector that indicates movement of the at least one dynamic focus point between the first frame and the second frame, a third subframe that corresponds to an estimated location of the dynamic focus point in a third frame of the video content that is subsequent to the second frame; and outputting the television video output signal to a graphic display, wherein the television video output signal comprises video content that is zoomed to the particular dynamic focus point in the first, second, and third frames of the video content.
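Steps (a) and (b) of claim 1 can be pictured as a linear extrapolation: compare the first and second sub-frames to obtain a displacement, then apply the same displacement to the second sub-frame to estimate the third. The claim does not prescribe a formula, so the sketch below is only one possible reading. The SubFrame type, the use of box centres for the comparison, the constant-velocity assumption, and the clamping to the frame bounds are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubFrame:
    """Hypothetical sub-frame rectangle in pixel coordinates."""
    x0: int
    y0: int
    x1: int
    y1: int

    def center(self):
        return ((self.x0 + self.x1) / 2, (self.y0 + self.y1) / 2)


def motion_between(first, second):
    """Displacement (dx, dy) of the focus point from first to second sub-frame."""
    (ax, ay), (bx, by) = first.center(), second.center()
    return (bx - ax, by - ay)


def estimate_third(second, motion, frame_w, frame_h):
    """Shift the second sub-frame by the same displacement (constant velocity
    is assumed) and clamp the result so it stays inside the frame."""
    dx, dy = round(motion[0]), round(motion[1])
    w, h = second.x1 - second.x0, second.y1 - second.y0
    x0 = min(max(second.x0 + dx, 0), frame_w - w)
    y0 = min(max(second.y0 + dy, 0), frame_h - h)
    return SubFrame(x0, y0, x0 + w, y0 + h)


first = SubFrame(100, 200, 500, 425)
second = SubFrame(130, 190, 530, 415)
motion = motion_between(first, second)                 # (30.0, -10.0)
third = estimate_third(second, motion, 3840, 2160)     # SubFrame(160, 180, 560, 405)
```

The displacement is stored here as Cartesian components; the directional and magnitude components named in the claim carry the same information in polar form.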

2. The method of claim 1, wherein the focal-point input data indicating the zoom request is received via a graphical user interface that facilitates a selection of the particular dynamic focus point.

3. The method of claim 1, wherein at least a portion of the television video transport stream is configured to be displayed at an Ultra HD resolution.

4. The method of claim 1, wherein the focal-point metadata is provided in separate packets of the standard television video transport stream.

5. The method of claim 4, wherein the separate packets are provided via an advanced program guide or through MPEG-2 private section packets.
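To make the "MPEG-2 private section" option more concrete, the sketch below wraps a focus-point record in a short-form private section header (8-bit table_id, then section_syntax_indicator, private_indicator, two reserved bits, and a 12-bit length, as laid out in ISO/IEC 13818-1). Everything about the payload is an assumption: the patent does not define a byte layout, so pack_focus_point, its field order, the table_id value 0x80, and the private_indicator setting are hypothetical. Splitting the section into 188-byte transport packets is omitted.

```python
import struct

MAX_PRIVATE_LENGTH = 4093  # largest value allowed for private_section_length


def pack_focus_point(point_id, x0, y0, x1, y1):
    """Hypothetical payload: 1-byte focus-point id, then two corner
    coordinates as four big-endian 16-bit values."""
    return struct.pack(">B4H", point_id, x0, y0, x1, y1)


def build_private_section(table_id, payload):
    """Wrap `payload` in a short-form private section
    (section_syntax_indicator = 0, so no extended header or CRC)."""
    if not 0 <= table_id <= 0xFF:
        raise ValueError("table_id must fit in 8 bits")
    if len(payload) > MAX_PRIVATE_LENGTH:
        raise ValueError("payload too long for one private section")
    flags_and_length = (
        (0 << 15)        # section_syntax_indicator: short form
        | (1 << 14)      # private_indicator: user-defined, set arbitrarily here
        | (0b11 << 12)   # reserved bits, conventionally set to 1
        | len(payload)   # 12-bit private_section_length
    )
    return struct.pack(">BH", table_id, flags_and_length) + payload


# 0x80 is an arbitrary pick from the user-private table_id range.
section = build_private_section(0x80, pack_focus_point(1, 100, 200, 500, 425))
```

Carrying the metadata this way keeps it separate from the video elementary stream, so a receiver that does not understand the private table can simply ignore it.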

6. The method of claim 1, wherein the focal-point metadata is provided in a packet header section of a packet.

7. The method of claim 1, wherein a display format of the television video output signal is a picture-in-picture arrangement, a split-screen arrangement, or a full screen display.

8. The method of claim 1, further comprising: receiving video content associated with a plurality of different camera views of the particular television program, wherein the video data from at least one of the plurality of different camera views comprises focal-point metadata regarding the at least one dynamic focus point; receiving camera selection input data indicating a camera selection request; processing the video content in response to the camera selection input data; and generating a television video output signal comprising video content that is associated with one of the camera selection input data, the focal-point input data, and the camera selection input data and the focal-point input data.

9. The method of claim 8, wherein the one of the camera selection input data, the focal-point input data, and the camera selection input data and the focal-point input data is obtained by way of a graphical user interface that facilitates a selection of one of the camera selection input data, the focal-point input data, or the camera selection input data and the focal-point input data.

10. The method of claim 8, wherein the focal-point metadata further comprises a focal-point type.

11. The method of claim 10, wherein the focal-point input data is associated with the focal-point type.

12. The method of claim 11, further comprising: receiving focal-point type input data indicating a focal-point type request, wherein the focal-point type input data is obtained by way of a graphical user interface that facilitates a selection of the focal-point type, wherein the input data indicating the zoom request is obtained by way of a graphical user interface that facilitates a selection of the at least one focus point; processing the video content in response to the focal-point type input data; and generating a television video output signal comprising video content that is associated with the focal-point type input data, wherein the television video output signal is configured to be displayable on a graphic display.

13. An apparatus, comprising: a receiver configured to: receive a video transport stream comprising video content associated with a particular television program, wherein the television video transport stream comprises focal-point metadata indicating at least one dynamic focus point for a zoom function in the video content, wherein the at least one dynamic focus point corresponds to a first sub-frame within a first frame of the video content and a second sub-frame within a second frame of the video content that is subsequent to the first frame; receive focal-point input data indicating a zoom request for a particular dynamic focus point; and in response to receipt of the focal-point input data: process the video content, based on the focal-point metadata and the movement metadata, to generate a television video output signal, wherein the movement metadata comprises a motion vector indicating movement of the at least one focus point between the first frame and the second frame, and wherein processing the video content comprises: (a) generating, based on a comparison of the first sub-frame to the second sub-frame, the motion vector indicating movement of the at least one focus point between the first frame and the second frame, wherein the motion vector comprises both a directional component and a magnitude component; and (b) determining, based on the motion vector that indicates movement of the at least one dynamic focus point between the first frame and the second frame, a third subframe that corresponds to an estimated location of the dynamic focus point in a third frame of the video content that is subsequent to the second frame; and output the television video output signal to a graphic display, wherein the television video output signal comprises video content that is zoomed to the particular dynamic focus point in at least the first, second, and third frames of the video content.

14. A method, comprising receiving a plurality of television video transport streams comprising video content for a particular television program, wherein the plurality of television video transport streams comprises video content associated with a plurality of different camera views of the particular television program, wherein one or more of the plurality of television video transport streams further comprises focal-point metadata indicating at least one dynamic focus point for a zoom function in the video content, wherein the at least one dynamic focus point corresponds to a first sub-frame a first frame of the video content and a second sub-frame within a second frame of the video content that is subsequent to the first frame; identifying the plurality of different camera views; receiving camera selection input data indicating a camera selection request; processing the video content, based on the focal-point metadata, the movement metadata, and the camera selection input data, to generate a television video output signal, wherein the movement metadata comprises a motion vector indicating movement of the at least one focus point between the first frame and the second frame, and wherein processing the video content comprises: (a) generating, based on a comparison of the first sub-frame to the second sub-frame, the motion vector indicating movement of the at least one focus point between the first frame and the second frame, wherein the motion vector comprises both a directional component and a magnitude component; and (b) determining, based on the motion vector that indicates movement of the at least one dynamic focus point between the first frame and the second frame, a third subframe that corresponds to an estimated location of the dynamic focus point in a third frame of the video content that is subsequent to the second frame; and outputting the television video output signal to a graphic display, wherein the television video output signal comprises video content that is: (a) zoomed to the particular dynamic focus point in the first, second, and third frames of the video content, and (b) associated with the camera selection request.

15. A method, comprising: receiving streaming data comprising video content associated with at least one live stream for a particular television program; generating focal-point metadata indicating at least one dynamic focus point for application of a zoom function in the video content, wherein the at least one dynamic focus point corresponds to a first sub-frame within a first frame of the video content and a second sub-frame within a second frame of the video content that is subsequent to the first frame; generating, based at least in part on a comparison of the first sub-frame to the second sub-frame, movement metadata, wherein the generated movement metadata comprises a motion vector indicating movement of the at least one focus point between the first frame and the second frame, and wherein the motion vector comprises both a directional component and a magnitude component; generating a television video transport stream comprising: (a) the video content, (b) the focal-point metadata, and (c) the movement metadata; and transmitting the television video transport stream including the video content and the focal-point metadata indicating the at least one dynamic focus point, by way of a single television channel, so as to facilitate a receiver function to: (i) process the video content, based on the focal-point metadata and the movement metadata, and generate a television video output signal based on the motion vector, wherein the television video output signal comprises a third subframe that corresponds to an estimated location of the dynamic focus point in a third frame of the video content that is subsequent to the second frame, and (ii) output the television video output signal, to a graphic display, wherein the outputted television video output signal comprises video content that is zoomed to the particular dynamic focus point in the first, second, and third frames of the video content.

16. The method of claim 15, wherein generating focal-point metadata further includes defining a first pair of coordinates as opposing corners of a first box, wherein the first pair of coordinates represents a first sub-frame within a first frame of the video content and defining a second pair of coordinates as opposing corners of a second box, wherein the second pair of coordinates represents a different sub-frame within a second frame of the video content.
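The corner-pair representation in claim 16 can be sketched as follows. Two opposing corners fully determine an axis-aligned box, regardless of which diagonal is supplied. The function name box_from_corners and the left/top/width/height output layout are assumptions for illustration; the patent only states that a pair of coordinates defines the box.

```python
def box_from_corners(corner_a, corner_b):
    """Turn two opposing corners, given in either order, into a box.

    Each corner is an (x, y) pixel coordinate. Sorting each axis means
    the caller may pass top-left/bottom-right or the other diagonal.
    """
    (ax, ay), (bx, by) = corner_a, corner_b
    left, right = sorted((ax, bx))
    top, bottom = sorted((ay, by))
    return {"left": left, "top": top,
            "width": right - left, "height": bottom - top}


# First sub-frame in the first frame, second sub-frame in the second frame.
first_box = box_from_corners((100, 200), (500, 425))
# Corners supplied bottom-right first still give the same kind of box.
second_box = box_from_corners((530, 415), (130, 190))
```

Four integers per sub-frame keep the metadata small enough to send with every frame if needed.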

17. The method of claim 15, further comprising: generating vector metadata indicating the motion vector, wherein the motion vector is determined by comparing a current focus point sub-frame to a previous focus point sub-frame to generate direction data regarding a direction of movement and magnitude data regarding a magnitude of movement.
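One way to read the direction and magnitude data of claim 17 is as the polar form of the displacement between the centres of the previous and current sub-frames. The sketch below assumes that reading; the function name motion_vector, the (x0, y0, x1, y1) box layout, the use of box centres, and degrees as the angle unit are all assumptions, since the claim does not specify how the comparison is performed.

```python
import math


def motion_vector(previous_box, current_box):
    """Return (direction_degrees, magnitude_pixels) of the movement from
    previous_box to current_box. Boxes are (x0, y0, x1, y1) tuples.

    In image coordinates y grows downward, so a positive angle here
    means clockwise rotation from the +x axis as seen on screen.
    """
    px = (previous_box[0] + previous_box[2]) / 2
    py = (previous_box[1] + previous_box[3]) / 2
    cx = (current_box[0] + current_box[2]) / 2
    cy = (current_box[1] + current_box[3]) / 2
    dx, dy = cx - px, cy - py
    magnitude = math.hypot(dx, dy)
    direction = math.degrees(math.atan2(dy, dx))
    return direction, magnitude


# The focus point moves 30 px right and 40 px down between frames,
# so the magnitude is 50 px (a 3-4-5 triangle).
direction, magnitude = motion_vector((0, 0, 100, 100), (30, 40, 130, 140))
```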

18. The method of claim 15, further comprising: generating identification metadata indicating identification of the at least one live stream; and wherein generating a television video transport stream further includes the identification metadata.

19. The method of claim 15, wherein the television video transport stream is an Ultra HD video transport stream.

20. The method of claim 15, wherein generating a television video transport stream further comprises including the metadata in separate packets.

21. The method of claim 20, wherein the separate packets including the metadata are included in an advanced program guide or are included in an MPEG-2 private section.

22. The method of claim 15, wherein generating the television video transport stream further comprises including the metadata within the packet header section of a packet.

23. A broadcast system, comprising: a receiver configured to: receive streaming data comprising video content associated with at least one live stream for a particular television program; and a signal-generation system configured to: receive focal-point metadata that indicates at least one dynamic focus point for application a zoom function in the video content, wherein the at least one dynamic focus point corresponds to a first sub-frame within a first frame of the video content and a second sub-frame within a second frame of the video content that is subsequent to the first frame, and wherein a motion vector indicates movement of the at least one dynamic focus point between the first frame and the second frame; generate, based at least in part on a comparison of the first sub-frame to the second sub-frame, movement metadata, wherein the generated movement metadata comprises a motion vector indicating movement of the at least one focus point between the first frame and the second frame, and wherein the motion vector comprises both a directional component and a magnitude component; generate a television video transport stream that comprises: (a) the video content, (b) the focal-point metadata, and (c) the movement metadata, wherein generating the video content comprises determining, based on the motion vector that indicates movement of the at least one dynamic focus point between the first frame and the second frame, a third subframe that corresponds to an estimated location of the dynamic focus point in a third frame of the video content that is subsequent to the second frame, transmit the television video transport stream including the video content, the focal-point metadata, and the movement metadata, by way of a single television channel, so as to facilitate a receiver function to process the video content to output a television video output signal, to a graphic display, that comprises video content that is zoomed to the particular dynamic focus point in the first, second, and third frames of the video content.

24. The broadcast system of claim 23, wherein the receiver is configured to receive the focal-point metadata from the streaming data and send the focal-point metadata to the signal-generation system.

25. The broadcast system of claim 23, wherein, in order to receive the focal-point metadata, the signal-generation system is configured to generate the focal-point metadata.

Patent Metadata

Filing Date: December 13, 2013
Publication Date: February 23, 2016

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “Systems and methods for immersive viewing experience” (US-9271048). https://patentable.app/patents/US-9271048