US-11062738

Signalling of video content including sub-picture bitstreams for video coding

Published: July 13, 2021
Assignee: not available in the USPTO data we have
Inventors: not available in the USPTO data we have
Technical Abstract

In various implementations, modifications and/or additions to the ISOBMFF are provided to process video data. A plurality of sub-picture bitstreams are obtained from memory, each sub-picture bitstream including a spatial portion of the video data and each sub-picture bitstream being independently coded. In at least one file, the plurality of sub-picture bitstreams are respectively stored as a plurality of sub-picture tracks. Metadata describing the plurality of sub-picture tracks is stored in a track box within a media file in accordance with a file format. A sub-picture base track is provided that includes the metadata describing the plurality of sub-picture tracks.
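
As an illustration of how metadata for a sub-picture track might be carried in a track box, the sketch below serializes an ISOBMFF track group box (`trgr`) containing one track-group-type box. The size/type header, the version and flags bytes, and the 32-bit `track_group_id` follow the general ISOBMFF box layout. Everything else is an assumption made for this example: the four-character code `subp`, the function names, and the 16-bit fields for row index, column index, upper-left corner, and full-picture size are hypothetical and are not taken from the patent or from any standard.

```python
import struct

# Hypothetical four-character code for the sub-picture track group type.
# The abstract and claims do not name a specific value.
SUBPIC_GROUP_TYPE = b"subp"


def box(box_type: bytes, payload: bytes) -> bytes:
    """Serialize a plain ISOBMFF box: 32-bit size, 4-byte type, payload."""
    assert len(box_type) == 4
    return struct.pack(">I4s", 8 + len(payload), box_type) + payload


def full_box(box_type: bytes, payload: bytes, version: int = 0, flags: int = 0) -> bytes:
    """Serialize a FullBox: a box whose payload starts with version and 24-bit flags."""
    header = struct.pack(">B", version) + flags.to_bytes(3, "big")
    return box(box_type, header + payload)


def sub_picture_track_group(track_group_id: int, row: int, col: int,
                            x: int, y: int, full_w: int, full_h: int) -> bytes:
    """Build a 'trgr' box marking a track as one sub-picture of a group.

    track_group_id is the standard field of a track-group-type box.
    The six 16-bit fields after it are an assumed layout for the
    row/column index, upper-left corner, and full-picture size.
    """
    payload = struct.pack(">I", track_group_id)
    payload += struct.pack(">HHHHHH", row, col, x, y, full_w, full_h)
    return box(b"trgr", full_box(SUBPIC_GROUP_TYPE, payload))


if __name__ == "__main__":
    data = sub_picture_track_group(1, row=0, col=1, x=1920, y=0,
                                   full_w=3840, full_h=1920)
    print(len(data), data[4:8], data[12:16])
```

Tracks that share the same group type and the same `track_group_id` would be read as members of one group, which is how a reader could tell that a set of sub-picture tracks together carries the full scene.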

Patent Claims
35 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method of processing omnidirectional video data, comprising: obtaining, from memory, a plurality of sub-picture bitstreams associated with an omnidirectional scene, each sub-picture bitstream including video data specifying a respective subset of spatial area of an entire spatial area of the omnidirectional scene, where each respective subset of spatial area is less than the entire spatial area of the omnidirectional scene and each sub-picture bitstream is independently coded; respectively storing, in at least one file, the plurality of sub-picture bitstreams associated with the omnidirectional scene as a plurality of sub-picture tracks associated with the omnidirectional scene; and setting, in a track box within a media file in accordance with a file format, a track group type of a sub-picture track among the plurality of sub-picture tracks to a value that indicates that the sub-picture track belongs to a group of sub-picture tracks that carries the omnidirectional scene, the group of sub-picture tracks being the plurality of sub-picture tracks.

2. The method of claim 1, wherein the media file includes the at least one file.

3. The method of claim 1, further comprising: storing a sub-picture base track associated with the plurality of sub-picture tracks.

4. The method of claim 3, further comprising: indicating the sub-picture base track based on a sample entry type within a sample description box of the track box.

5. The method of claim 3, wherein the sub-picture base track includes metadata describing the plurality of sub-picture tracks.

6. The method of claim 1, further comprising including metadata that comprises a spatial resolution of a full picture associated with a set of the plurality of sub-picture tracks.

7. The method of claim 1, wherein the file format is based on an International Standards Organization (ISO) base media file format.

8. The method of claim 1, wherein the track group type is a backward compatible extension to the TrackGroupTypeBox set forth in clause 8.3.4 of the ISO base media file format identified by ISO/IEC 14496-12, Part 12.

9. The method of claim 1, further comprising: storing, within the track box, a track group type box designating the track group type.

10. The method of claim 9, wherein the track group type indicates that additional information is available to be added to the track box.

11. The method of claim 10, wherein when the track group type indicates that additional information is available to be added to the track box, storing, within the track box, sub-picture row index information and sub-picture column index information of sub-pictures in the track.

12. The method of claim 10, wherein sub-pictures encoded within the sub-picture bitstream are defined in sub-picture rows and sub-picture columns.

13. The method of claim 10, wherein sub-pictures encoded within the sub-picture bitstream are groupable as tiles such that the tiles are defined tile rows and tile columns.

14. The method of claim 10, wherein when the track group type indicates that additional information is available to be added to the track box, storing, within the track box, coordinates corresponding to an upper-left corner pixel of sub-picture in the track.

15. The method of claim 10, wherein when the track group type indicates that additional information is available to be added to the track box, storing, within the track box, information representing a size of a full picture.

16. A device for decoding omnidirectional video data, the device comprising: memory configured to store at least one file and a media file, the media file being formatted in accordance with a file format and the at least one file including a plurality of sub-picture tracks associated with an omnidirectional scene, each sub-picture track comprising a respective sub-picture bitstream that includes video data specifying a respective subset of spatial area of an entire spatial area of the omnidirectional scene where each respective subset of spatial area is less than the entire spatial area of the omnidirectional scene and each sub-picture bitstream is independently coded; and a processor, in communication with the memory, the processing being configured to: process the media file; and determine, from a track box within the media file and for a sub-picture track among the plurality of sub-picture tracks associated with the omnidirectional scene, that the sub-picture track belongs to a group of sub-picture tracks that carries the omnidirectional scene based on a value of a track group type of the sub-picture track, the group of sub-picture tracks being the plurality of sub-picture tracks included within the at least one file.

17. The device of claim 16, wherein the media file includes the at least one file.

18. The device of claim 16, wherein the processor is further configured to obtain, from the media file, a sub-picture base track associated with the plurality of sub-picture tracks.

19. The device of claim 18, wherein the processor is further configured to determine, based on a sample entry type within a sample description box of the track box, a presence of the sub-picture base track.

20. The device of claim 18, wherein the processor is further configured to obtain, from the sub-picture base track, metadata describing the plurality of sub-picture tracks.

21. The device of claim 16, wherein the processor is further configured to process metadata comprising a spatial resolution of a full picture associated with a set of the plurality of sub-picture tracks.

22. The device of claim 16, wherein the file format is based on an International Standards Organization (ISO) base media file format.

23. The device of claim 16, wherein the processor is further configured to obtain, from a track group type box of the track box, sub-picture row index information and sub-picture column index information of sub-pictures in the track.

24. A device for processing omnidirectional video data, the device comprising: memory configured to store a plurality of sub-picture bitstreams associated with an omnidirectional scene, each sub-picture bitstream including video data specifying a respective subset of spatial area of an entire spatial area of the omnidirectional scene, where each respective subset of spatial area is less than the entire spatial area of the omnidirectional scene and each sub-picture bitstream is independently coded; and a processor in communication with the memory the processor being configured to: obtain, from the memory, the plurality of sub-picture bitstreams associated with the omnidirectional scene; respectively store, in at least one file, the plurality of sub-picture bitstreams as a plurality of sub-picture tracks associated with the omnidirectional scene; and set, in a track box within a media file in accordance with a file format, a track group type of a sub-picture track among the plurality of sub-picture tracks to a value that indicates that the sub-picture track belongs to a group of sub-picture tracks that carries the omnidirectional scene, the group of sub-picture tracks being the plurality of sub-picture tracks.

25. The device of claim 24, wherein the media file includes the at least one file.

26. The device of claim 24, wherein the processor is further configured to store a sub-picture base track associated with the plurality of sub-picture tracks.

27. The device of claim 26, wherein the processor is further configured to set a value of a sample entry type within a sample description box of the track box, the value indicating a presence of the sub-picture base track.

28. The device of claim 26, wherein the sub-picture base track includes metadata describing the plurality of sub-picture tracks.

29. The device of claim 24, wherein the processor is further configured to include metadata that comprises a spatial resolution of a full picture associated with a set of the plurality of sub-picture tracks.

30. The device of claim 24, wherein the file format is based on an International Standards Organization (ISO) base media file format.

31. The device of claim 26, wherein the processor is further configured to store, within a track group type box of the track box, sub-picture row index information and sub-picture column index information of sub-pictures in the track.

32. The device of claim 24, wherein the device is a wireless communication device, further comprising: a transmitter configured to transmit the plurality of sub-picture bitstreams.

33. The device of claim 32, wherein the wireless communication device is a cellular telephone and the plurality of sub-picture bitstreams are further modulated for transmission according to a cellular communication standard.

34. A device comprising: means for obtaining, from memory, a plurality of sub-picture bitstreams associated with an omnidirectional scene, each sub-picture bitstream including video data specifying a respective subset of spatial area of an entire spatial area of the omnidirectional scene, where each respective subset of spatial area is less than the entire spatial area of the omnidirectional scene and each sub-picture bitstream is independently coded; means for respectively storing, in at least one file, the plurality of sub-picture bitstreams associated with the omnidirectional scene as a plurality of sub-picture tracks associated with the omnidirectional scene; and means for setting, in a track box within a media file in accordance with a file format, a track group type of a sub-picture track among the plurality of sub-picture tracks to a value that indicates that the sub-picture track belongs to a group of sub-picture tracks that carries the omnidirectional scene, the group of sub-picture tracks being the plurality of sub-picture tracks.

35. A non-transitory computer-readable medium having instructions stored thereon that, when executed, cause one or more processors of an electronic device to: obtain, from memory, a plurality of sub-picture bitstreams associated with an omnidirectional scene, each sub-picture bitstream including video data specifying a respective subset of spatial area of an entire spatial area of the omnidirectional scene, where each respective subset of spatial area is less than the entire spatial area of the omnidirectional scene and each sub-picture bitstream is independently coded; respectively store, in at least one file, the plurality of sub-picture bitstreams associated with the omnidirectional scene as a plurality of sub-picture tracks associated with the omnidirectional scene; and set, in a track box within a media file in accordance with a file format, a track group type of a sub-picture track among the plurality of sub-picture tracks to a value that indicates that the sub-picture track belongs to a group of sub-picture tracks that carries the omnidirectional scene, the group of sub-picture tracks being the plurality of sub-picture tracks.
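
The reading side described in the device claims (determining from a track box that a track belongs to a sub-picture group, and obtaining row and column indices) can be sketched roughly as follows. This is a minimal illustration, not the claimed implementation: the four-character code `subp`, the function names, and the 16-bit row and column fields placed right after `track_group_id` are hypothetical. Only the generic box header (32-bit size, 4-byte type) and the version/flags prefix of a track-group-type box follow the ISOBMFF layout.

```python
import struct
from collections import defaultdict

# Hypothetical four-character code for the sub-picture track group type.
SUBPIC_GROUP_TYPE = b"subp"


def iter_boxes(buf: bytes):
    """Yield (type, payload) for each box laid end to end in buf.

    Only the common 32-bit size form is handled; size values 0 and 1
    (box extends to end of file, 64-bit size) are rejected here.
    """
    pos = 0
    while pos + 8 <= len(buf):
        size, box_type = struct.unpack_from(">I4s", buf, pos)
        if size < 8 or pos + size > len(buf):
            raise ValueError("malformed or unsupported box size")
        yield box_type, buf[pos + 8:pos + size]
        pos += size


def sub_picture_groups(trak_payloads: dict) -> dict:
    """Map track_group_id -> {(row, col): track_id}.

    trak_payloads maps a track id to the payload bytes of its track box.
    """
    groups = defaultdict(dict)
    for track_id, trak in trak_payloads.items():
        for box_type, payload in iter_boxes(trak):
            if box_type != b"trgr":
                continue
            for group_type, body in iter_boxes(payload):
                if group_type != SUBPIC_GROUP_TYPE:
                    continue
                # Skip the 4 bytes of version and flags, then read the
                # group id and the assumed 16-bit row and column indices.
                group_id, row, col = struct.unpack_from(">IHH", body, 4)
                groups[group_id][(row, col)] = track_id
    return dict(groups)
```

A player could use the resulting grid to pick only the tracks covering the current viewport and to place each decoded sub-picture at its position in the full picture.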

Patent Metadata

Filing Date: March 21, 2018
Publication Date: July 13, 2021

Signalling of video content including sub-picture bitstreams for video coding — Ye-kui Wang | Patentable