Patentable/Patents/US-11395003
US-11395003

System and method for segmenting immersive video

PublishedJuly 19, 2022
Assigneenot available in USPTO data we have
Inventorsnot available in USPTO data we have
Technical Abstract

Aspects of the subject disclosure may include, for example, identifying a field of view (FoV) within a first frame of a video that occupies a sub-portion of the frame. A size is determined by subdividing a portion of the video according to a tiling scenario. Tiles overlapping the FoV are detected, and an updated size is determined based on an encoded version of the tiles and compared to a reference size. Responsive to the updated size being less than the reference size, the reference size is adjusted to the updated size, the tiling scenario is adjusted according to an adjusted tiling scenario, and the determining of the size is repeated based on the adjusted tiling scenario. Responsive to the updated size not being less than the reference size, a preferred tiling scenario is identified based on the adjusted tiling scenario for tiling the video. Other embodiments are disclosed.

Patent Claims
20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1

1. A device, comprising: a processing system including a processor; and a memory that stores executable instructions that, when executed by the processing system, facilitate performance of operations, the operations comprising: obtaining an adjustable playback parameter of a communication device that renders immersive video content; obtaining crowdsource statistics regarding viewing of the immersive video content by a group of users; generating a heat map based on the crowdsource statistics, wherein the heat map indicates a frequency range of fields of view of the immersive video content by the group of users; obtaining historical observations of past views of the immersive video content; determining a field of view based on the adjustable playback parameter, heat map, and the historical observations; determining an initial encoded data size value based on a size of an encoded version of a first group of tiles overlapping the field of view, the first group of tiles determined according to an initial tile size, a portion of an immersive video content item segmented according to an initial tile size; initializing a reference encoded data size variable to the initial encoded data size value; determining a tiling scenario, wherein the determining of the tiling scenario comprises: determining an updated encoded data size based on a size of an encoded version of an updated group of tiles overlapping the field of view, the updated group of tiles determined according to a reduced tile size; and comparing the updated encoded data size to the reference encoded data size variable to obtain a comparison result; responsive to the comparison result indicating that the updated encoded data size is less than the reference encoded data size variable, repeating the determining of the tiling scenario according to a further-reduced tile size; and responsive to the comparison result indicating that the updated encoded data size is not less than the reference encoded data size variable, identifying the tiling scenario according to the reduced tile size, further reduction of the reduced tile size increasing the updated encoded data size, wherein a portion of an immersive video content item is segmented according to the tiling scenario.

2

2. The device of claim 1 , wherein the field of view occupies a corresponding area within a first video frame of the immersive video content item.

3

3. The device of claim 1 , wherein the tiling scenario comprises a coarse tiling scenario by which a video frame of the immersive video content item segmented according to the tiling scenario has a first number of associated tiles.

4

4. The device of claim 3 , wherein the updated group of tiles comprises a second number of associated tiles greater the first number of associated tiles according to the tiling scenario.

5

5. The device of claim 4 , wherein a difference between the first number of associated tiles and the second number of associated tiles is not less than a factor of two.

6

6. The device of claim 1 , wherein the portion of the immersive video content item comprises a plurality of consecutive video frames, and wherein the plurality of consecutive video frames is segmented according to the tiling scenario.

7

7. The device of claim 1 , wherein the portion of the immersive video content item comprises all video frames of the immersive video content item.

8

8. The device of claim 1 , wherein the determining of the tiling scenario comprises encoding the updated group of tiles resulting in the encoded version of the updated group of tiles.

9

9. The device of claim 8 , wherein the encoding of the updated group of tiles comprises video encoding the updated group of tiles.

10

10. The device of claim 9 , wherein the video encoding comprises one of HEVC encoding, VP9 encoding, AV1 encoding or any combination thereof.

11

11. A non-transitory, machine-readable storage medium, comprising executable instructions that, when executed by a processing system including a processor, facilitate performance of operations, the operations comprising: obtaining crowdsource statistics regarding viewing of an immersive video content by a group of users; generating a heat map based on the crowdsource statistics, wherein the heat map indicates a frequency range of fields of view of the immersive video content by the group of users; evaluating the heat map based on prior views of the immersive video content item; obtaining an adjustable playback parameter of a communication device that renders immersive video content; obtaining historical observations of past views of the immersive video content; identifying a field of view based on the heat map, the adjustable playback parameter, and the historical observations; determining an initial encoded data size value based on a size of an encoded version of a first group of tiles overlapping the field of view, the first group of tiles determined according to an initial tile size, a portion of the immersive video content item segmented according to an initial tile size; initializing a reference encoded data size variable to the initial encoded data size value; determining a tiling scenario, wherein the determining of the tiling scenario comprises: determining an updated encoded data size based on a size of an encoded version of an updated group of tiles overlapping the field of view, the updated group of tiles determined according to a reduced tile size; and comparing the updated encoded data size to the reference encoded data size variable to obtain a comparison result; responsive to the comparison result indicating that the updated encoded data size is less than the reference encoded data size variable, repeating the determining of the tiling scenario according to a further-reduced tile size; and responsive to the comparison result indicating that the updated encoded data size is not less than the reference encoded data size variable, identifying the tiling scenario according to the reduced tile size, further reduction of the reduced tile size increasing the updated encoded data size, wherein the portion of the immersive video content item is segmented according to the tiling scenario.

12

12. The non-transitory, machine-readable storage medium of claim 11 , wherein the tiling scenario comprises a coarse tiling scenario by which a video frame of the immersive video content item segmented according to the tiling scenario has a first number of associated tiles.

13

13. The non-transitory, machine-readable storage medium of claim 12 , wherein the updated group of tiles comprises a second number of associated tiles greater the first number of associated tiles according to the tiling scenario.

14

14. The non-transitory, machine-readable storage medium of claim 11 , wherein the portion of the immersive video content item comprises a plurality of consecutive video frames, and wherein the plurality of consecutive video frames is segmented according to the tiling scenario.

15

15. The non-transitory, machine-readable storage medium of claim 11 , wherein the determining of the tiling scenario comprises encoding the updated group of tiles resulting in the encoded version of the updated group of tiles.

16

16. The non-transitory, machine-readable storage medium of claim 15 , wherein the encoding comprises one of HEVC encoding, VP9 encoding, AV1 encoding or any combination thereof.

17

17. A method, comprising: obtaining, by a processing system including a processor, crowdsource statistics regarding viewing of an immersive video content by a group of users; generating, by the processing system, a heat map based on the crowdsource statistics, wherein the heat map indicates a frequency range of fields of view of the immersive video content by the group of users; obtaining historical observations of past views of the immersive video content; identifying, by the processing system, a default path of the immersive video content item based on the heat map and the historical observations of past views of the immersive video content item; obtaining, by the processing system, an adjustable playback parameter of a communication device that renders immersive video content; determining, by the processing system, a field of view based on the default path and the adjustable playback parameter; determining, by the processing system, an initial encoded data size value based on a size of an encoded version of a first group of tiles overlapping the field of view, the first group of tiles determined according to an initial tile size, a portion of the immersive video content item segmented according to an initial tile size; initializing, by the processing system, a reference encoded data size variable to the initial encoded data size value; determining, by the processing system, a tiling scenario, wherein the determining of the tiling scenario comprises: determining, by the processing system, an updated encoded data size based on a size of an encoded version of an updated group of tiles overlapping the field of view, the updated group of tiles determined according to a reduced tile size; and comparing, by the processing system, the updated encoded data size to the reference encoded data size variable to obtain a comparison result; responsive to the comparison result indicating that the updated encoded data size is less than the reference encoded data size variable, repeating, by the processing system, the determining of the tiling scenario according to a further-reduced tile size; and responsive to the comparison result indicating that the updated encoded data size is not less than the reference encoded data size variable, identifying, by the processing system, the tiling scenario according to the reduced tile size, further reduction of the reduced tile size increasing the updated encoded data size, wherein a portion of the immersive video content item is segmented according to the tiling scenario.

18

18. The method of claim 17 , wherein the tiling scenario comprises a coarse tiling scenario by which a video frame of the immersive video content item segmented according to the tiling scenario has a first number of associated tiles.

19

19. The method of claim 17 , wherein the portion of the immersive video content item comprises a plurality of consecutive video frames, and wherein the plurality of consecutive video frames is segmented according to the tiling scenario.

20

20. The method of claim 17 , wherein the determining of the tiling scenario comprises encoding, by the processing system, the updated group of tiles resulting in the encoded version of the updated group of tiles.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

Patent Metadata

Filing Date

September 14, 2020

Publication Date

July 19, 2022

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “System and method for segmenting immersive video” (US-11395003). https://patentable.app/patents/US-11395003

© 2026 Patentable. All rights reserved.

Patentable is a research and drafting-assistant tool, not a law firm, and does not provide legal advice. Documents we generate are drafts for review by a licensed patent attorney.