Methods and Devices for Extended Reality Device Training Data Creation

PublishedApril 21, 2020

Assigneenot available in USPTO data we have

InventorsHiu Lok SZETO Syed Alimul HUDA Kiever Xiang CHEN

Technical Abstract

Patent Claims

20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method for one or more processors to implement in an extended reality (XR) device including a camera, the one or more processors, at least one memory, and a display, the method comprising: acquiring, from the camera, a camera data sequence including a first image frame of a real object in a scene; tracking a pose of the real object with respect to the camera along the camera data sequence, the pose being derived based at least on the first image frame and original training data generated from at least one of (i) a synthetic image of a 3D model rendered from a predetermined view and (ii) a camera image of a reference real object captured from the view, where the 3D model and the reference real object correspond to the real object; displaying an XR object on the display by rendering the XR object based at least on the pose; setting flag data, in a memory area of the at least one memory, indicative of whether or not the displayed XR object is consistent in pose with the real object, in response to receipt of an input of a user of the XR device; storing, in another memory area of the at least one memory, second image frames in the camera data sequence acquired when the flag data indicates that the displayed XR object is consistent in pose with the real object; and outputting the stored second image frames to a separate computing device having another processor.

2. The method according to claim 1 , further comprising receiving, after outputting the stored second image frames, another training data to replace or update the original training data, the another training data being based at least in part on the output second image frames.

3. The method according to claim 1 , further comprising receiving an input from the user of the XR device through a user interface indicating that the displayed XR object is consistent in pose with the real object.

4. The method according to claim 1 , further comprising not storing the second image frames in the camera data sequence acquired when the flag data indicates that the displayed XR object is not consistent in pose with the real object.

5. The method according to claim 1 , wherein tracking a pose of the real object with respect to the camera along the camera data sequence comprises deriving the pose based on the first image frame and training data only generated from a synthetic image of the 3D model rendered from the predetermined view.

6. The method according to claim 1 , wherein the original training data includes only shape-based training data.

7. A method for one or more processors to implement in a computing device including the one or more processors and a memory storing original training data for tracking a real object using an extended reality (XR) device, the method comprising: receive image frames of the real object acquired by the XR device, the images being acquired when flag data indicated that an XR object displayed on the XR device was consistent in pose with the real object; extracting feature data of the real object from the image frames, the feature data including the tracked pose of the real object in the respective image frames; and generating another training data to replace or update the original training data, the another training data based at least in part on the extracted feature data.

8. The method according to claim 7 , further comprising outputting the another training data to the XR device.

9. The method according to claim 7 , wherein the original training data included only shape-based training data.

10. A method for one or more processors to implement in an extended reality (XR) device including a camera, the one or more processors, at least one memory, and a display, the method comprising: acquiring, from the camera, a camera data sequence including a first image frame of a real object in a scene; tracking a pose of the real object with respect to the camera along the camera data sequence, the pose being derived based at least on the first image frame and original training data generated from at least one of (i) a synthetic image of a 3D model rendered from a predetermined view and (ii) a camera image of a reference real object captured from the view, where the 3D model and the reference real object correspond to the real object; displaying an XR object on the display by rendering the XR object based at least on the pose; setting flag data, in a memory area of the at least one memory, indicative of whether or not the displayed XR object is consistent in pose with the real object, in response to receipt of an input of a user of the XR device; outputting, to a separate computing device having another processor, second image frames in the camera data sequence acquired when the flag data indicates that the displayed XR object is consistent in pose with the real object.

11. The method according to claim 10 , further comprising receiving, after outputting the second image frames, another training data to replace or update the original training data, the updated training data being based at least in part on the output second image frames.

12. The method according to claim 10 , further comprising receiving an input from the user of the XR device through a user interface indicating that displayed XR object is consistent in pose with the real object.

13. The method according to claim 10 , further comprising not outputting the second image frames in the camera data sequence acquired when the flag data indicates that the displayed XR object is not consistent in pose with the real object.

14. The method according to claim 10 , wherein tracking a pose of the real object with respect to the camera along the camera data sequence comprises deriving the pose based on the first image frame and training data only generated from a synthetic image of the 3D model rendered from a predetermined view.

15. The method according to claim 10 , wherein the original training data includes only shape-based training data.

16. A method for one or more processors to implement in an extended reality (XR) device including a camera, the one or more processors, at least one memory, and a display, the method comprising: acquiring, from the camera, a camera data sequence including a first image frame of a real object in a scene; tracking a pose of the real object with respect to the camera along the camera data sequence, the pose being derived based at least on the first image frame and original training data generated from at least one of (i) a synthetic image of a 3D model rendered from a predetermined view and (ii) a camera image of a reference real object captured from the view, where the 3D model and the reference real object correspond to the real object; displaying an XR object on the display by rendering the XR object based at least on the pose; setting flag data, in a memory area of the at least one memory, indicative of whether or not the displayed XR object is consistent in pose with the real object, in response to receipt of an input of a user of the XR device; extracting feature data of the real object from second image frames in the camera data sequence acquired when the flag data indicates that the displayed XR object is consistent in pose with the real object, the feature data including the tracked pose of the real object in the respective second image frames; and generating another training data to replace or update the original training data, the another training data based at least in part on the extracted feature data.

17. The method according to claim 16 , further comprising receiving an input from the user of the XR device through a user interface indicating that displayed XR object is consistent in pose with the real object.

18. The method according to claim 16 , further comprising not extracting feature data of the real object from the second image frames in the camera data sequence when the flag data indicates that the displayed XR object is not consistent in pose with the real object.

19. The method according to claim 16 , wherein tracking a pose of the real object with respect to the camera along the camera data sequence comprises deriving the pose based on the first image frame and training data only generated from a synthetic image of the 3D model rendered from a predetermined view.

20. The method according to claim 16 , wherein the original training data includes only shape-based training data.

Patent Metadata

Filing Date

Unknown

Publication Date

April 21, 2020

Inventors

Hiu Lok SZETO

Syed Alimul HUDA

Kiever Xiang CHEN

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search