Method and System for 3d Reconstruction of X-Ray CT Volume and Segmentation Mask from a Few X-Ray Radiographs

PublishedJuly 14, 2020

Assigneenot available in USPTO data we have

InventorsShaohua Kevin Zhou Sri Venkata Anirudh Nanduri Jin-hyeong Park Haofu Liao

Technical Abstract

Patent Claims

20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method for automated reconstruction of a 3D computed tomography (CT) volume one or more X-ray images, comprising: generating a sparse 3D volume from one or more X-ray images of a patient; and generating a final reconstructed 3D CT volume from the sparse 3D volume using a trained deep neural network.

2. The method of claim 1 , wherein the one or more X-ray images of the patient comprise a first x-ray image and a second x-ray image, and generating the sparse 3D volume from the one or more X-ray images of the patient comprises: generating the sparse 3D volume from the first X-ray image and the second X-ray image using a tomographic reconstruction algorithm.

3. The method of claim 2 , wherein the one or more x-ray images of the patient comprise only the first and second x-ray images, and generating the sparse 3D volume from the first X-ray image and the second X-ray image using a tomographic reconstruction algorithm comprises: generating the sparse 3D volume from the first X-ray image and the second X-ray image without any additional x-ray images using a tomographic reconstruction algorithm.

4. The method of claim 1 , further comprising: generating a 3D segmentation mask of a target object from the sparse 3D volume using the trained deep neural network.

5. The method of claim 4 , wherein the trained deep neural network is a multi-output deep image-to-image network having encoder layers that code the sparse 3D volume into a code whose size is smaller than the spare 3D volume and decoder layers that decode the code into the final reconstructed 3D volume and the 3D segmentation mask of the target object.

6. The method of claim 1 , wherein the trained deep neural network is a deep image-to-image network that is trained in a generative adversarial network together with a discriminator network for distinguishing between synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and real reconstructed 3D CT volume training samples.

7. The method of claim 1 , wherein the trained deep neural network is a deep image-to-image network that is trained in a conditional-generative adversarial network together with a discriminator network for distinguishing between synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and real reconstructed 3D CT volume training samples, conditioned on the input sparse 3D volume training samples.

8. The method of claim 7 , wherein the conditional-generative adversarial network is integrated with a voxel-wise cost function that computes a voxel-wise error between the synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and corresponding ground-truth reconstructed 3D CT volume training samples, and the deep image-to-image network and the discriminator network are trained together to optimize, over a plurality of training samples, a minimax objective function that includes a first term that calculates an error using the voxel-wise cost function, a second term that calculates an error of the discriminator network classifying the real reconstructed 3D CT training samples, and a third term that calculates and error of the discriminator network classifying the synthesized reconstructed 3D CT volumes generated by the deep image-to-image network.

9. An apparatus for automated reconstruction of a 3D computed tomography (CT) volume one or more X-ray images, comprising: means for generating a sparse 3D volume from one or more X-ray images of a patient; and means for generating a final reconstructed 3D CT volume from the sparse 3D volume using a trained deep neural network.

10. The apparatus of claim 9 , further comprising: means for generating a 3D segmentation mask of a target object from the sparse 3D volume using the trained deep neural network.

11. The apparatus of claim 9 , wherein the trained deep neural network is a deep image-to-image network that is trained in a generative adversarial network together with a discriminator network for distinguishing between synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and real reconstructed 3D CT volume training samples.

12. The apparatus of claim 9 , wherein the trained deep neural network is a deep image-to-image network that is trained in a conditional-generative adversarial network together with a discriminator network for distinguishing between synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and real reconstructed 3D CT volume training samples, conditioned on the input sparse 3D volume training samples.

13. The apparatus of claim 12 , wherein the conditional-generative adversarial network is integrated with a voxel-wise cost function that computes a voxel-wise error between the synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and corresponding ground-truth reconstructed 3D CT volume training samples, and the deep image-to-image network and the discriminator network are trained together to optimize, over a plurality of training samples, a minimax objective function that includes a first term that calculates an error using the voxel-wise cost function, a second term that calculates an error of the discriminator network classifying the real reconstructed 3D CT training samples, and a third term that calculates and error of the discriminator network classifying the synthesized reconstructed 3D CT volumes generated by the deep image-to-image network.

14. A non-transitory computer-readable medium storing computer program instructions for automated reconstruction of a 3D computed tomography (CT) volume one or more X-ray images, the computer program instructions when executed by a processor cause the processor to perform operations comprising: generating a sparse 3D volume from one or more X-ray images of a patient; and generating a final reconstructed 3D CT volume from the sparse 3D volume using a trained deep neural network.

15. The non-transitory computer-readable medium of claim 14 , wherein the one or more X-ray images of the patient comprise a first x-ray image and a second x-ray image, and generating the sparse 3D volume from the one or more X-ray images of the patient comprises: generating the sparse 3D volume from the first X-ray image and the second X-ray image using a tomographic reconstruction algorithm.

16. The non-transitory computer-readable medium of claim 14 , wherein the operations further comprise: generating a 3D segmentation mask of a target object from the sparse 3D volume using the trained deep neural network.

17. The non-transitory computer-readable medium of claim 16 , wherein the trained deep neural network is a multi-output deep image-to-image network having encoder layers that code the sparse 3D volume into a code whose size is smaller than the spare 3D volume and decoder layers that decode the code into the final reconstructed 3D volume and the 3D segmentation mask of the target object.

18. The non-transitory computer-readable medium of claim 14 , wherein the trained deep neural network is a deep image-to-image network that is trained in a generative adversarial network together with a discriminator network for distinguishing between synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and real reconstructed 3D CT volume training samples.

19. The non-transitory computer-readable medium of claim 14 , wherein the trained deep neural network is a deep image-to-image network that is trained in a conditional-generative adversarial network together with a discriminator network for distinguishing between synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and real reconstructed 3D CT volume training samples, conditioned on the input sparse 3D volume training samples.

20. The non-transitory computer-readable medium of claim 19 , wherein the conditional-generative adversarial network is integrated with a voxel-wise cost function that computes a voxel-wise error between the synthesized reconstructed 3D CT volumes generated by the deep image-to-image network from input sparse 3D volume training samples and corresponding ground-truth reconstructed 3D CT volume training samples, and the deep image-to-image network and the discriminator network are trained together to optimize, over a plurality of training samples, a minimax objective function that includes a first term that calculates an error using the voxel-wise cost function, a second term that calculates an error of the discriminator network classifying the real reconstructed 3D CT training samples, and a third term that calculates and error of the discriminator network classifying the synthesized reconstructed 3D CT volumes generated by the deep image-to-image network.

Patent Metadata

Filing Date

Unknown

Publication Date

July 14, 2020

Inventors

Shaohua Kevin Zhou

Sri Venkata Anirudh Nanduri

Jin-hyeong Park

Haofu Liao

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search