US-11264126

Predictive data analysis using image representations of categorical and scalar feature data

PublishedMarch 1, 2022

Assigneenot available in USPTO data we have

Inventorsnot available in USPTO data we have

Technical Abstract

There is a need for more effective and efficient predictive data analysis solutions and/or more effective and efficient solutions for generating image representations of categorical/scalar data. Various embodiments of the present invention address one or more of the noted technical challenges. In one example, a method comprises receiving the one or more categorical input features; generating an image representation of the one or more categorical input features, wherein the image representation comprises image region values each associated with a categorical input feature, and further wherein each image region value of the one or more image region values is determined based at least in part on the corresponding categorical input feature associated with the image region value; and processing the image representation using an image-based machine learning model to generate the image-based predictions.

Patent Claims

19 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A computer-implemented method for generating an image-based prediction based at least in part on one or more categorical input feature values, the computer-implemented method comprising: receiving the one or more categorical input feature values, wherein each categorical input feature value is associated with a categorical input feature type of one or more categorical input feature types; generating a raw image representation of the one or more categorical input feature values, wherein (i) the raw image representation is associated with one or more raw image region values, (ii) each raw image value is associated with a categorical input feature value of the one or more categorical input feature values, (iii) each raw image region value of the one or more raw image region values is determined based at least in part on the corresponding categorical input feature type associated with the raw image region value, (iv) at least one raw image region value of the one or more image region values is configured to depict a visual representation of textual data associated with the categorical input feature value that is associated with the raw image region value; determining, based at least in part on the raw image representation, one or more raw image region values each associated with a character region of a plurality of character regions within the raw image region; determining, for each character region of the plurality of character regions, a character region scalar value and a character region location within the raw image representation; generating based at least in part on the raw image representation, an image representation of the one or more categorical input feature values to comprise, for each character region of the plurality of character regions, a scalar visual representation of the region scalar value for the character region in the character region location for the character region, wherein (i) the image representation comprises a plurality of pixels, (ii) the image representation is divided into a plurality of image regions each comprising an image region subset of the plurality of pixels, (iii) each image region is associated with an image region value of a plurality of image region values that describes pixel values for the image region pixel subset that is associated with the image region, (iv) each image region of the plurality of image regions is associated with a categorical input feature type of the one or more categorical input feature types, and (v) each image region value is generated in a manner that is configured to represent a categorical input feature value for the corresponding categorical input feature type that is associated with the image region of the image region value; and processing the image representation using an image-based machine learning model to generate an image-based prediction.

2. The computer-implemented method of claim 1 , wherein: the one or more categorical input feature values comprise one or more patient features associated with a patient, and the image-based prediction is a health prediction for the patient.

3. The computer-implemented method of claim 1 , wherein the image region value of a plurality of image region values is configured to depict a visual representation of textual data associated with the corresponding categorical input feature type that is associated with the image region of the image region value.

4. The computer-implemented method of claim 1 , wherein generating the image representation, based at least in part on the raw image representation, further comprises: determining, for each categorical input feature, a corresponding coordinate grouping of a plurality of coordinate groupings; and generating, for each coordinate grouping of the plurality of coordinate groupings, a coordinate channel; and determining the image representation based on each coordinate channel.

5. The computer-implemented method of claim 1 , wherein generating the image representation, based at least in part on the raw image representation, further comprises: identifying a plurality of character patterns; generating, for each character pattern of the plurality of character pattern, a feature-based channel of a plurality of feature-based channels, wherein: (i) each feature-based channel comprises one or more feature-based channel region values, and (ii) each feature-based channel region value for a corresponding feature-based channel is associated with the corresponding categorical input feature type, and (iii) each feature-based channel region value for a corresponding feature-based channel is determined based at least in part on whether the corresponding categorical input feature type for the feature-based channel region value comprises the corresponding character pattern associated with the corresponding feature-based channel; and generating the image representation based at least in part on each corresponding feature-based channel of the plurality of coordinate channels.

6. The computer-implemented method of claim 1 , wherein generating the image representation, based at least in part on the raw image representation, further comprises: generating, based at least in part on the one or more categorical input feature values, one or more coordinate channels and one or more feature-based channels; and merging the one or more coordinate channels and the one or more feature-based channels to generate the image representation.

7. The computer-implemented method of claim 1 , wherein: each categorical input feature value of the one or more categorical input feature values is associated with a text formatting pattern, at least one image region value of the one or more image region values is configured to depict a visual representation of textual data associated with the categorical input feature value that is associated with the image region value, and the textual data associated with at least one image region value of the one or more image region values is determined based at least in part on the text formatting pattern for the categorical input feature value that is associated with the image region value.

8. The computer-implemented method of claim 1 , wherein the scalar visual representation for at least one character region of the plurality of character regions is a grayscale visual representation of a character depicted by the character region.

9. The computer-implemented method of claim 1 , wherein the image-based machine learning model comprises a convolutional neural network (CNN).

10. The computer-implemented method of claim 1 , further comprising automatically scheduling one or more medical visit appointments based at least in part on the image-based prediction.

11. An apparatus for generating an image-based prediction based at least in part on one or more categorical input feature values, the apparatus comprising at least one processor and at least one memory including a computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to: receive the one or more categorical input feature values, wherein each categorical input feature value is associated with a categorical input feature type of one or more categorical input feature types; generate a raw image representation of the one or more categorical input feature values, wherein (i) the raw image representation is associated with one or more raw image region values, (ii) each raw image value is associated with a categorical input feature value of the one or more categorical input feature values, (iii) each raw image region value of the one or more raw image region values is determined based at least in part on the corresponding categorical input feature type associated with the raw image region value, (iv) at least one raw image region value of the one or more image region values is configured to depict a visual representation of textual data associated with the categorical input feature value that is associated with the raw image region value; determine, based at least in part on the raw image representation, one or more raw image region values each associated with a character region of a plurality of character regions within the raw image region; determine, for each character region of the plurality of character regions, a character region scalar value and a character region location within the raw image representation; generate, based at least in part on the raw image representation, an image representation of the one or more categorical input feature values to comprise, for each character region of the plurality of character regions, a scalar visual representation of the region scalar value for the character region in the character region location for the character region, —wherein (i) the image representation comprises a plurality of pixels, (ii) the image representation is divided into a plurality of image regions each comprising an image region subset of the plurality of pixels, (iii) each image region is associated with an image region value of a plurality of image region values that describes pixel values for the image region pixel subset that is associated with the image region, (iv) each image region of the plurality of image regions is associated with a categorical input feature type of the one or more categorical input feature types, and (v) each image region value is generated in a manner that is configured to represent a categorical input feature value for the corresponding categorical input feature type that is associated with the image region of the image region value; and process the image representation using an image-based machine learning model to generate an image-based prediction.

12. The apparatus of claim 11 , wherein: the one or more categorical input feature values comprise one or more patient features associated with a patient, and the image-based prediction is a health prediction for the patient.

13. The apparatus of claim 12 , wherein the image-based machine learning model comprises a convolutional neural network (CNN).

14. The apparatus of claim 11 , wherein the image region value of a plurality of image region values is configured to depict a visual representation of textual data associated with the corresponding categorical input feature type that is associated with the image region of the image region value.

15. The apparatus of claim 11 , wherein generating the image representation, based at least in part on the raw image representation, further comprises: determining, for each categorical input feature, a corresponding coordinate grouping of a plurality of coordinate groupings; and generating, for each coordinate grouping of the plurality of coordinate groupings, a coordinate channel; and determining the image representation based on each coordinate channel.

16. The apparatus of claim 11 , wherein generating the image representation, based at least in part on the raw image representation, further comprises: identifying a plurality of character patterns; generating, for each character pattern of the plurality of character pattern, a feature-based channel of a plurality of feature-based channels, wherein: (i) each feature-based channel comprises one or more feature-based channel region values, and (ii) each feature-based channel region value for a corresponding feature-based channel is associated with the corresponding categorical input feature type, and (iii) each feature-based channel region value for a corresponding feature-based channel is determined based at least in part on whether the corresponding categorical input feature type for the feature-based channel region value comprises the corresponding character pattern associated with the corresponding feature-based channel; and generating the image representation based at least in part on each corresponding feature-based channel of the plurality of coordinate channels.

17. The apparatus of claim 11 , wherein generating the image representation, based at least in part on the raw image representation, further comprises: generating, based at least in part on the one or more categorical input feature values, one or more coordinate channels and one or more feature-based channels; and merging the one or more coordinate channels and the one or more feature-based channels to generate the image representation.

18. The apparatus of claim 11 , wherein: each categorical input feature value of the one or more categorical input feature values is associated with a text formatting pattern, at least one image region value of the one or more image region values is configured to depict a visual representation of textual data associated with the categorical input feature value that is associated with the image region value, and the textual data associated with at least one image region value of the one or more image region values is determined based at least in part on the text formatting pattern for the categorical input feature value that is associated with the image region value.

19. A non-transitory computer storage medium comprising instructions for generating an image-based prediction based at least in part on one or more categorical input feature values, the instructions being configured to cause one or more processors to at least perform operations configured to: receive the one or more categorical input feature values, wherein each categorical input feature value is associated with a categorical input feature type of one or more categorical input feature types; generate a raw image representation of the one or more categorical input feature values, wherein (i) the raw image representation is associated with one or more raw image region values, (ii) each raw image value is associated with a categorical input feature value of the one or more categorical input feature values, (iii) each raw image region value of the one or more raw image region values is determined based at least in part on the corresponding categorical input feature type associated with the raw image region value, (iv) at least one raw image region value of the one or more image region values is configured to depict a visual representation of textual data associated with the categorical input feature value that is associated with the raw image region value; determine, based at least in part on the raw image representation, one or more raw image region values each associated with a character region of a plurality of character regions within the raw image region; determine, for each character region of the plurality of character regions, a character region scalar value and a character region location within the raw image representation; generate, based at least in part on the raw image representation, an image representation of the one or more categorical input feature values to comprise, for each character region of the plurality of character regions, a scalar visual representation of the region scalar value for the character region in the character region location for the character region, wherein (i) the image representation comprises a plurality of pixels, (ii) the image representation is divided into a plurality of image regions each comprising an image region subset of the plurality of pixels, (iii) each image region is associated with an image region value of a plurality of image region values that describes pixel values for the image region pixel subset that is associated with the image region, (iv) each image region of the plurality of image regions is associated with a categorical input feature type of the one or more categorical input feature types, and (v) each image region value is generated in a manner that is configured to represent a categorical input feature value for the corresponding categorical input feature type that is associated with the image region of the image region value; and process the image representation using an image-based machine learning model to generate an image-based prediction.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

G16H G06N G06T

Patent Metadata

Filing Date

October 31, 2019

Publication Date

March 1, 2022

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search