US-10769518

Method of controlling for undesired factors in machine learning models

PublishedSeptember 8, 2020

Assigneenot available in USPTO data we have

Inventorsnot available in USPTO data we have

Technical Abstract

A method of training and using a machine learning model that controls for consideration of undesired factors which might otherwise be considered by the trained model during its subsequent analyses of new data. For example, the model may be a neural network trained on a set of training images to evaluate an insurance applicant based upon an image or audio data of the insurance applicant as part of an underwriting process to determine an appropriate life or health insurance premium. The model is trained to probabilistically correlate an aspect of the applicant's appearance with a personal and/or health-related characteristic. Any undesired factors, such as age, sex, ethnicity, and/or race, are identified for exclusion. The trained model receives the image (e.g., a “selfie”) of the insurance applicant, analyzes the image without considering the identified undesired factors, and suggests the appropriate insurance premium based only on the remaining desired factors.

Patent Claims

18 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A computer-implemented method of training and using a machine learning model that controls for consideration of one or more undesired factors which might otherwise be considered by the machine learning model when analyzing new data, the method comprising, via one or more processors: training the machine learning model using a training data set that contains information including the one or more undesired factors; identifying the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors; and causing the trained machine learning model to not consider the identified one or more undesired factors when analyzing the new data to control for undesired prejudice or discrimination in machine learning models, wherein identifying the one or more undesired factors and the one or more relevant interaction terms between the one or more undesired factors includes training a second machine learning model using a second training data set that contains only the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors.

2. The computer-implemented method as set forth in claim 1 , wherein causing the machine learning model to not consider the identified one or more undesired factors when analyzing the new data includes combining the machine learning model and the second machine learning model to eliminate a bias created by the one or more undesired factors from the machine learning model's consideration prior to employing the machine learning model to analyze the new data.

3. The computer-implemented method as set forth in claim 1 , wherein identifying the one or more undesired factors and the one or more relevant interaction terms between the one or more undesired factors includes training the machine learning model to identify the one or more undesired factors and the one or more relevant interaction terms between the one or more undesired factors.

4. The computer-implemented method as set forth in claim 3 , wherein causing the machine learning model to not consider the identified one or more undesired factors when analyzing the new data includes instructing the machine learning model to not consider the identified one or more undesired factors while analyzing the new data.

5. The computer-implemented method as set forth in claim 1 , wherein the machine learning model is a neural network.

6. The computer-implemented method as set forth in claim 1 , wherein the second machine learning model is a linear model.

7. The computer-implemented method as set forth in claim 1 , wherein the machine learning model is trained to analyze the new data as part of an underwriting process to determine an appropriate insurance premium.

8. The computer-implemented method as set forth in claim 7 , wherein the new data includes a still image or a video of a person applying for life insurance or health insurance.

9. The computer-implemented method as set forth in claim 7 , wherein the new data includes an image of a piece of property for which a person is applying for property insurance.

10. The computer-implemented method as set forth in claim 7 , wherein the machine learning model is further trained to analyze the new data as part of the underwriting process to determine one or more appropriate terms of coverage.

11. A computer system configured to train and use a machine learning model that controls for consideration of one or more undesired factors which might otherwise be considered by the machine learning model when analyzing new data, the computer system comprising one or more processors configured to: train the machine learning model using a training data set that contains information including the one or more undesired factors; identify the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors; and cause the trained machine learning model to not consider the identified one or more undesired factors when analyzing the new data to control for undesired prejudice or discrimination in machine learning models, wherein identifying the one or more undesired factors and the one or more relevant interaction terms between the one or more undesired factors includes the one or more processors training a second machine learning model using a second training data set that contains only the one or more undesired factors and one or more relevant interaction terms between the one or more undesired factors.

12. The computer system as set forth in claim 11 , wherein causing the machine learning model to not consider the identified one or more undesired factors when analyzing the new data includes the one or more processors combining the machine learning model and the second machine learning model to eliminate a bias created by the one or more undesired factors from the machine learning model's consideration prior to employing the machine learning model to analyze the new data.

13. The computer system as set forth in claim 11 , wherein identifying the one or more undesired factors and the one or more relevant interaction terms between the one or more undesired factors includes the one or more processors training the machine learning model to identify the one or more undesired factors and the one or more relevant interaction terms between the one or more undesired factors.

14. The computer system as set forth in claim 11 , wherein causing the machine learning model to not consider the identified one or more undesired factors when analyzing the new data includes the one or more processors instructing the machine learning model to not consider the identified one or more undesired factors while analyzing the new data.

15. The computer system as set forth in claim 11 , wherein the machine learning model is a neural network.

16. The computer system as set forth in claim 11 , wherein the second machine learning model is a linear model.

17. The computer system as set forth in claim 11 , wherein the machine learning model is trained to analyze the new data as part of an underwriting process to determine an appropriate insurance premium.

18. The computer system as set forth in claim 11 , wherein the new data includes images of a person applying for life insurance or health insurance.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

G06Q G06N G06V H04N

Patent Metadata

Filing Date

December 19, 2016

Publication Date

September 8, 2020

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search