Legal claims defining the scope of protection, as filed with the USPTO.
1. A method comprising: receiving, at a device, a request for training data that is based on application data generated by an application executed at a data collection node, wherein the application data is associated with a plurality of metadata identifiers; determining, by the device, one or more training data constraints that restrict use of the application data as training data; generating, by the device, the training data in part by excluding application data of a particular type from being included in the training data based on a match between a metadata identifier of the plurality of metadata identifiers and the one or more training data constraints; and providing, by the device, the training data to be used to train a machine learning model.
2. The method as in claim 1, wherein the excluding the application data of the particular type from being included in the training data includes using the application data of the particular type to generate anonymized training data that is included in the training data.
3. The method as in claim 1, wherein the excluding the application data of the particular type from being included in the training data includes using the application data of the particular type to generate synthetic training data that is included in the training data.
4. The method as in claim 1, wherein the one or more training data constraints are based on a data manifest bonded to the application that includes a listing of the plurality of metadata identifiers.
5. The method as in claim 1, wherein the request includes an identifier of the application.
6. The method as in claim 5, wherein the one or more training data constraints are based on the identifier of the application.
7. The method as in claim 1, wherein the one or more training data constraints are based on a location of the data collection node.
8. The method as in claim 1, wherein the one or more training data constraints are based on a location of a receiver of the training data.
9. The method as in claim 1, further comprising: obtaining, by the device, consent from one or more users for application data associated with the one or more users to be included in the training data.
10. The method as in claim 1, wherein the device is the data collection node.
11. An apparatus, comprising: one or more network interfaces; a processor coupled to the one or more network interfaces and configured to execute one or more processes; and a memory configured to store a process that is executable by the processor, the process when executed configured to: receive a request for training data that is based on application data generated by an application executed at a data collection node, wherein the application data is associated with a plurality of metadata identifiers; determine one or more training data constraints that restrict use of the application data as training data; generate the training data in part by excluding application data of a particular type from being included in the training data based on a match between a metadata identifier of the plurality of metadata identifiers and the one or more training data constraints; and provide the training data to be used to train a machine learning model.
12. The apparatus as in claim 11, wherein the application data of the particular type is excluded from being included in the training data by using the application data of the particular type to generate anonymized training data that is included in the training data.
13. The apparatus as in claim 11, wherein the application data of the particular type is excluded from being included in the training data by using the application data of the particular type to generate synthetic training data that is included in the training data.
14. The apparatus as in claim 11, wherein the one or more training data constraints are based on a data manifest bonded to the application that includes a listing of the plurality of metadata identifiers.
15. The apparatus as in claim 11, wherein the request includes an identifier of the application.
16. The apparatus as in claim 15, wherein the one or more training data constraints are based on the identifier of the application.
17. The apparatus as in claim 11, wherein the one or more training data constraints are based on a location of the data collection node.
18. The apparatus as in claim 11, wherein the one or more training data constraints are based on a location of a receiver of the training data.
19. The apparatus as in claim 11, wherein the process when executed is further configured to: consent from one or more users for application data associated with the one or more users to be included in the training data.
20. A tangible, non-transitory, computer-readable medium storing program instructions that cause a device to execute a process comprising: receiving, at the device, a request for training data that is based on application data generated by an application executed at a data collection node, wherein the application data is associated with a plurality of metadata identifiers; determining, by the device, one or more training data constraints that restrict use of the application data as training data; generating, by the device, the training data in part by excluding application data of a particular type from being included in the training data based on a match between a metadata identifier of the plurality of metadata identifiers and the one or more training data constraints; and providing, by the device, the training data to be used to train a machine learning model.
Unknown
September 16, 2025
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.