Embodiments provide for predicted data use obligation matching using data differentiators. Example embodiments retrieve a cross-link relationship graph structure from a data repository, where the cross-link relationship graph structure includes a plurality of cross-link relationship graph nodes connected by a plurality of cross-link relationship graph edges. Each cross-link relationship graph node is associated with a unique logical data record set identifier of a plurality of logical data record set identifiers associated with a dataset identifier. For each unique logical data record set identifier, the cross-link relationship graph structure is traversed. Based at least in part on a separation measure associated with each cross-link relationship of one or more cross-link relationships associated with the unique logical data record set identifier, one or more data use obligation scores for the unique logical data record set identifier is generated.
Legal claims defining the scope of protection, as filed with the USPTO.
1. A system comprising one or more processors and at least one memory storing processor executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising: receiving an intended use request indicating a request to provision access to one or more computing devices to use data, wherein the intended use request comprises (a) a logical data record set identifier and (b) an intended use identifier indicating, for a logical data record indicated by the logical data record set identifier, a purpose for which the logical data record will be used, wherein a logical data record set associated with the logical data record set identifier comprises (i) a plurality of columns and (ii) one or more column differentiators each representing a data use obligation that is (1) for data in a column of the plurality of columns and (2) based on one or more data use obligation policies; determining, for the logical data record set identifier, a data use obligation score indicating a likelihood that one or more data use restrictions apply to a data record identifier indicated by the logical data record set identifier; and responsive to determining that the likelihood is less than a risk threshold associated with the logical data record set identifier and specific to the intended use identifier, granting the intended use request.
2. The system of claim 1, wherein a data use restriction of the one or more data use restrictions comprises instructions that define control of access to or use of data records associated with the data record identifier.
3. The system of claim 1, wherein the logical data record set associated with the logical data record set identifier further comprises a plurality of rows.
4. The system of claim 3, wherein the logical data record set further comprises one or more row differentiators.
5. The system of claim 4, wherein a row differentiator of the one or more row differentiators represents a respective data use obligation for data included in a row of the plurality of rows, and wherein the respective data use obligation is based on one or more data use obligation policies.
6. The system of claim 1, wherein an intended use identified by the intended use identifier comprises one or more of offshore operations, research, internal analytics, external transfer to a third party, standardization, enrichment, consumption, or provision.
7. A computer-implemented method, comprising: receiving, by one or more processors, an intended use request indicating a request to provision access to one or more computing devices to use data, wherein the intended use request comprises (a) a logical data record set identifier and (b) an intended use identifier indicating, for a logical data record indicated by the logical data record set identifier, a purpose for which the logical data record will be used, wherein a logical data record set associated with the logical data record set identifier comprises (i) a plurality of columns and (ii) one or more column differentiators each representing a data use obligation that is (1) for data in a column of the plurality of columns and (2) based on one or more data use obligation policies; determining, by the one or more processors and for the logical data record set identifier, a data use obligation score indicating a likelihood that one or more data use restrictions apply to a data record identifier indicated by the logical data record set identifier; and responsive to determining that the likelihood is less than a risk threshold associated with the logical data record set identifier and specific to the intended use identifier, granting, by the one or more processors the intended use request.
8. The computer-implemented method of claim 7, wherein a data use restriction of the one or more data use restrictions comprises instructions that define control of access to or use of data records associated with the data record identifier.
9. The computer-implemented method of claim 7, wherein the logical data record set associated with the logical data record set identifier further comprises a plurality of rows.
10. The computer-implemented method of claim 9, wherein the logical data record set further comprises one or more row differentiators.
11. The computer-implemented method of claim 10, wherein a row differentiator of the one or more row differentiators represents a respective data use obligation for data included in a row of the plurality of rows, and wherein the respective data use obligation is based on one or more data use obligation policies.
12. The computer-implemented method of claim 7, wherein an intended use identified by the intended use identifier comprises one or more of offshore operations, research, internal analytics, external transfer to a third party, standardization, enrichment, consumption, or provision.
13. One or more non-transitory computer-readable storage media including instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising: receiving an intended use request indicating a request to provision access to one or more computing devices to use data, wherein the intended use request comprises (a) a logical data record set identifier and (b) an intended use identifier indicating, for a logical data record indicated by the logical data record set identifier, a purpose for which the logical data record will be used, wherein a logical data record set associated with the logical data record set identifier comprises (i) a plurality of columns and (ii) one or more column differentiators each representing a data use obligation that is (1) for data in a column of the plurality of columns and (2) based on one or more data use obligation policies; determining, for the logical data record set identifier, a data use obligation score indicating a likelihood that one or more data use restrictions apply to e-a data record identifier indicated by the logical data record set identifier; and responsive to determining that the likelihood is less than a risk threshold associated with the logical data record set identifier and specific to the intended use identifier, granting the intended use request.
14. The one or more non-transitory computer-readable storage media of claim 13, wherein a data use restriction of the one or more data use restrictions comprises instructions that define control of access to or use of data records associated with the more data record identifier.
15. The one or more non-transitory computer-readable storage media of claim 13, wherein the logical data record set associated with the logical data record set identifier further comprises a plurality of rows.
16. The one or more non-transitory computer-readable storage media of claim 15, wherein the logical data record set further comprises one or more row differentiators and wherein a row differentiator of the one or more row differentiators represents a respective data use obligation for data included in a row of the plurality of rows, and wherein the respective data use obligation is based on one or more data use obligation policies.
17. The one or more non-transitory computer-readable storage media of claim 13, wherein an intended use identified by the intended use identifier comprises one or more of offshore operations, research, internal analytics, external transfer to a third party, standardization, enrichment, consumption, or provision.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
May 8, 2023
April 15, 2025
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.