This disclosure is directed to a computer system and method to assist in identifying data feature intersection or overlap between private datasets without revealing any specific data items or data features in the datasets. Various technical components including natural language processing, lexical optimization, and encryption and key management technologies such as homomorphic encryption and secret sharing and coding, are integrated into the disclosed system and method to achieve the data feature intersection identification. Such a system and method may be employed in circumstances where data feature intersection is important for collaborative efforts between entities.
Legal claims defining the scope of protection, as filed with the USPTO.
3. The system of claim 2, wherein each of the multi-point decryption key functions comprises a random number and a first-degree polynomial function parameterized by a corresponding one of the one or more decryption key segments.
4. The system of claim 3, wherein, to generate the decryption key, the processor is configured to execute the computer instructions to compute Lagrange basis polynomials using the first partial decryption key reference and the second partial decryption key reference for each of the one or more decryption key segments.
6. The system of claim 5, wherein the additional words or phrases are identified according to the textual data features or description data items and a lexical database common to the system and the data source.
8. The system of claim 7, wherein the popularity of the words and phrases and weighted connection therebetween in the lexical database is configured to evolve via incremental learning.
9. The system of claim 5, wherein the expanded textual data features or description data items is numericized by applying a predefined phonetic algorithm.
12. The system of claim 11, wherein the comparer dataset is unencrypted prior to generating the one or more numerical matching values.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
October 12, 2021
October 15, 2024
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.