Legal claims defining the scope of protection, as filed with the USPTO.
1. A method comprising: obtaining, by a processing system including at least one processor, a request to add at least a first feature to a feature graph database, wherein the feature graph database represents one or more data tables, where each of the one or more data tables comprises a plurality of rows and a plurality of columns, wherein the first feature comprises a first data column, wherein the first data column comprises a vector of values, wherein the request comprises a first feature ontology of the first feature, wherein the first feature ontology comprises: a label of the first feature and a relationship of the first feature to another feature, wherein the feature graph database comprises: a plurality of objects, each of the plurality of objects associated with a feature of a plurality of features, wherein each of the plurality of features comprises a column of a data table of the one or more data tables; and a plurality of relationships between the plurality of objects, and wherein the obtaining includes obtaining lineage information of the first feature, wherein the lineage information identifies a derivation of the first feature from another feature; identifying, by the processing system, whether the first feature is a duplicate of a second feature in the feature graph database based at least upon the first feature ontology and a second feature ontology of the second feature, wherein the identifying comprises identifying whether the second feature ontology is similar to the first feature ontology according to a similarity definition, wherein the identifying is further based on fingerprint information of the first feature, wherein the fingerprint information comprises a first vector including a plurality of statistical metrics of underlying data records of the first feature, wherein the identifying further comprises calculating a distance metric between the first vector and a second vector including a plurality of statistical metrics associated with the second feature, wherein the fingerprint information further comprises: timing information of the underlying data records of the first feature, wherein the timing information comprises respective time intervals represented by the underlying data records of the first feature, and wherein the identifying is further based on the lineage information; and generating, by the processing system, an indication of whether the first feature is a duplicate in response to the identifying.
2. The method of claim 1, wherein the obtaining includes obtaining the fingerprint information of the first feature.
3. The method of claim 1, further comprising: generating the fingerprint information of the first feature.
4. The method of claim 1, wherein the identifying comprises: identifying that the first feature and the second feature have a same feature label.
5. The method of claim 4, wherein the identifying comprises: identifying that the second feature ontology is the same as the first feature ontology, wherein the indication comprises an indication that the first feature is the same as the second feature.
6. The method of claim 5, wherein the identifying further comprises: determining whether the fingerprint information of the first feature matches fingerprint information of the second feature in accordance with the distance metric.
7. The method of claim 6, wherein the indication comprises an indication that the first feature is the same as an existing feature, when it is determined that the fingerprint information of the first feature matches the fingerprint information of the second feature.
8. The method of claim 6, wherein the indication comprises an indication that the first feature is different from an existing feature, when it is determined that the fingerprint information of the first feature does not match the fingerprint information of the second feature.
9. The method of claim 4, wherein the identifying comprises: identifying that the second feature ontology is different from the first feature ontology, wherein the indication comprises an indication that the first feature is different from the second feature.
10. The method of claim 1, wherein the identifying comprises: identifying that the second feature ontology is similar to the first feature ontology.
11. A non-transitory computer-readable medium storing instructions which, when executed by a processing system including at least one processor, cause the processing system to perform operations, the operations comprising: obtaining a request to add at least a first feature to a feature graph database, wherein the feature graph database represents one or more data tables, where each of the one or more data tables comprises a plurality of rows and a plurality of columns, wherein the first feature comprises a first data column, wherein the first data column comprises a vector of values, wherein the request comprises a first feature ontology of the first feature, wherein the first feature ontology comprises: a label of the first feature and a relationship of the first feature to another feature, wherein the feature graph database comprises: a plurality of objects, each of the plurality of objects associated with a feature of a plurality of features, wherein each of the plurality of features comprises a column of a data table of the one or more data tables; and a plurality of relationships between the plurality of objects, and wherein the obtaining includes obtaining lineage information of the first feature, wherein the lineage information identifies a derivation of the first feature from another feature; identifying whether the first feature is a duplicate of a second feature in the feature graph database based at least upon the first feature ontology and a second feature ontology of the second feature, wherein the identifying comprises identifying whether the second feature ontology is similar to the first feature ontology according to a similarity definition, wherein the identifying is further based on fingerprint information of the first feature, wherein the fingerprint information comprises a first vector including a plurality of statistical metrics of underlying data records of the first feature, wherein the identifying further comprises calculating a distance metric between the first vector and a second vector including a plurality of statistical metrics associated with the second feature, wherein the fingerprint information further comprises: timing information of the underlying data records of the first feature, wherein the timing information comprises respective time intervals represented by the underlying data records of the first feature, and wherein the identifying is further based on the lineage information; and generating an indication of whether the first feature is a duplicate in response to the identifying.
12. The non-transitory computer-readable medium of claim 11, wherein the obtaining includes obtaining the fingerprint information of the first feature.
13. A device comprising: a processing system including at least one processor; and a computer-readable medium storing instructions which, when executed by the processing system, cause the processing system to perform operations, the operations comprising: obtaining a request to add at least a first feature to a feature graph database, wherein the feature graph database represents one or more data tables, where each of the one or more data tables comprises a plurality of rows and a plurality of columns, wherein the first feature comprises a first data column, wherein the first data column comprises a vector of values, wherein the request comprises a first feature ontology of the first feature, wherein the first feature ontology comprises: a label of the first feature and a relationship of the first feature to another feature, wherein the feature graph database comprises: a plurality of objects, each of the plurality of objects associated with a feature of a plurality of features, wherein each of the plurality of features comprises a column of a data table of the one or more data tables; and a plurality of relationships between the plurality of objects, and wherein the obtaining includes obtaining lineage information of the first feature, wherein the lineage information identifies a derivation of the first feature from another feature; identifying whether the first feature is a duplicate of a second feature in the feature graph database based at least upon the first feature ontology and a second feature ontology of the second feature, wherein the identifying comprises identifying whether the second feature ontology is similar to the first feature ontology according to a similarity definition, wherein the identifying is further based on fingerprint information of the first feature, wherein the fingerprint information comprises a first vector including a plurality of statistical metrics of underlying data records of the first feature, wherein the identifying further comprises calculating a distance metric between the first vector and a second vector including a plurality of statistical metrics associated with the second feature, wherein the fingerprint information further comprises: timing information of the underlying data records of the first feature, wherein the timing information comprises respective time intervals represented by the underlying data records of the first feature, and wherein the identifying is further based on the lineage information; and generating an indication of whether the first feature is a duplicate in response to the identifying.
14. The device of claim 13, wherein the obtaining includes obtaining the fingerprint information of the first feature.
15. The device of claim 13, wherein the operations further comprise: generating the fingerprint information of the first feature.
16. The device of claim 13, wherein the identifying comprises: identifying that the first feature and the second feature have a same feature label.
17. The device of claim 16, wherein the identifying comprises: identifying that the second feature ontology is the same as the first feature ontology, wherein the indication comprises an indication that the first feature is the same as the second feature.
18. The device of claim 16, wherein the identifying comprises: identifying that the second feature ontology is different from the first feature ontology, wherein the indication comprises an indication that the first feature is different from the second feature.
19. The device of claim 13, wherein the identifying comprises: identifying that the second feature ontology is similar to the first feature ontology.
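Illustrative sketch (not part of the claims as filed). The duplicate check described in claims 1 and 4 through 9 can be pictured roughly as follows. Every name here (`Ontology`, `Feature`, `check_duplicate`, `threshold`) is hypothetical, as are the choice of Euclidean distance and the default threshold of 0.1; the claims only require "a distance metric" and a "similarity definition" without fixing either. Lineage and timing information are left out because the claims do not say how they are weighed.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Ontology:
    """Hypothetical feature ontology: a label plus relationships to other features."""
    label: str
    relationships: frozenset  # e.g. {("derived_from", "usage")}


@dataclass(frozen=True)
class Feature:
    """Hypothetical feature record: its ontology and a fingerprint vector."""
    ontology: Ontology
    fingerprint: tuple  # statistical metrics of the column, e.g. (mean, std, null rate)


def fingerprint_distance(a: Feature, b: Feature) -> float:
    # Euclidean distance is an assumption; any distance metric would fit the claim language.
    return math.dist(a.fingerprint, b.fingerprint)


def check_duplicate(new: Feature, existing: Feature, threshold: float = 0.1) -> str:
    """Return "same" or "different" for a new feature against one existing feature."""
    # Claim 4: the two features must share a feature label.
    if new.ontology.label != existing.ontology.label:
        return "different"
    # Claim 9: same label but different ontology means a different feature.
    if new.ontology != existing.ontology:
        return "different"
    # Claims 6 to 8: same ontology, so compare fingerprints with the distance metric.
    if fingerprint_distance(new, existing) <= threshold:
        return "same"
    return "different"


if __name__ == "__main__":
    onto = Ontology("avg_usage", frozenset({("derived_from", "usage")}))
    stored = Feature(onto, (10.0, 2.0, 0.50))
    candidate = Feature(onto, (10.0, 2.0, 0.55))
    print(check_duplicate(candidate, stored))
```

In this sketch the ontology comparison is exact equality; the "similar" case of claims 10 and 19 would need a looser comparison, which the claims leave to the similarity definition.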
September 30, 2025