{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-11243919","patent":{"patent_number":"US-11243919","title":"Preparing high-quality data repositories sets utilizing heuristic data analysis","assignee":null,"inventors":[],"filing_date":"2015-10-16T00:00:00.000Z","publication_date":"2022-02-08T00:00:00.000Z","cpc_codes":["G06F","G06F","G06F"],"num_claims":14,"abstract":"A mechanism is provide for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested thereby forming a set of unprepared data. The set of unprepared data is transformed based on a set of functions into a set of transformed data. A set of semantic text descriptions that detail the transformation of the set of unprepared data to the set of transformed data is generated using a first set of semantic associations, a second set of semantic associations, and a set of semantic transformation associations. The set of transformed data is tested against one or more governance policies that tracks data lineage to ultimately show that prepared data is in compliance. Responsive to the set of transformed data adhering to the one or more governance policies, a high-quality data repository is automatically built using the transformed data."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Preparing high-quality data repositories sets utilizing heuristic data analysis","description":"A mechanism is provide for preparing a high-quality data repository. Data and related metadata from a set of data sources are ingested thereby forming a set of unprepared data. The set of unprepared d","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-11243919","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-11243919","citation_suggestion":"Patentable. \"Preparing high-quality data repositories sets utilizing heuristic data analysis\" (US-11243919). https://patentable.app/patents/US-11243919","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-11243919","json":"https://patentable.app/api/llm-context/US-11243919","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-31T16:50:32.415Z"}