{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-11481672","patent":{"patent_number":"US-11481672","title":"Machine learning system and apparatus for sampling labelled data","assignee":null,"inventors":[],"filing_date":"2019-05-28T00:00:00.000Z","publication_date":"2022-10-25T00:00:00.000Z","cpc_codes":["G06N","G06F","G06F","G06N"],"num_claims":15,"abstract":"A database including various datasets and metadata associated with each respective dataset is provided. These datasets were used to train predictive models. The database stores a performance value associated with the model trained with each dataset. When provided with a new dataset, a server can determine various metadata for the new dataset. Using the metadata, the server can search the database and retrieve datasets which have similar metadata values. The server can narrow the search based on the performance value associated with the dataset. Based on the retrieved datasets, the server can recommend at least one sampling technique. The sampling technique can be determined based on the one or more sampling techniques that were used in association with the retrieved datasets."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Machine learning system and apparatus for sampling labelled data","description":"A database including various datasets and metadata associated with each respective dataset is provided. These datasets were used to train predictive models. The database stores a performance value ass","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-11481672","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-11481672","citation_suggestion":"Patentable. \"Machine learning system and apparatus for sampling labelled data\" (US-11481672). https://patentable.app/patents/US-11481672","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-11481672","json":"https://patentable.app/api/llm-context/US-11481672","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-31T06:23:36.318Z"}