{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-11521056","patent":{"patent_number":"US-11521056","title":"System and methods for intrinsic reward reinforcement learning","assignee":null,"inventors":[],"filing_date":"2017-06-14T00:00:00.000Z","publication_date":"2022-12-06T00:00:00.000Z","cpc_codes":["G06N","G06F","G06N","G06N","G06F"],"num_claims":32,"abstract":"A learning agent is disclosed that receives data in sequence from one or more sequential data sources; generates a model modelling sequences of data and actions; and selects an action maximizing the expected future value of a reward function, wherein the reward function depends at least partly on at least one of: a measure of the change in complexity of the model, or a measure of the complexity of the change in the model. The measure of the change in complexity of the model may be based on, for example, the change in description length of the first part of a two-part code describing one or more sequences of received data and actions, the change in description length of a statistical distribution modelling, the description length of the change in the first part of the two-part code, or the description length of the change in the statistical distribution modelling."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"System and methods for intrinsic reward reinforcement learning","description":"A learning agent is disclosed that receives data in sequence from one or more sequential data sources; generates a model modelling sequences of data and actions; and selects an action maximizing the e","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-11521056","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-11521056","citation_suggestion":"Patentable. \"System and methods for intrinsic reward reinforcement learning\" (US-11521056). https://patentable.app/patents/US-11521056","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-11521056","json":"https://patentable.app/api/llm-context/US-11521056","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T13:15:14.164Z"}