{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-11468310","patent":{"patent_number":"US-11468310","title":"Constraining actions for reinforcement learning under safety requirements","assignee":null,"inventors":[],"filing_date":"2018-03-07T00:00:00.000Z","publication_date":"2022-10-11T00:00:00.000Z","cpc_codes":["G06N","G06N","G06N"],"num_claims":20,"abstract":"A computer-implemented method, computer program product, and system are provided for deep reinforcement learning to control a subject device. The method includes training, by a processor, a neural network to receive state information of a target of the subject device as an input and provide action information for the target as an output. The method further includes inputting, by the processor, current state information of the target into the neural network to obtain current action information for the target. The method also includes correcting, by the processor, the current action information minimally to obtain corrected action information that meets a set of constraints. The method additionally includes performing an action by the subject device based on the corrected action information for the target to obtain a reward from the target."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Constraining actions for reinforcement learning under safety requirements","description":"A computer-implemented method, computer program product, and system are provided for deep reinforcement learning to control a subject device. The method includes training, by a processor, a neural net","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-11468310","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-11468310","citation_suggestion":"Patentable. \"Constraining actions for reinforcement learning under safety requirements\" (US-11468310). https://patentable.app/patents/US-11468310","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-11468310","json":"https://patentable.app/api/llm-context/US-11468310","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T23:01:45.522Z"}