{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-8521678","patent":{"patent_number":"US-8521678","title":"Learning control system and learning control method","assignee":null,"inventors":[],"filing_date":"2010-06-03T00:00:00.000Z","publication_date":"2013-08-27T00:00:00.000Z","cpc_codes":["G06N"],"num_claims":9,"abstract":"A learning control system according to the present invention performs learning of action values of actions in an apparatus which identifies its state as one of predetermined states, and selects an action based on the obtained action values and the identified state. The learning control system includes n action value learning devices, the first to the n-th learning devices, which perform learning of n action values from Q1 to Qn, where n is a positive integer, and an action value determining device which determines the total action value Q of an action in each state based on outputs of the n action value learning devices. In the learning control system, the first target value of the first action value learning device is determined based on the reward r obtained after an action has been carried out by the next state and a total action value Q′ that was prepared for the action selection in the next state, and the first learning device updates the first action value Q1 using the first target value. 
When n is 2 or more, the n-th target value of the n-th action value learning device is set to the difference between the (n−1)-th target value of the (n−1)-th learning device and the action value Qn−1, and the n-th learning device updates the n-th action value Qn using the n-th target value."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Learning control system and learning control method","description":"A learning control system according to the present invention is one which performs learning of action values of actions in an apparatus which identifies its state as one of predetermined states, and s","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-8521678","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-8521678","citation_suggestion":"Patentable. \"Learning control system and learning control method\" (US-8521678). https://patentable.app/patents/US-8521678","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-8521678","json":"https://patentable.app/api/llm-context/US-8521678","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-31T05:15:02.000Z"}