{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-9697833","patent":{"patent_number":"US-9697833","title":"Audio-visual speech recognition with scattering operators","assignee":null,"inventors":[],"filing_date":"2015-08-25T00:00:00.000Z","publication_date":"2017-07-04T00:00:00.000Z","cpc_codes":["G10L","G06T","G06V","G06V","G10L","G10L"],"num_claims":20,"abstract":"Aspects described herein are directed towards methods, computing devices, systems, and computer-readable media that apply scattering operations to extracted visual features of audiovisual input to generate predictions regarding the speech status of a subject. Visual scattering coefficients generated according to one or more aspects described herein may be used as input to a neural network operative to generate the predictions regarding the speech status of the subject. Predictions generated based on the visual features may be combined with predictions based on audio input associated with the visual features. In some embodiments, the extracted visual features may be combined with the audio input to generate a combined feature vector for use in generating predictions."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Audio-visual speech recognition with scattering operators","description":"Aspects described herein are directed towards methods, computing devices, systems, and computer-readable media that apply scattering operations to extracted visual features of audiovisual input to gen","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-9697833","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. \nPatent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-9697833","citation_suggestion":"Patentable. \"Audio-visual speech recognition with scattering operators\" (US-9697833). https://patentable.app/patents/US-9697833","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-9697833","json":"https://patentable.app/api/llm-context/US-9697833","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-06-06T10:57:40.493Z"}