{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-10515292","patent":{"patent_number":"US-10515292","title":"Joint acoustic and visual processing","assignee":null,"inventors":[],"filing_date":"2017-06-15T00:00:00.000Z","publication_date":"2019-12-24T00:00:00.000Z","cpc_codes":["G06V","G06F","G06F","G06F","G06F","G06N","G06N","G06N","G06V","G06V","G06V","G10L","G10L","G10L"],"num_claims":15,"abstract":"An approach to joint acoustic and visual processing associates images with corresponding audio signals, for example, for the retrievals of images according to voice queries. A set of paired images and audio signals are processed without requiring transcription, segmentation, or annotation of either the images or the audio. This processing of the paired images and audio is used to determine parameters of an image processor and an audio processor, with the outputs of these processors being comparable to determine a similarity across acoustic and visual modalities. In some implementations, the image processor and the audio processor make use of deep neural networks. Further embodiments associate parts of images with corresponding parts of audio signals."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Joint acoustic and visual processing","description":"An approach to joint acoustic and visual processing associates images with corresponding audio signals, for example, for the retrievals of images according to voice queries. A set of paired images and","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-10515292","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-10515292","citation_suggestion":"Patentable. \"Joint acoustic and visual processing\" (US-10515292). https://patentable.app/patents/US-10515292","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-10515292","json":"https://patentable.app/api/llm-context/US-10515292","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T08:43:47.432Z"}