{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-10497382","patent":{"patent_number":"US-10497382","title":"Associating faces with voices for speaker diarization within videos","assignee":null,"inventors":[],"filing_date":"2017-04-26T00:00:00.000Z","publication_date":"2019-12-03T00:00:00.000Z","cpc_codes":["G10L","G06V","G06V","G06V","G10L","G10L","G10L","G10L","G10L","G10L","G10L","G11B","G11B","G11B","H04N","H04N","H04N","H04N","H04N","H04N","H04N","G10L","G10L","G10L"],"num_claims":20,"abstract":"A computer-implemented method for speech diarization is described. The method comprises determining temporal positions of separate faces in a video using face detection and clustering. Voice features are detected in the speech sections of the video. The method further includes generating a correlation between the determined separate faces and separate voices based at least on the temporal positions of the separate faces and the separate voices in the video. This correlation is stored in a content store with the video."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"Associating faces with voices for speaker diarization within videos","description":"A computer-implemented method for speech diarization is described. The method comprises determining temporal positions of separate faces in a video using face detection and clustering. Voice features ","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-10497382","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above.\nPatent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-10497382","citation_suggestion":"Patentable. \"Associating faces with voices for speaker diarization within videos\" (US-10497382). https://patentable.app/patents/US-10497382","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-10497382","json":"https://patentable.app/api/llm-context/US-10497382","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T17:06:14.770Z"}