{"schema_version":"1.0","canonical_url":"https://patentable.app/patents/US-11521595","patent":{"patent_number":"US-11521595","title":"End-to-end multi-talker overlapping speech recognition","assignee":null,"inventors":[],"filing_date":"2020-05-01T00:00:00.000Z","publication_date":"2022-12-06T00:00:00.000Z","cpc_codes":["G10L","G06N","G06N","G06N","G06N","G10L","G10L","G10L","G10L"],"num_claims":22,"abstract":"A method for training a speech recognition model with a loss function includes receiving an audio signal including a first segment corresponding to audio spoken by a first speaker, a second segment corresponding to audio spoken by a second speaker, and an overlapping region where the first segment overlaps the second segment. The overlapping region includes a known start time and a known end time. The method also includes generating a respective masked audio embedding for each of the first and second speakers. The method also includes applying a masking loss after the known end time to the respective masked audio embedding for the first speaker when the first speaker was speaking prior to the known start time, or applying the masking loss prior to the known start time when the first speaker was speaking after the known end time."},"analysis":{"summary":null,"layman_explanation":null,"technical_analysis":null,"business_analysis":null,"faqs":null,"topics":[],"tech_cluster":null},"seo":{"title":"End-to-end multi-talker overlapping speech recognition","description":"A method for training a speech recognition model with a loss function includes receiving an audio signal including a first segment corresponding to audio spoken by a first speaker, a second segment co","keywords":[]},"attribution":{"source":"Patentable","source_url":"https://patentable.app","canonical_url":"https://patentable.app/patents/US-11521595","license":"CC-BY-4.0-like","license_terms":"AI-generated analysis on this page (summary, layman_explanation, technical_analysis, business_analysis, faqs) may be reused with attribution and a visible link back to the canonical URL above. Patent abstracts, claims, and bibliographic data are USPTO public domain.","required_link":"https://patentable.app/patents/US-11521595","citation_suggestion":"Patentable. \"End-to-end multi-talker overlapping speech recognition\" (US-11521595). https://patentable.app/patents/US-11521595","copyright_holder":"Nomic Interactive Technology LLC"},"links":{"html":"https://patentable.app/patents/US-11521595","json":"https://patentable.app/api/llm-context/US-11521595","site":"https://patentable.app","llms_txt":"https://patentable.app/llms.txt"},"generated_at":"2026-05-30T14:10:35.275Z"}