Legal claims defining the scope of protection, as filed with the USPTO.
3. The method of claim 2, wherein the context of the telephone conversation comprises one or more of: an identity of the entity, a time the telephone call is initiated, an entity location associated with the entity, or a user location associated with the user.
6. The method of claim 1, wherein the server bypasses performance of speech recognition on the spoken utterance.
9. The method of claim 7, wherein the machine learning model is trained based on historical data for previous telephone conversations, and wherein the historical data for the previous telephone conversation comprises, for each previous telephone conversation, at least (i) corresponding previous first speaker intents associated with a first speaker determined based on corresponding first portions of each previous telephone conversation, (ii) corresponding previous second speaker intents associated with a second speaker determined based on corresponding second portions of each previous telephone conversation, (iii) corresponding previous audio data that captures most recent spoken utterance of the first speaker or the second speaker during each previous telephone conversation, and (iv) a corresponding previous intent of a corresponding previous reply to corresponding the most recent spoken utterance.
10. The method of claim 9, wherein the historical data for the previous telephone conversation further comprises, for each previous telephone conversation, (v) a corresponding previous context for each previous telephone conversation.
13. The system of claim 12, wherein the context of the telephone conversation comprises one or more of: an identity of the entity, a time the telephone call is initiated, an entity location associated with the entity, or a user location associated with the user.
16. The system of claim 11, wherein the system bypasses performance of speech recognition on the spoken utterance.
19. The system of claim 17, wherein the machine learning model is trained based on historical data for previous telephone conversations, and wherein the historical data for the previous telephone conversation comprises, for each previous telephone conversation, (i) a corresponding previous context for each previous telephone conversation, (ii) corresponding previous first speaker intents associated with a first speaker determined based on corresponding first portions of each previous telephone conversation, (iii) corresponding previous second speaker intents associated with a second speaker determined based on corresponding second portions of each previous telephone conversation, (iv) corresponding previous audio data that captures most recent spoken utterance of the first speaker or the second speaker during each previous telephone conversation, (v) a corresponding previous intent of a corresponding previous reply to corresponding the most recent spoken utterance.
Unknown
November 8, 2022
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.