Voice Synthesis for Virtual Agents

Published: May 18, 2021

Assignee: not available in USPTO data we have

Inventors: David Charles Hardage, Megan Sarah Jennings, Samantha Sprague, Joseph Kasonde, Abrham Tibebu Workineh, Alejandra Valles, Nelson Thomas Hittner

Patent Claims

24 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A computer-implemented method performed by at least one processor, the method comprising: receiving information identifying a customer; determining, based on the information identifying the customer, if the customer is a known customer; in response to determining that the customer is an unknown customer, selecting a first voice for a virtual agent and assigning the first voice to the virtual agent during communications with the customer, the first voice including at least one first sound property; in response to determining that the customer is a known customer, selecting a second voice for the virtual agent, including selecting at least one second sound property for a voice that is an alteration of the at least one first sound property, based on previously received information about the customer, and assigning the second voice to the virtual agent during communications with the customer.

2. The computer-implemented method of claim 1, further comprising: monitoring the communications with the customer; and in response to detecting an intent of the customer during the monitoring, altering the at least one first sound property or the at least one second sound property based on the detected intent.

3. The computer-implemented method of claim 1, further comprising: performing signal processing on received voice data to identify at least one sound property of a customer voice of the customer, wherein selecting the first voice for the virtual agent is further based on the at least one sound property of the customer voice of the customer.

4. The computer-implemented method of claim 1, wherein: assigning the first voice to the virtual agent includes synthesizing the first voice using speech synthesis; assigning the second voice to the virtual agent includes synthesizing the second voice using speech synthesis; and the virtual agent uses at least one of the synthesized first voice and the synthesized second voice during communications with the customer.

5. The computer-implemented method of claim 1, further comprising: receiving textual message information from a customer service representative to the customer; and altering the voice assigned to the virtual agent based on the textual message information.

6. The computer-implemented method of claim 1, wherein selecting the second voice includes matching the customer to a voice profile, the voice profile identified by: identifying, using one or more machine learning techniques, one or more previous communications between the customer and a human service representative that resulted in a positive outcome; and performing signal processing on voice data representing a voice of the human service representative.

7. The computer-implemented method of claim 1, further comprising: monitoring the communications with the customer; determining a response of the customer to the voice assigned to the virtual agent; and updating a machine learning system based on the determined response.

8. The computer-implemented method of claim 1, wherein the first voice or the second voice is selected using a machine-learning system to match the voice of a service representative assigned to the customer and is used in real-time in two-way communications with the customer.

9. A system, comprising: at least one processor; and a memory communicatively coupled to the at least one processor, the memory storing instructions which, when executed by the at least one processor, cause the at least one processor to perform operations comprising: receiving information identifying a customer; determining, based on the information identifying the customer, if the customer is a known customer; in response to determining that the customer is an unknown customer, selecting a first voice for a virtual agent and assigning the first voice to the virtual agent during communications with the customer, the first voice including at least one first sound property; in response to determining that the customer is a known customer, selecting a second voice for the virtual agent, including selecting at least one second sound property for a voice that is an alteration of the at least one first sound property, based on previously received information about the customer, and assigning the second voice to the virtual agent during communications with the customer.

10. The system of claim 9, wherein the operations further comprise: monitoring the communications with the customer; and in response to detecting an intent of the customer during the monitoring, altering the at least one first sound property or the at least one second sound property based on the detected intent.

11. The system of claim 9, wherein the operations further comprise: performing signal processing on received voice data to identify at least one sound property of a customer voice of the customer, wherein selecting the first voice for the virtual agent is further based on the at least one sound property of the customer voice of the customer.

12. The system of claim 9, wherein: assigning the first voice to the virtual agent includes synthesizing the first voice using speech synthesis; assigning the second voice to the virtual agent includes synthesizing the second voice using speech synthesis; and the virtual agent uses at least one of the synthesized first voice and the synthesized second voice during communications with the customer.

13. The system of claim 9, wherein the operations further comprise: receiving textual message information from a customer service representative to the customer; and altering the voice assigned to the virtual agent based on the textual message information.

14. The system of claim 9, wherein selecting the second voice includes matching the customer to a voice profile, the voice profile identified by: identifying, using one or more machine learning techniques, one or more previous communications between the customer and a human service representative that resulted in a positive outcome; and performing signal processing on voice data representing a voice of the human service representative.

15. The system of claim 9, wherein the operations further comprise: monitoring the communications with the customer; determining a response of the customer to the voice assigned to the virtual agent; and updating a machine learning system based on the determined response.

16. The system of claim 9, wherein the first voice or the second voice is selected using a machine-learning system to match the voice of a service representative assigned to the customer for the duration of a call and is used in real-time in two-way communications with the customer.

17. One or more non-transitory computer-readable media storing instructions which, when executed by at least one processor, cause the at least one processor to perform operations comprising: receiving information identifying a customer; determining, based on the information identifying the customer, if the customer is a known customer; in response to determining that the customer is an unknown customer, selecting a first voice for a virtual agent and assigning the first voice to the virtual agent during communications with the customer, the first voice including at least one first sound property; in response to determining that the customer is a known customer, selecting a second voice for the virtual agent, including selecting at least one second sound property for a voice that is an alteration of the at least one first sound property, based on previously received information about the customer, and assigning the second voice to the virtual agent during communications with the customer.

18. The one or more non-transitory computer-readable media of claim 17, wherein the operations further comprise: monitoring the communications with the customer; and in response to detecting an intent of the customer during the monitoring, altering the at least one first sound property or the at least one second sound property based on the detected intent.

19. The one or more non-transitory computer-readable media of claim 17, wherein the operations further comprise: performing signal processing on received voice data to identify at least one sound property of a customer voice of the customer, wherein selecting the first voice for the virtual agent is further based on the at least one sound property of the customer voice of the customer.

20. The one or more non-transitory computer-readable media of claim 17, wherein assigning the first voice to the virtual agent includes synthesizing the first voice using speech synthesis; assigning the second voice to the virtual agent includes synthesizing the second voice using speech synthesis; and the virtual agent uses at least one of the synthesized first voice and the synthesized second voice during communications with the customer.

21. The one or more non-transitory computer-readable media of claim 17, wherein the operations further comprise: receiving textual message information from a customer service representative to the customer; and altering the voice assigned to the virtual agent based on the textual message information.

22. The one or more non-transitory computer-readable media of claim 17, wherein selecting the second voice includes matching the customer to a voice profile, the voice profile identified by: identifying, using one or more machine learning techniques, one or more previous communications between the customer and a human service representative that resulted in a positive outcome; and performing signal processing on voice data representing a voice of the human service representative.

23. The one or more non-transitory computer-readable media of claim 17, wherein the operations further comprise: monitoring the communications with the customer; determining a response of the customer to the voice assigned to the virtual agent; and updating a machine learning system based on the determined response.

24. The one or more non-transitory computer-readable media of claim 17, wherein the first voice or the second voice is selected using a machine-learning system to match the voice of a service representative assigned to the customer and is used in real-time in two-way communications with the customer.
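
Illustrative sketch (not part of the claims): the branching recited in claims 1 and 2 can be pictured as a small selection routine. The Python below is a minimal sketch under stated assumptions, not the patented implementation. The `Voice` fields (`pitch_hz`, `rate_wpm`), the `CUSTOMER_HISTORY` store, the `preferred_rate_wpm` key, and the `"complaint"` intent label are all hypothetical names chosen for illustration; the claims do not specify which sound properties or intents are used.

```python
from dataclasses import dataclass, replace
from typing import Dict, Optional


@dataclass(frozen=True)
class Voice:
    """Hypothetical bundle of sound properties for a virtual agent voice."""
    pitch_hz: float
    rate_wpm: float


# Assumed default ("first") voice used for unknown customers.
DEFAULT_VOICE = Voice(pitch_hz=180.0, rate_wpm=160.0)

# Assumed store of previously received information about known customers.
CUSTOMER_HISTORY: Dict[str, Dict[str, float]] = {
    "cust-42": {"preferred_rate_wpm": 140.0},
}


def select_voice(customer_id: str,
                 history: Optional[Dict[str, Dict[str, float]]] = None,
                 first_voice: Voice = DEFAULT_VOICE) -> Voice:
    """Return the first voice for an unknown customer, or a second voice
    whose sound property is an alteration of the first voice's property,
    based on stored information, for a known customer."""
    history = CUSTOMER_HISTORY if history is None else history
    record = history.get(customer_id)
    if record is None:
        # Unknown customer: assign the first voice unchanged.
        return first_voice
    # Known customer: alter one sound property of the first voice.
    rate = record.get("preferred_rate_wpm", first_voice.rate_wpm)
    return replace(first_voice, rate_wpm=rate)


def adjust_for_intent(voice: Voice, intent: Optional[str]) -> Voice:
    """Alter a sound property when an intent is detected (claim 2 style).
    The intent label and the size of the adjustment are assumptions."""
    if intent == "complaint":
        return replace(voice, pitch_hz=voice.pitch_hz - 15.0)
    return voice


if __name__ == "__main__":
    print(select_voice("new-caller"))
    print(select_voice("cust-42"))
    print(adjust_for_intent(select_voice("cust-42"), "complaint"))
```

In this sketch the second voice is derived from the first by changing a single property, which mirrors the claim language that the second sound property "is an alteration of the at least one first sound property"; a real system could alter several properties or pick from stored voice profiles instead.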

Patent Metadata

Filing Date

Unknown

Publication Date

May 18, 2021

Inventors

David Charles Hardage

Megan Sarah Jennings

Samantha Sprague

Joseph Kasonde

Abrham Tibebu Workineh

Alejandra Valles

Nelson Thomas Hittner
