System and Methods for Correcting Text-To-Speech Pronunciation

PublishedFebruary 4, 2020

Assigneenot available in USPTO data we have

Technical Abstract

Patent Claims

12 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A text-to-speech (TTS) server comprising one or more processors in communication with one or more memory devices, the TTS server configured to: generate, for a plurality of first user devices, a first machine pronunciation of text data according to at least one phonetic rule; receive crowdsource data comprising a plurality of pronunciation corrections of the first machine pronunciation from a plurality of audio input devices of the plurality of first user devices, wherein the plurality of first user devices are located in a first geographic location at a time of submission of the pronunciation corrections; generate a second machine pronunciation of the text data by augmenting the at least one phonetic rule based on the crowdsource data; receive, from a second user device, subsequent to generation of the second machine pronunciation, a TTS request including the text data; determine whether the second user device is located within the first geographic location; and provide, via an audio output device of the second user device, one of (i) the first machine pronunciation in response to the second user device being located outside the first geographic location, and (ii) the second machine pronunciation in response to the second user device being located within the first geographic location.

2. The TTS server of claim 1 further configured to assign one of the pronunciation corrections submitted by one of the plurality of first user devices to a user profile associated with a user of the one of the plurality of first user devices.

3. The TTS server of claim 2 , wherein the pronunciation correction is configured to override the at least one phonetic rule.

4. The TTS server of claim 1 further configured to determine a current location of the plurality of first user devices via location services.

5. A computer-implemented method for correcting pronunciation in a text-to-speech (TTS) system, said method implemented using a TTS server in communication with one or more memory devices, said method comprising: generating, by the TTS server for a plurality of first user devices, a first machine pronunciation of text data according to at least one phonetic rule; receiving, by the TTS server, crowdsource data comprising a plurality of pronunciation corrections of the first machine pronunciation from a plurality of audio input devices of the plurality of first user devices, wherein the plurality of first user devices are located in a first geographic location at a time of submission of the pronunciation corrections; generating, by the TTS server, a second machine pronunciation of the text data by augmenting the at least one phonetic rule based on the crowdsource data; receiving, by the TTS server from a second user device, subsequent to generation of the second machine pronunciation, a TTS request including the text data; determining, by the TTS server, whether the second user device is located within the first geographic location; and providing, by the TTS server, via an audio output device of the second user device, one of (i) the first machine pronunciation in response to the second user device being located outside the first geographic location, and (ii) the second machine pronunciation in response to the second user device being located within the first geographic location.

6. The method of claim 5 further comprising assigning one of the pronunciation corrections submitted by one of the plurality of first user devices to a user profile associated with a user of the one of the plurality of first user devices.

7. The method of claim 6 , wherein the pronunciation correction is configured to override the at least one phonetic rule.

8. The method of claim 5 further comprising determining a current location of the plurality of first user devices via location services.

9. A non-transitory computer readable medium that includes computer executable instructions for correcting pronunciation in a text-to-speech (TTS) system, wherein when executed by a TTS server comprising at least one processor in communication with at least one memory device, the computer executable instructions cause the TTS server to: generate, for a plurality of first user devices, a first machine pronunciation of text data according to at least one phonetic rule; receive crowdsource data comprising a plurality of pronunciation corrections of the first machine pronunciation from a plurality of audio input devices of the plurality of first user devices, wherein the plurality of first user devices are located in a first geographic location at a time of submission of the pronunciation corrections; generate a second machine pronunciation of the text data by augmenting the at least one phonetic rule based on the crowdsource data; receive, from a second user device, subsequent to generation of the second machine pronunciation, a TTS request including the text data; determine whether the second user device is located within the first geographic location; and provide, via an audio output device of the second user device, one of (i) the first machine pronunciation in response to the second user device being located outside the first geographic location, and (ii) the second machine pronunciation in response to the second user device being located within the first geographic location.

10. The non-transitory computer readable medium of claim 9 , wherein the computer executable instructions further cause the TTS computing device to assign one of the pronunciation corrections submitted by one of the plurality of first user devices to a user profile associated with a user of the one of the plurality of first user devices.

11. The non-transitory computer readable medium of claim 10 , wherein the pronunciation correction is configured to override the at least one phonetic rule.

12. The non-transitory computer readable medium of claim 9 , wherein the computer executable instructions further cause the TTS server to determine a current location of the plurality of first user devices via location services.

Patent Metadata

Filing Date

Unknown

Publication Date

February 4, 2020

Inventors

Jason Jay Lacoss-Arnold

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search