Methods and systems for transcription playback with variable emphasis

Published: June 11, 2024

Assignee: not available in the USPTO data we have

Inventors: not available in the USPTO data we have

Technical Abstract

Methods and systems are provided for assisting operation of a vehicle using speech recognition and transcription, with text-to-speech used for transcription playback with variable emphasis. One method involves analyzing a transcription of an audio communication with respect to the vehicle to identify an operational term pertaining to a current operational context of the vehicle within the transcription, creating an indicator identifying the operational term within the transcription for emphasis when the operational term pertains to the current operational context of the vehicle, identifying a user-configured playback rate, and generating an audio reproduction of the transcription of the audio communication in accordance with the user-configured playback rate, wherein the operational term is selectively emphasized within the audio reproduction based on the indicator.
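
The flow described in the abstract (find operational terms, mark them, then play back at the user's rate with the marked terms emphasized) can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: the `CONTEXT_TERMS` vocabulary, the `Segment` type, the context labels, and the choice to emphasize by slowing a term to 75% of the user's rate.

```python
from dataclasses import dataclass

# Hypothetical vocabulary: terms treated as relevant in each operational context.
CONTEXT_TERMS = {
    "approach": {"runway", "cleared", "altitude"},
    "taxi": {"taxiway", "hold", "short"},
}


@dataclass
class Segment:
    text: str
    emphasized: bool
    rate: float  # playback-rate multiplier, 1.0 = normal speed


def tag_terms(transcription, context):
    """Return the word indices of terms relevant to the given context."""
    relevant = CONTEXT_TERMS.get(context, set())
    words = transcription.split()
    return [i for i, w in enumerate(words) if w.strip(".,").lower() in relevant]


def plan_playback(transcription, context, user_rate, emphasis_slowdown=0.75):
    """Build one playback segment per word; tagged words get a slower rate."""
    tagged = set(tag_terms(transcription, context))
    plan = []
    for i, word in enumerate(transcription.split()):
        is_term = i in tagged
        rate = user_rate * emphasis_slowdown if is_term else user_rate
        plan.append(Segment(word, is_term, rate))
    return plan
```

Calling `plan_playback("Cleared to land runway 27L", "approach", 1.5)` marks "Cleared" and "runway" for emphasis and leaves the other words at the user's rate. A real system would hand each segment to a text-to-speech engine rather than stop at a plan.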

Patent Claims

13 claims

Legal claims defining the scope of protection, as filed with the USPTO.

3. The method of claim 1, wherein generating the audio reproduction comprises selectively increasing a volume level associated with a portion of the audio reproduction corresponding to the operational term.

4. The method of claim 1, wherein generating the audio reproduction comprises selectively decreasing a words per minute associated with a portion of the audio reproduction corresponding to the operational term.
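
One way to picture the volume and words-per-minute emphasis of claims 3 and 4 is as speech markup. The sketch below wraps emphasized words in the `<prosody>` element from the W3C SSML specification, which carries `rate` and `volume` attributes. The patent does not mention SSML; the function name `to_ssml`, the 80% rate, and the +6 dB boost are illustrative assumptions, and individual speech engines differ in which attribute values they accept.

```python
from xml.sax.saxutils import escape


def to_ssml(words, emphasized, term_rate_pct=80, term_volume_db=6):
    """Return an SSML fragment in which emphasized word indices are
    spoken more slowly and more loudly than the surrounding words."""
    parts = []
    for i, word in enumerate(words):
        text = escape(word)  # keep &, <, > from breaking the markup
        if i in emphasized:
            parts.append(
                f'<prosody rate="{term_rate_pct}%" '
                f'volume="+{term_volume_db}dB">{text}</prosody>'
            )
        else:
            parts.append(text)
    return "<speak>" + " ".join(parts) + "</speak>"
```

For example, `to_ssml(["descend", "to", "5000"], {2})` wraps only "5000" in a `<prosody>` element. A complete SSML document would also carry version and namespace attributes on `<speak>`, which are omitted here for brevity.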

5. The method of claim 1, further comprising obtaining an updated operational context for the vehicle, wherein generating the audio reproduction comprises dynamically varying the emphasis associated with a portion of the audio reproduction comprising the operational term based on a relationship between the updated operational context and the current operational context at a time associated with receipt of the audio communication.
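
Claim 5 does not say how the relationship between the two contexts is measured. As a hedged sketch, assume each context is a small dictionary of fields (the `phase` and `runway` keys are hypothetical) and let the emphasis scale with the fraction of fields that are unchanged since the communication was received.

```python
def emphasis_level(context_at_receipt, updated_context):
    """Fraction of context fields still unchanged, from 0.0 to 1.0."""
    if not context_at_receipt:
        return 0.0
    unchanged = sum(
        1 for key, value in context_at_receipt.items()
        if updated_context.get(key) == value
    )
    return unchanged / len(context_at_receipt)


def term_rate(user_rate, level, max_slowdown=0.4):
    """Slow the operational term by up to max_slowdown as emphasis grows."""
    return user_rate * (1.0 - max_slowdown * level)
```

With a level of 0.0 the term is played at the unmodified user-configured rate, which is one way to read the deemphasis case; with a level of 1.0 it is slowed by the full 40% assumed here.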

6. The method of claim 1, further comprising identifying the user-configured playback rate based on a position of a graphical user interface element associated with a playback rate for the audio reproduction, wherein the operational term is selectively emphasized within the audio reproduction by reducing the playback rate for a portion of the audio reproduction comprising the operational term relative to the user-configured playback rate.
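
The mapping from a slider position to a playback rate in claim 6 can be sketched as a linear interpolation. The normalized 0.0 to 1.0 position, the 0.5x to 2.0x range, and the 25% reduction for emphasized terms are assumptions chosen for illustration, not values from the patent.

```python
def rate_from_slider(position, min_rate=0.5, max_rate=2.0):
    """Map a normalized slider position (0.0 to 1.0) onto a playback rate."""
    position = min(max(position, 0.0), 1.0)  # clamp out-of-range input
    return min_rate + position * (max_rate - min_rate)


def emphasized_rate(user_rate, reduction=0.25):
    """Rate for the operational term, reduced relative to the user's rate."""
    return user_rate * (1.0 - reduction)
```

A slider at its midpoint gives `rate_from_slider(0.5) == 1.25`, and a user rate of 2.0 gives an emphasized-term rate of 1.5, so the term is always played more slowly than the rest of the reproduction.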

7. The method of claim 1, further comprising identifying the user-configured playback rate based on historical playback behavior associated with a user.
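
Claim 7 leaves open how historical playback behavior is turned into a rate. One simple possibility, offered only as an assumption, is the median of the rates the user most recently selected, with a fallback when no history exists. The function name, window size, and default are hypothetical.

```python
from statistics import median


def rate_from_history(past_rates, default=1.0, window=10):
    """Infer a playback rate from the user's most recent selections."""
    recent = past_rates[-window:]
    return median(recent) if recent else default
```

The median is used here because a single unusually fast or slow session does not shift it much; a mean or a per-context history would be equally plausible readings of the claim.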

8. The method of claim 1, wherein creating the indicator comprises creating one or more metadata tags associated with an entry for the transcription in a data storage element, wherein the one or more metadata tags indicate at least one of a position of the operational term within the transcription and the current operational context.
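
The metadata tags of claim 8 can be pictured as small records stored alongside the transcription entry. The sketch below is an assumed layout, not the patent's schema: `TermTag`, `TranscriptionEntry`, and `tag_entry` are hypothetical names, positions are character offsets, and matching is a plain case-insensitive substring search that assumes ASCII text.

```python
from dataclasses import dataclass, field


@dataclass
class TermTag:
    term: str      # the operational term as it appears in the text
    start: int     # character offset where the term begins
    end: int       # character offset just past the term
    context: str   # operational context when the tag was created


@dataclass
class TranscriptionEntry:
    text: str
    tags: list = field(default_factory=list)


def tag_entry(entry, terms, context):
    """Attach a tag for every occurrence of each term in the entry text."""
    lowered = entry.text.lower()
    for term in terms:
        needle = term.lower()
        start = lowered.find(needle)
        while start != -1:
            end = start + len(needle)
            entry.tags.append(TermTag(entry.text[start:end], start, end, context))
            start = lowered.find(needle, end)
    entry.tags.sort(key=lambda tag: tag.start)
    return entry
```

Because each tag records both the position and the context, a later playback step can locate the term and decide whether the stored context still applies without re-analyzing the text.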

10. The method of claim 9, wherein generating the audio reproduction comprises selectively increasing a volume level associated with the first portion of the speech audio output comprising the operational term relative to a second volume level associated with the second portion of the speech audio output.

11. The method of claim 1, wherein generating the audio reproduction comprises deemphasizing the operational term within the audio reproduction based on the indicator by generating a portion of the audio reproduction comprising the operational term with the user-configured playback rate when an updated operational context for the vehicle does not match the current operational context at a time associated with the audio communication.

13. The computer-readable medium of claim 12, wherein the computer-executable instructions cause the processing system to selectively increase a volume level associated with a portion of the audio reproduction including the operational term.

14. The computer-readable medium of claim 12, wherein the computer-executable instructions cause the processing system to generate the audio reproduction of the transcription of the audio communication in accordance with the user-configured playback rate by converting the transcription of the audio communication into speech having a first words per minute corresponding to the user-configured playback rate, wherein the operational term is selectively emphasized by converting the operational term into speech using a second words per minute that is less than the first words per minute.
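
The two words-per-minute values in claim 14 have a direct effect on how long the reproduction takes. The arithmetic can be sketched as follows; the function name and the sample figures are illustrative assumptions.

```python
def playback_seconds(total_words, term_words, first_wpm, second_wpm):
    """Estimate playback time when term_words of the transcription are
    spoken at second_wpm and the remainder at first_wpm."""
    if not 0 < second_wpm < first_wpm:
        raise ValueError("second_wpm must be positive and less than first_wpm")
    if not 0 <= term_words <= total_words:
        raise ValueError("term_words must be between 0 and total_words")
    regular_words = total_words - term_words
    return regular_words * 60.0 / first_wpm + term_words * 60.0 / second_wpm
```

For a 30-word transcription with 6 emphasized words, a first rate of 180 words per minute and a second rate of 120 words per minute, the estimate is 8 seconds for the regular words plus 3 seconds for the emphasized words, or 11 seconds in total, compared with 10 seconds if everything were spoken at 180 words per minute.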

15. The computer-readable medium of claim 12, wherein the indicator comprises one or more metadata tags associated with an entry for the transcription in a data storage element, wherein the one or more metadata tags indicate at least one of a position of the operational term within the transcription and the current operational context at a time associated with the audio communication.

16. The computer-readable medium of claim 12, wherein the computer-executable instructions cause the processing system to deemphasize the operational term within the audio reproduction when an updated operational context for the vehicle does not match the current operational context at a time associated with the audio communication.

18. The method of claim 1, further comprising dynamically adjusting the user-configured playback rate based on the current operational context.
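
Claim 18 does not state in which direction the rate is adjusted or which contexts trigger an adjustment. As one hedged reading, the sketch below caps the user-configured rate during phases assumed to be high workload. The phase names and cap values in `MAX_RATE_BY_PHASE` are hypothetical.

```python
# Hypothetical per-phase upper limits on the playback-rate multiplier.
MAX_RATE_BY_PHASE = {
    "takeoff": 1.0,
    "approach": 1.0,
    "landing": 1.0,
    "cruise": 2.0,
}


def adjusted_rate(user_rate, phase, default_cap=1.5):
    """Limit the user-configured rate to the cap for the current phase."""
    return min(user_rate, MAX_RATE_BY_PHASE.get(phase, default_cap))
```

With these values a user setting of 1.75 is played at 1.0 during approach, at 1.75 during cruise, and at 1.5 in any phase not listed in the table.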

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention.

G06F, G10L

Patent Metadata

Filing Date

August 27, 2021

Publication Date

June 11, 2024
