Voice Notes with Transcription

PublishedApril 22, 2025

Assigneenot available in USPTO data we have

InventorsAlex Edelsburg Kyle Goodrich Pedram Javidpour Andrew James McPhee Daniel Moreno+1 more

Technical Abstract

Patent Claims

20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method comprising: receiving a first audio message at a recipient user device; causing presentation, on a display of the recipient user device, of a chat user interface (UI) including a first user-selectable element actionable to cause playing of the first audio message and a second user-selectable element actionable to cause presentation of a transcription of the first audio message, the first user-selectable element and the second user-selectable element being simultaneously presented in the chat UI; in response to detecting activation of the second user-selectable element: modifying the chat UI to include the transcription of the first audio message; and causing presentation of the modified chat UI on the display of the recipient user device, the first user-selectable element actionable to cause playing of the first audio message being presented in the modified chat UI simultaneously with the transcription of the first audio message; and in response to detecting activation of a third user-selectable element associated with a second audio message, removing the transcription of the first audio message from the modified chat UI.

2. The method of claim 1, comprising commencing generating of the transcription of the first audio message, in response to the detecting of activation of the second user-selectable element.

3. The method of claim 1, wherein the chat UI includes a fourth user-selectable element actionable to cause playing of the first audio message at a speed different from a normal play speed.

4. The method of claim 1, wherein the third user-selectable element is a partial transcription of the second audio message, the method further comprising causing presentation of a full transcription of the second audio message in response to the detecting the activation of the third user-selectable element.

5. The method of claim 1, wherein the chat UI includes a microphone user-selectable element actionable to initiate an audio recording process and a virtual keyboard for obtaining text input from a user, the method comprising: detecting activation of the microphone user-selectable element; and in response to detecting the activation of the microphone user-selectable element, replacing the virtual keyboard in the chat UI with a voice note tray, the voice note tray comprising one or more user-selectable elements for facilitating obtaining audio input from a user.

6. The method of claim 5, wherein the detecting of activation of the microphone user-selectable element is based on detecting a tap gesture directed to the microphone user-selectable element.

7. The method of claim 5, wherein the voice note tray includes a user-selectable element actionable to discard an audio message.

8. The method of claim 5, wherein the voice note tray includes a record user-selectable element actionable to commence recording of an audio message.

9. The method of claim 8, comprising: in response to detecting activation of the record user-selectable element, commencing capturing audio input; generating a new audio message based on the audio input; and communicating the new audio message to a second user device.

10. The method of claim 1, wherein the chat UI is presented by a messaging client application executing at the recipient user device, the messaging client application provided by a backend service of a messaging system for exchanging data over a network.

11. A system comprising: one or more processors; and a non-transitory computer readable storage medium comprising instructions that when executed by the one or more processors cause the one or more processors to perform operations comprising: receiving a first audio message at a recipient user device; causing presentation, on a display of the recipient user device, of a chat user interface (UI) including a first user-selectable element actionable to cause playing of the first audio message and a second user-selectable element actionable to cause presentation of a transcription of the first audio message, the first user-selectable element and the second user-selectable element being simultaneously presented in the chat UI; in response to detecting activation of the second user-selectable element: modifying the chat UI to include the transcription of the first audio message; and causing presentation of the modified chat UI on the display of the recipient user device, the first user-selectable element actionable to cause playing of the first audio message being presented in the modified chat UI simultaneously with the transcription of the first audio message; and in response to detecting activation of a third user-selectable element associated with a second audio message, removing the transcription of the first audio message from the modified chat UI.

12. The system of claim 11, wherein the operations caused by instructions executed by the one or more processors further include commencing generating of the transcription of the first audio message, in response to the detecting of activation of the second user-selectable element.

13. The system of claim 11, wherein the chat UI includes a fourth user-selectable element actionable to cause playing of the first audio message at a speed different from a normal play speed.

14. The system of claim 11, wherein the third user-selectable element is a partial transcription of the second audio message, the operations further comprising causing presentation of a full transcription of the second audio message in response to the detecting the activation of the third user-selectable element.

15. The system of claim 11, wherein the chat UI includes a microphone user-selectable element actionable to initiate an audio recording process and a virtual keyboard for obtaining text input from a user, the operations caused by instructions executed by the one or more processors further include: detecting activation of the microphone user-selectable element; and in response to detecting the activation of the microphone user-selectable element, replacing the virtual keyboard in the chat UI with a voice note tray, the voice note tray comprising one or more user-selectable elements for facilitating obtaining audio input from a user.

16. The system of claim 15, wherein the detecting of activation of the microphone user-selectable element is based on detecting a tap gesture directed to the microphone user-selectable element.

17. The system of claim 15, wherein the voice note tray includes a user-selectable element actionable to discard an audio message.

18. The system of claim 15, wherein the voice note tray includes a record user-selectable element actionable to commence recording of an audio message.

19. The system of claim 18, wherein the operations caused by instructions executed by the one or more processors further include: in response to detecting activation of the record user-selectable element, commencing capturing audio input; generating a new audio message based on the audio input; and communicating the new audio message to a second user device.

20. A machine-readable non-transitory storage medium having instruction data executable by a machine to cause the machine to perform operations comprising: receiving a first audio message at a recipient user device; causing presentation, on a display of the recipient user device, of a chat user interface (UI) including a first user-selectable element actionable to cause playing of the first audio message and a second user-selectable element actionable to cause presentation of a transcription of the first audio message, the first user-selectable element and the second user-selectable element being simultaneously presented in the chat UI; in response to detecting activation of the second user-selectable element: modifying the chat UI to include the transcription of the first audio message; and causing presentation of the modified chat UI on the display of the recipient user device, the first user-selectable element actionable to cause playing of the first audio message being presented in the modified chat UI simultaneously with the transcription of the first audio message; and in response to detecting activation of a third user-selectable element associated with a second audio message, removing the transcription of the first audio message from the modified chat UI.

Patent Metadata

Filing Date

Unknown

Publication Date

April 22, 2025

Inventors

Alex Edelsburg

Kyle Goodrich

Pedram Javidpour

Andrew James McPhee

Daniel Moreno

Jeremy Baker Voss

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search