Legal claims defining the scope of protection, as filed with the USPTO.
1. A device comprising: a processor; and a memory in communication with the processor, the memory comprising executable instructions that, when executed by the processor, cause the processor to control the device to perform functions of: recording a speech by a person to generate an audio recording of the speech; capturing, at a first time of recording the speech, a first enhancement element for the audio recording, the first enhancement element comprising a hashtag mentioned by the person in the audio recording at the first time; adding the first enhancement element to the audio recording, the first enhancement element being associated with a portion of the audio recording at the first time; causing a waveform visually representing the audio recording to be displayed via a graphical user interface (GUI); identifying the first enhancement element associated with the audio recording; causing a first visual representation of the first enhancement element to be displayed via the GUI, the first visual representation being displayed along with the waveform; receiving a first user input selecting the displayed first visual representation; and in response to the received first user input, causing the hashtag to be displayed via the GUI.
2. The device of claim 1 , wherein the instructions, when executed by the processor, further cause the processor to control the device to perform functions of: causing a portion of the audio recording at a second time to be transcribed; causing text from the transcribed portion of the audio recording to be identified; and causing a second visual representation of a second enhancement element to be displayed via the GUI along with the waveform and first visual representation of the first enhancement element, the second enhancement element comprising the text.
3. The device of claim 2 , wherein the text comprises at least one of a name, website, event time, event location and hashtag.
4. The device of claim 1 , wherein the first enhancement element further comprises a timestamp indicating the first time at which the hashtag was mentioned by the person in the audio recording.
5. The device of claim 1 , wherein the first enhancement element further comprises a display duration for which the first enhancement element is to be displayed along with the waveform via the display.
6. The device of claim 1 , wherein the visual representation of the first enhancement element comprises a preview of the first enhancement element.
7. The device of claim 1 , wherein the instructions, when executed by the processor, further cause the processor to control the device to perform functions of: receiving a second user input selecting an editing mode; in response to receiving the second user input, causing the first enhancement element to be presented at an original location on the waveform; receiving a third user input modifying the original location of the first enhancement element on the waveform; and in response to receiving the third user input, causing the first enhancement element presented at the modified location on the waveform.
8. A method of operating a device for displaying an enhancement element for an audio recording, the method comprising: recording a speech by a person to generate an audio recording of the speech; capturing, at a first time of recording the speech, a first enhancement element for the audio recording, the first enhancement element comprising a hashtag mentioned by the person in the audio recording at the first time; adding the first enhancement element to the audio recording, the first enhancement element being associated with a portion of the audio recording at the first time; causing a waveform visually representing the audio recording to be displayed via a graphical user interface (GUI); identifying the first enhancement element associated with the audio recording; causing a first visual representation of the first enhancement element to be displayed via the GUI, the being displayed along with the waveform; receiving a first user input selecting the displayed first visual representation; and in response to the received first user input, causing the hashtag to be displayed via the GUI.
9. The method of claim 8 , further comprising: causing a portion of the audio recording at a second time to be transcribed; causing text from the transcribed portion of the audio recording to be identified; and causing a second visual representation of a second enhancement element to be displayed via the GUI along with the waveform and first visual representation of the first enhancement element, the second enhancement element comprising the text.
10. The method of claim 9 , wherein the text comprises at least one of a name, website, event time, event location and hashtag.
11. The method of claim 8 , wherein the first enhancement element further comprises a timestamp indicating the first time at which the hashtag was mentioned by the person in the audio recording.
12. The method of claim 8 , wherein the first enhancement element includes a display duration for which the first enhancement element is to be displayed along with the waveform via the display.
13. The method of claim 8 , wherein the visual representation of the first enhancement element comprises a preview of the first enhancement element.
14. A non-transitory computer readable medium containing instructions which, when executed by a processor, cause a computer to perform functions of: recording a speech by a person to generate an audio recording of the speech; capturing, at a first time of recording the speech, an enhancement element for the audio recording, the enhancement element comprising a hashtag mentioned by the person in the audio recording at the first time; adding the enhancement element to the audio recording, the enhancement element being associated with a portion of the audio recording at the first time; causing a waveform visually representing an audio recording to be displayed via a graphical user interface (GUI); identifying the enhancement element associated with the audio recording; causing a visual representation of the enhancement element to be displayed via the GUI, the visual representation being displayed along with the waveform; receiving a user input selecting the displayed visual representation; and in response to the received user input, causing the hashtag to be displayed via the GUI.
15. The non-transitory computer readable medium of claim 14 , wherein the visual representation comprises a preview of the enhancement element.
16. The non-transitory computer readable medium of claim 14 , wherein the enhancement element further comprises a display duration for which the enhancement element is to be displayed along with the waveform via the display.
17. The non-transitory computer readable medium of claim 14 , wherein the enhancement element further comprises a timestamp indicating the particular time at which the hashtag was mentioned by the person in the audio recording.
18. The non-transitory computer readable medium of claim 14 , wherein the instructions, when executed by the processor, further cause the processor to control the computer to perform functions of: receiving a second user input selecting an editing mode; in response to receiving the second user input, causing the enhancement element to be presented at an original location on the waveform; receiving a third user input modifying the original location of the enhancement element on the waveform; and in response to receiving the third user input, causing the enhancement element presented at the modified location on the waveform.
Unknown
October 19, 2021
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.