9997169

Image-Based Techniques for Audio Content

Published: June 12, 2018
Assignee: not available in the USPTO data we have
Patent Claims
20 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. A method comprising:
receiving, at a device from a first media device, audio data and a video of an object;
determining vibrations of the object from the video, the vibrations caused by sound in a vicinity of the object, wherein a portion of the vibrations is responsive to media content rendered at the first media device;
accessing a database to retrieve vibration data associated with the media content rendered at the first media device;
filtering the vibration data from the portion to generate vibration information;
generating audio information based on the vibration information;
filtering the audio data based on the audio information to generate audio content associated with the first media device; and
transmitting the audio content from the device to a second media device.
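The steps of claim 1 can be read as a small signal pipeline. The sketch below is illustrative only and is not the patented implementation. It assumes that each video frame is a grayscale pixel grid, that frame-to-frame change in mean brightness is an acceptable stand-in for object vibration, that the video and audio are already aligned sample for sample, that the database is a plain dictionary, and that the final audio filter is a simple gate. Every name in it (`VIBRATION_DB`, `vibrations_from_video`, `process`, the `"show-123"` key) is hypothetical, and the transmitting step is left out.

```python
from typing import Dict, List, Sequence

Frame = Sequence[Sequence[float]]  # grayscale pixel grid (assumed frame format)

# Hypothetical stand-in for the claimed database:
# media content id -> known vibration trace for that content.
VIBRATION_DB: Dict[str, List[float]] = {
    "show-123": [0.0, 0.5, -0.5, 0.0],
}


def mean_brightness(frame: Frame) -> float:
    """Average pixel value of one frame."""
    pixels = [p for row in frame for p in row]
    return sum(pixels) / len(pixels)


def vibrations_from_video(frames: Sequence[Frame]) -> List[float]:
    """Crude vibration proxy: change in mean brightness between frames."""
    levels = [mean_brightness(f) for f in frames]
    return [b - a for a, b in zip(levels, levels[1:])]


def filter_vibration(observed: Sequence[float],
                     reference: Sequence[float]) -> List[float]:
    """Remove the known media-content vibration from the observed vibration."""
    return [o - r for o, r in zip(observed, reference)]


def audio_from_vibration(vibration_info: Sequence[float],
                         gain: float = 1.0) -> List[float]:
    """Map vibration information to audio information (assumed linear gain)."""
    return [gain * v for v in vibration_info]


def filter_audio(audio_data: Sequence[float],
                 audio_info: Sequence[float],
                 threshold: float = 0.1) -> List[float]:
    """Gate the audio: keep samples where the audio information shows activity."""
    return [a if abs(i) > threshold else 0.0
            for a, i in zip(audio_data, audio_info)]


def process(frames: Sequence[Frame],
            audio_data: Sequence[float],
            content_id: str) -> List[float]:
    """Run the claimed steps in order, up to (not including) transmission."""
    observed = vibrations_from_video(frames)
    reference = VIBRATION_DB[content_id]
    vibration_info = filter_vibration(observed, reference)
    audio_info = audio_from_vibration(vibration_info)
    return filter_audio(audio_data, audio_info)


frames = [[[0.0]], [[0.0]], [[1.5]], [[1.0]], [[1.0]]]
print(process(frames, [0.2, 0.9, 0.4, 0.1], "show-123"))
```

The gate is only one possible reading of "filtering the audio data based on the audio information"; the claim language does not fix a particular filter.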

2. The method of claim 1, wherein the video is generated by an imaging system that utilizes ambient lighting.

3. The method of claim 1, wherein the video is generated by an imaging system that projects light on the object.

4. The method of claim 1, wherein the device includes a network device.

5. The method of claim 1, wherein the device includes a customer premises equipment device.

6. The method of claim 1, wherein filtering the audio data comprises cancelling the vibration data from the audio data.
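One simple way to picture the cancelling in claim 6 is subtraction of a scaled reference signal. The sketch below is an assumption-laden illustration, not the patent's method: it assumes the retrieved vibration data has already been converted to an audio-domain reference that is time-aligned with the audio data and sampled at the same rate, and it estimates a single least-squares gain before subtracting. The function names `estimate_gain` and `cancel` are hypothetical.

```python
from typing import List, Sequence


def estimate_gain(audio: Sequence[float], reference: Sequence[float]) -> float:
    """Least-squares gain g that minimizes sum((audio - g * reference)^2)."""
    energy = sum(r * r for r in reference)
    if energy == 0.0:
        return 0.0  # silent reference: nothing to cancel
    return sum(a * r for a, r in zip(audio, reference)) / energy


def cancel(audio: Sequence[float], reference: Sequence[float]) -> List[float]:
    """Subtract the scaled reference from the audio, sample by sample."""
    gain = estimate_gain(audio, reference)
    return [a - gain * r for a, r in zip(audio, reference)]


# Toy example: the audio is speech plus twice the reference signal.
reference = [1.0, -1.0, 1.0, -1.0]
speech = [0.5, 0.5, -0.5, -0.5]
audio = [s + 2.0 * r for s, r in zip(speech, reference)]
print(cancel(audio, reference))
```

In this toy case the speech is uncorrelated with the reference, so the gain estimate is exact. With real signals the residual would contain some leakage, and a practical system would more likely use an adaptive filter than a single gain.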

7. The method of claim 1, wherein the audio data includes attribute data, the attribute data associated with the first media device, a location of the first media device, or both, and wherein the generation of the audio content is based on the attribute data.

8. The method of claim 1, wherein the video indicates amplitude of the vibrations of the object, phase of the vibrations of the object, frequency of the vibrations of the object, or a combination thereof.
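Claim 8 names amplitude, phase, and frequency as properties the video may indicate. As a hedged illustration of how those three quantities could be read off a vibration trace, the sketch below runs a direct discrete Fourier transform over the trace and reports the strongest non-DC component. It assumes the trace is a list of per-frame displacement values and that `sample_rate` is the video frame rate; the function name `dominant_component` is hypothetical and nothing here is taken from the patent's description.

```python
import cmath
import math
from typing import Sequence, Tuple


def dominant_component(signal: Sequence[float],
                       sample_rate: float) -> Tuple[float, float, float]:
    """Return (frequency, amplitude, phase) of the strongest DFT bin.

    Bin 0 (DC) and the Nyquist bin are skipped so that the
    amplitude scaling 2 * |X[k]| / n applies to every bin examined.
    """
    n = len(signal)
    best_k, best_val = 0, 0j
    for k in range(1, n // 2):
        val = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(signal))
        if abs(val) > abs(best_val):
            best_k, best_val = k, val
    frequency = best_k * sample_rate / n
    amplitude = 2 * abs(best_val) / n
    phase = cmath.phase(best_val)  # radians, relative to a cosine
    return frequency, amplitude, phase


# Toy trace: 32 frames of a cosine at bin 3 with amplitude 0.5, phase pi/4.
trace = [0.5 * math.cos(2 * math.pi * 3 * t / 32 + math.pi / 4)
         for t in range(32)]
freq, amp, phase = dominant_component(trace, sample_rate=32.0)
print(freq, round(amp, 6), round(phase, 6))
```

A frame-rate-sampled trace can only represent frequencies below half the frame rate, so recovering audible sound this way would need a high-speed camera or some other technique the claims do not specify.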

9. An electronic device comprising:
a processor; and
a memory in communication with the processor, the memory including instructions executable by the processor to perform operations including:
receiving, from a first media device, audio data and a video of an object;
determining vibrations of the object from the video, the vibrations caused by sound in a vicinity of the object, wherein a portion of the vibrations is responsive to media content rendered at the first media device;
accessing a database to retrieve vibration data associated with the media content rendered at the first media device;
filtering the vibration data from the portion to generate vibration information;
generating audio information based on the vibration information;
filtering the audio data based on the audio information to generate audio content associated with the first media device; and
transmitting the audio content to a second media device.

10. The electronic device of claim 9, wherein a second portion of the vibrations is due to vocal sound produced by a person in the vicinity of the object.

11. The electronic device of claim 10, wherein the vocal sound represents speech of the person in connection with a video meeting between the person and a second person associated with the second media device.

12. The electronic device of claim 11, wherein the operations further include: receiving video content from the first media device, the video content associated with the video meeting; and encoding the video content and the audio content into a message.

13. The electronic device of claim 12, wherein the audio content is transmitted to the second media device by sending the message to the second media device.
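Claims 12 and 13 describe encoding the video content and the audio content into a single message that is then sent to the second media device. The claims do not specify a message format, so the sketch below simply assumes one: a fixed header holding two 32-bit lengths in network byte order, followed by the video bytes and then the audio bytes. The names `encode_message` and `decode_message` are hypothetical.

```python
import struct
from typing import Tuple

# Assumed header layout: video length, audio length (unsigned 32-bit, big-endian).
HEADER = struct.Struct("!II")


def encode_message(video: bytes, audio: bytes) -> bytes:
    """Pack video and audio payloads into one length-prefixed message."""
    return HEADER.pack(len(video), len(audio)) + video + audio


def decode_message(message: bytes) -> Tuple[bytes, bytes]:
    """Recover the (video, audio) payloads from an encoded message."""
    video_len, audio_len = HEADER.unpack_from(message)
    start = HEADER.size
    video = message[start:start + video_len]
    audio = message[start + video_len:start + video_len + audio_len]
    return video, audio


message = encode_message(b"video-bytes", b"audio-bytes")
print(len(message), decode_message(message))
```

A real video-meeting system would more likely use an established container or streaming format; the length-prefixed layout here only illustrates the idea of carrying both payloads in one message.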

14. The electronic device of claim 9, wherein filtering the audio data comprises cancelling the vibration data from the audio data.

15. A computer-readable storage device comprising instructions executable by a processor to perform operations including:
receiving, from a first media device, audio data and a video of an object;
determining vibrations of the object from the video, the vibrations caused by sound in a vicinity of the object, wherein a portion of the vibrations is responsive to media content rendered at the first media device;
accessing a database to retrieve vibration data associated with the media content rendered at the first media device;
filtering the vibration data from the portion to generate vibration information;
generating audio information based on the vibration information;
filtering the audio data based on the audio information to generate audio content associated with the first media device; and
sending the audio content to a second media device.

16. The computer-readable storage device of claim 15, wherein the video indicates amplitude of the vibrations, phase of the vibrations, frequency of the vibrations, or a combination thereof.

17. The computer-readable storage device of claim 15, wherein the audio data includes attribute data, the attribute data associated with the first media device, a location of the first media device, or both, and wherein the generation of the audio content is based on the attribute data.

18. The computer-readable storage device of claim 15, wherein a second portion of the audio content represents speech of a person in connection with a video meeting between the person and a second person.

19. The computer-readable storage device of claim 18, wherein the operations further include: receiving video content from the first media device, the video content associated with the video meeting; and encoding the video content and the audio content into a message.

20. The computer-readable storage device of claim 19, wherein the audio content is transmitted to the second media device by sending the message to the second media device.

Patent Metadata

Filing Date

Unknown

Publication Date

June 12, 2018

Inventors

James H. Pratt
Michael Lattanzi
Steven Belz
Constance Missimer

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “IMAGE-BASED TECHNIQUES FOR AUDIO CONTENT” (9997169). https://patentable.app/patents/9997169
