A camera system capable of capturing images of an event in a dynamic environment includes two microphones configured to capture stereo audio of the event. The microphones are on orthogonal surfaces of the camera system. Because the microphones are on orthogonal surfaces of the camera system, the camera body can impact the spatial response of the two recorded audio channels differently, leading to degraded stereo recreation if standard beam forming techniques are used. The camera system includes tuned beam forming techniques to generate multi-channel audio that more accurately recreates the stereo audio by compensating for the shape of the camera system and the orientation of microphones on the camera system. The tuned beam forming techniques include optimizing a set of beam forming parameters, as a function of frequency, based on the true spatial response of the recorded audio signals.
Legal claims defining the scope of protection, as filed with the USPTO.
1. A camera system comprising: a camera body; an image sensor assembly coupled to a lens assembly to detect and capture images from sources external to the camera body; a first microphone on a first surface of the camera body; a second microphone on a second surface of the camera body, the second surface substantially orthogonal to the first surface; an audio assembly coupling the first microphone and the second microphone, the audio assembly configured to capture a first audio channel via the first microphone and a second audio channel via the second microphone from an audio source during the capture of images by the image sensor, the audio assembly further configured to output the audio channels; and a processor coupled to the audio assembly, the processor coupled with a memory storing instructions that when executed causes the processor to: receive the captured audio channels, apply a set of tuned beam forming parameters tuned for temporal, spectral, and spatial characteristics in the audio channels associated with the orthogonality of the first surface and the second surface, the characteristics affecting the representation of the audio source during stereo audio playback; process the captured audio channels using the set of tuned beam forming parameters, the beam forming parameters applying a delay to the first audio channel and the second audio channel associated with the orthogonality of the first surface and the second surface to generate a first virtual microphone channel and a second virtual microphone channel which improves the representation of the audio source during stereo audio playback; combine the first virtual audio channel and the second virtual audio channel into an audio stream that is configured for stereo audio playback; and output the audio stream.
2. The camera system of 1 , wherein the set of tuned beam forming parameters applied to the audio channels is associated with the linear distance between the first microphone and the second microphone across the surface of the camera body.
3. The camera system of 1 , wherein the set of tuned beam forming parameters applied is associated with the linear distance between the first microphone and the second microphone through the body of the camera.
4. The camera system of 1 , wherein the set of tuned beam forming parameters applied is based on maximizing a noise ratio between opposing hemispheres of the spatial response of the first and second audio signals.
5. The camera system of 1 , wherein set of tuned beam forming parameters can be independently applied to separate frequency sub-bands of the audio channels based on the shape of the camera body and the frequency range of the sub-band.
6. The camera system of 1 , wherein set of tuned beam forming parameters access gain and delay values stored in the system memory associated with the characteristics and shape of the camera body.
7. The camera system of 1 , wherein the camera system determines the gain and delay to apply with the set of tuned beam forming parameters based on the characteristics and shape of the camera body.
8. An electronic device with audio recording capability comprising: a device body; a first microphone on a first surface of the electronic device; a second microphone on a second surface of the electronic device, the second surface substantially orthogonal to the first surface; an audio assembly coupling the first microphone and the second microphone, the audio assembly configured to capture a first audio channel and a second audio channel from an audio source via the first microphone and the second microphone, the audio assembly further configured to output the audio channels; and a processor coupled to the audio assembly, the processor coupled with a memory storing instructions that when executed causes the processor to: receive the captured audio channels, apply a set of tuned beam forming parameters tuned for temporal, spectral, and spatial characteristics in the audio channels associated with the orthogonality of the first surface and the second surface, the characteristics affecting the representation of the audio source during stereo audio playback; process the captured audio channels using the set of tuned beam forming parameters, the beam forming parameters applying a delay to the first audio channel and the second audio channel associated with the orthogonality of the first surface and the second surface to generate a first virtual microphone channel and a second virtual microphone channel which improves the representation of the audio source during stereo audio playback; combine the first virtual audio channel and the second virtual audio channel into an audio stream that is configured for stereo audio playback; and output the audio stream.
9. The electronic device of 8 , wherein the set of tuned beam forming parameters applied to the audio channels is associated with the linear distance between the first microphone and the second microphone across the surface of the electronic device.
10. The electronic device of 8 , wherein the set of tuned beam forming parameters applied is associated with the linear distance between the first microphone and the second microphone through the body of the electronic device.
11. The electronic device of 8 , wherein tuning the set of beam forming parameters applied is based on maximizing a noise ratio between opposing hemispheres of the spatial response of the first and second audio signals.
12. The electronic device of 8 , wherein set of tuned beam forming parameters can be independently applied to separate frequency sub-bands of the audio channels based on the shape of the camera body and the frequency range of the sub-band.
13. The electronic device of 8 , wherein set of tuned beam forming parameters access gain and delay values stored in the system memory associated with the characteristics and shape of the camera body.
14. The electronic device of 8 , wherein the camera system determines the gain and delay to apply with the set of tuned beam forming parameters based on the characteristics and shape of the camera body.
15. A method for generating stereo audio from at least two microphones on a camera body comprising: capturing a first audio channel from an audio source via a first microphone on a first surface of the camera body during the capture of images by the camera system; capturing a second audio channel from the audio source via a second microphone on a second surface of the camera body during the capture of images by the camera system, the second surface orthogonal to the first surface; separating the first and the second audio channels into frequency sub-bands; selecting a set of frequency sub-bands of the audio channels; applying a set of tuned beam forming parameters to the selected frequency sub-bands, the tuned beam forming parameters tuned for temporal, spectral, and spatial characteristics in the audio channels associated with the orthogonality of the first surface and the second surface, the characteristics affecting the representation of the audio source during stereo audio playback; processing the captured audio channels using the tuned set of beam forming parameters, the beam forming parameters applying a delay to the first audio channel and the second audio channel associated with the orthogonality of the first surface and the second surface to generate a first virtual microphone channel and a second virtual microphone channel which improves the representation of the audio source during stereo audio playback; combining the first virtual audio channel and the second virtual audio channel into an audio stream that is configured for stereo audio playback; and outputting the audio stream.
16. The method of 15 , wherein the set of tuned beam forming parameters applied to the audio channels is associated with the linear distance between the first microphone and the second microphone across the surface of the camera body.
17. The method of 15 , wherein the set of tuned beam forming parameters applied is associated with the linear distance between the first microphone and the second microphone through the body of the camera.
18. The method of 15 , wherein tuning the set of beam forming parameters applied is based on maximizing a noise ratio between opposing hemispheres of the spatial response of the first and second audio signals.
19. The method of 15 , wherein set of tuned beam forming parameters can be independently applied to separate frequency sub-bands of the audio channels based on the shape of the camera body and the frequency range of the sub-band.
20. The method of 15 , wherein set of tuned beam forming parameters access gain and delay values stored in the system memory associated with the characteristics and shape of the camera body.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
September 11, 2017
November 6, 2018
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.