Systems and methods for controlling adaptivity of noise cancellation are presented. One or more audio signals are received by one or more corresponding microphones. The one or more signals may be decomposed into frequency sub-bands. Noise cancellation consistent with identified adaptation constraints is performed on the one or more audio signals. The one or more audio signals may then be reconstructed from the frequency sub-bands and outputted via an output device.
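As an illustration only, the decompose, process, and reconstruct flow summarized above might be sketched as follows. The abstract does not specify which filter bank is used, so a naive discrete Fourier transform stands in for the sub-band decomposition here; the function names and the `band_gains` parameter are hypothetical, and the per-band gain is only a placeholder for the actual noise cancellation step.

```python
import cmath

def analyze(frame):
    """Decompose one frame into frequency sub-bands (naive DFT, an assumed stand-in)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]

def synthesize(bands):
    """Reconstruct the time-domain frame from its sub-bands (inverse DFT)."""
    n = len(bands)
    return [
        (sum(bands[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]

def process_frame(frame, band_gains):
    """Apply a hypothetical per-band gain between analysis and synthesis."""
    bands = analyze(frame)
    modified = [g * b for g, b in zip(band_gains, bands)]
    return synthesize(modified)
```

With all gains set to 1.0 the frame passes through unchanged up to rounding error, which is a quick way to check that the analysis and synthesis stages are consistent before any cancellation logic is placed between them.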
Legal claims defining the scope of protection, as filed with the USPTO.
1. A method for controlling adaptivity of noise cancellation, the method comprising: adapting, using at least one hardware processor, a coefficient to suppress a noise component of a primary audio signal and form a modified audio signal, the primary audio signal representing a first captured sound and comprising a speech component and the noise component; and outputting the modified audio signal via an output device, wherein adapting the coefficient includes reducing a value of the coefficient based on an audio noise energy estimate, the coefficient being faded to zero when the audio noise energy estimate is less than a threshold, the threshold being determined based on an estimate of the microphone self-noise in the primary or a secondary audio signal, the secondary audio signal representing a second captured sound.
2. The method of claim 1, wherein the coefficient is faded to about zero based on the noise energy estimate.
3. The method of claim 1, wherein the noise energy estimate may be determined from the primary audio signal, the secondary audio signal or a residual audio signal derived from a difference of the primary audio signal and the speech component of the primary audio signal.
4. The method of claim 3, wherein the noise energy estimate is performed on individual frequency sub-bands of the residual audio signal.
5. A method for controlling adaptivity of noise cancellation, the method comprising: determining, using at least one hardware processor, a first transfer function between a speech component of a primary audio signal and a speech component of a secondary audio signal, the primary audio signal representing a first captured sound and comprising the speech component and a noise component, and the secondary audio signal representing a second captured sound and comprising the speech component and a noise component; determining a second transfer function between the noise component of the primary audio signal and the noise component of the secondary audio signal; determining a difference between the first transfer function and the second transfer function; adapting a coefficient applied to the primary audio signal to generate a modified primary audio signal when the difference exceeds a threshold; and outputting the modified primary audio signal via an output device.
6. The method of claim 5, further comprising: adapting a first coefficient to suppress the speech component of the primary audio signal thus forming a residual audio signal; adapting a second coefficient applied to the residual audio signal when a difference exceeds the threshold to obtain a noise prediction audio signal; and subtracting the noise prediction audio signal from the primary audio signal to generate a modified primary signal.
7. The method of claim 6, wherein adapting the second coefficient is performed on individual frequency sub-bands of the primary audio signal.
8. The method of claim 6, wherein determining the first transfer function and the second transfer function comprises cross-correlating the primary audio signal and the secondary audio signal.
9. The method of claim 6, wherein the second coefficient is adapted when an estimate of far-end activity exceeds the threshold.
10. A non-transitory computer-readable storage medium having a program embodied thereon, the program executable by a processor to perform a method for controlling adaptivity of noise cancellation, the method comprising: determining a first transfer function between a speech component of a primary audio signal and a speech component of a secondary signal, the primary audio signal representing a first captured sound and comprising the speech component and a noise component, and the secondary audio signal representing a second captured sound and comprising the speech component and the noise component; determining a second transfer function between the noise component of the primary audio signal and the noise component of the secondary audio signal; determining a difference between the first transfer function and the second transfer function; adapting a coefficient applied to the primary audio signal to generate a modified primary audio signal when the difference exceeds a threshold; and outputting the modified primary audio signal via an output device.
11. The non-transitory computer-readable storage medium of claim 10, the method further comprising: adapting a first coefficient to suppress the speech component of the primary audio signal thus forming a residual audio signal; adapting a second coefficient applied to the residual audio signal when the difference exceeds the threshold to obtain a noise prediction audio signal; and subtracting the noise prediction audio signal from the primary audio signal to generate a modified primary signal.
12. The non-transitory computer-readable storage medium of claim 11, wherein adapting the second coefficient is performed on individual frequency sub-bands of the primary audio signal.
13. The non-transitory computer-readable storage medium of claim 11, wherein determining the first transfer function and the second transfer function comprises cross-correlating the primary audio signal and the secondary audio signal.
14. The non-transitory computer-readable storage medium of claim 11, wherein the second coefficient is adapted when an estimate of far-end activity exceeds the threshold.
15. A non-transitory computer-readable storage medium having a program embodied thereon, the program executable by a processor to perform a method for controlling adaptivity of noise cancellation, the method comprising: adapting a coefficient to suppress a noise component of a primary audio signal and form a modified audio signal, the primary audio signal representing a first captured sound and comprising a speech component and the noise component; and outputting the modified audio signal via an output device, wherein adapting the coefficient includes reducing a value of the coefficient based on an audio noise energy estimate, the coefficient fading to zero when the audio noise energy estimate is less than a threshold, the threshold being determined based on an estimate of the microphone self-noise in the primary or a secondary audio signal, the secondary audio signal representing a second captured sound.
16. The non-transitory computer-readable storage medium of claim 14, wherein the coefficient is faded to about zero based on the noise energy estimate.
17. The non-transitory computer-readable storage medium of claim 15, wherein the noise energy estimate may be determined from the primary audio signal, the secondary audio signal or a residual audio signal derived from a difference of the primary audio signal and the speech component of the primary audio signal.
18. The non-transitory computer-readable storage medium of claim 17, wherein the noise energy estimate is performed on individual frequency sub-bands of the residual audio signal.
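The two adaptation constraints recited in the claims above can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the `margin`, `fade`, and `floor` parameters, and the use of a magnitude difference between complex per-band transfer functions are all assumptions made for illustration. The first function fades a coefficient toward zero when the noise energy estimate falls below a threshold derived from a microphone self-noise estimate (claims 1 and 15); the second permits adaptation only when the speech and noise transfer functions differ by more than a threshold (claims 5 and 10).

```python
def fade_coefficient(coeff, noise_energy, self_noise_energy,
                     margin=4.0, fade=0.5, floor=1e-3):
    """Fade `coeff` toward zero when the noise energy estimate is too low.

    The threshold is derived from the microphone self-noise estimate;
    `margin`, `fade`, and `floor` are hypothetical tuning parameters.
    """
    threshold = margin * self_noise_energy
    if noise_energy >= threshold:
        return coeff              # enough acoustic noise: leave the coefficient alone
    faded = coeff * fade          # shrink the coefficient on each call
    if abs(faded) < floor:
        return 0.0                # snap to zero once it is negligibly small
    return faded

def should_adapt(h_speech, h_noise, threshold):
    """Allow adaptation only if the two transfer functions differ enough.

    `h_speech` and `h_noise` are assumed to be complex per-band transfer
    function estimates; the magnitude of their difference is an assumed metric.
    """
    return abs(h_speech - h_noise) > threshold
```

Calling `fade_coefficient` once per frame while the noise estimate stays below the threshold drives the coefficient to zero over several frames rather than switching it off abruptly, which is one plausible reading of "faded to zero".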
April 13, 2009
February 3, 2015