Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.
1. A device for processing a bitstream, the device comprising: one or more processors configured to obtain the bitstream, the bitstream comprising a compressed version of a spatial component of a sound field, the spatial component of the sound field being represented by a vector in a spherical harmonics domain, wherein a value of a syntax element for a current frame is indicative of a vector quantization codebook used when compressing the vector, the bitstream further comprising an indicator, the indicator having a particular value indicating the bitstream does not include the value of the syntax element for the current frame and the value of the syntax element for the current frame is equal to a value of the syntax element for a previous frame; and a memory coupled to the one or more processors, the memory configured to store the bitstream.
A device processes an audio bitstream representing a compressed spatial component of a sound field, specifically Higher-Order Ambisonics (HOA) audio. The sound field's spatial component is represented as a vector in the spherical harmonics domain. The bitstream includes a syntax element whose value indicates which vector quantization codebook was used to compress the vector in the current frame. An indicator within the bitstream signals whether the syntax element's value should be reused from the previous frame. If the indicator has a specific value, the decoder reuses the syntax element value from the previous frame, saving bits. The device stores the bitstream in memory.
2. The device of claim 1 , wherein the one or more processors are further configured to use the vector quantization codebook to reconstruct the vector.
Building upon the device described for processing an audio bitstream, where an indicator signals whether to reuse a vector quantization codebook index from the previous frame (as indicated by a syntax element in the bitstream), the device further uses this vector quantization codebook to reconstruct the original vector. The device accesses the codebook specified by the syntax element to dequantize the compressed vector, thus recreating the original spatial component of the sound field. This allows the audio signal to be reconstructed from the compressed bitstream.
3. The device of claim 1 , wherein the syntax element is a first syntax element and the indicator comprises one or more bits of a value of a second syntax element for the current frame, the value of the second syntax element for the current frame indicating a quantization mode used when compressing the vector.
In the device that processes an audio bitstream by indicating reuse of a vector quantization codebook from a previous frame, the syntax element indicating the codebook is a "first syntax element." The indicator for reuse is encoded within a "second syntax element" for the current frame. The value of this "second syntax element" indicates the quantization mode used during vector compression. Thus, the indicator signaling whether to reuse the codebook index is embedded within the quantization mode information itself, saving additional bits in the bitstream.
4. The device of claim 3 , wherein: the indicator comprises a value of a third syntax element for the current frame and a value of a fourth syntax element for the current frame, and the value of the third syntax element for the current frame plus the value of the fourth syntax element for the current frame being equal to zero indicates the bitstream does not include the value of the first syntax element for the current frame and the value of the first syntax element for the current frame is equal to the value of the first syntax element for the previous frame.
Expanding on the device where the codebook reuse indicator is based on a second syntax element indicating quantization mode, the indicator consists of two additional syntax elements: a "third syntax element" and a "fourth syntax element." If the sum of the values of the third and fourth syntax elements for the current frame equals zero, this indicates that the first syntax element (the codebook index) is *not* present in the bitstream and that the codebook index value from the previous frame should be reused. This provides a specific, efficient signaling mechanism to indicate codebook reuse.
5. The device of claim 3 , wherein the indicator includes a most significant bit of the value of the second syntax element for the current frame and a second most significant bit of the value of the second syntax element for the current frame.
In the device that determines vector quantization codebook reuse based on a second syntax element, the indicator is formed by specific bits within that second syntax element. Specifically, the *most significant bit* (MSB) and the *second most significant bit* (2nd MSB) of the second syntax element together form the indicator. The combination of these two bits signals whether the codebook index (the first syntax element) should be reused from the previous frame. This implements codebook reuse indication through bit-field encoding.
6. The device of claim 1 , the one or more processors further configured to: decompose higher-order ambisonic audio data to obtain the vector; and specify the vector in the bitstream to obtain the bitstream.
In the device processing audio bitstreams using codebook reuse, the device first decomposes higher-order ambisonic (HOA) audio data to obtain the vector representing the spatial component of the sound field. Then, the device encodes and specifies this vector in the bitstream. This generates the bitstream with the HOA information, allowing later decoding and reconstruction of the audio signal. The initial HOA decomposition is a required step for this compression scheme.
7. The device of claim 1 , the one or more processors further configured to: obtain, from the bitstream, an audio object that corresponds to the vector; and combine the audio object with the vector to reconstruct higher-order ambisonic (HOA) audio data.
Expanding on the device for processing audio bitstreams and reusing quantization codebooks, the device obtains an audio object from the bitstream that corresponds to the vector representing the spatial component of the sound field. The device then combines this audio object with the vector to reconstruct higher-order ambisonic (HOA) audio data. This combination allows the recovery of a more complete audio signal from its compressed representation in the bitstream.
8. The device of claim 1 , wherein: the one or more processors are configured to render the HOA audio data to output one or more loudspeaker feeds, the device is coupled to one or more loudspeakers, wherein the one or more loudspeaker feeds drive the one or more loudspeakers.
The device that decodes and reconstructs audio from the described bitstream also renders the reconstructed HOA audio data to produce one or more loudspeaker feeds. The device is connected to one or more loudspeakers, and the generated loudspeaker feeds drive those speakers. The loudspeaker feeds contain the correct audio signal for each speaker to recreate the original sound field from the HOA representation.
9. The device of claim 1 , wherein the one or more processors are further configured to: obtain the value of the syntax element for the current frame from the bitstream when the indicator does not have the particular value.
The device utilizes an indicator to decide whether to re-use quantization codebook index. In the case when the indicator does *not* have the specific value indicating re-use, the device will obtain the value of the syntax element representing the vector quantization codebook index directly from the current frame's data in the bitstream. This way, the device properly updates the codebook index.
10. A method for processing a bitstream, the method comprising: obtaining the bitstream, the bitstream comprising a compressed version of a spatial component of a sound field, the spatial component of the sound field being represented by a vector in a spherical harmonics domain, wherein a value of a syntax element for a current frame is indicative of a vector quantization codebook used when compressing the vector, the bitstream further comprising an indicator, the indicator having a particular value indicating the bitstream does not include the value of the syntax element for the current frame and the value of the syntax element for the current frame is equal to a value of the syntax element for a previous frame; and storing the bitstream.
A method for processing an audio bitstream representing a compressed spatial component of a sound field involves obtaining the bitstream, where the spatial component is represented by a vector in a spherical harmonics domain. The bitstream contains a syntax element whose value indicates the vector quantization codebook used for vector compression in the current frame. An indicator in the bitstream signals whether the codebook index should be reused from the previous frame. If the indicator has a specific value, the codebook index from the previous frame is reused for the current frame. The method also includes storing the bitstream.
11. The method of claim 10 , further comprising using the vector quantization codebook to reconstruct the vector.
The method for processing an audio bitstream, involving re-using a codebook index and storing the bitstream (as indicated by a syntax element and its indicator), also involves using the indicated vector quantization codebook to reconstruct the original vector. This enables the recovery of the sound field's spatial component from the compressed bitstream and contributes to recreating the original audio signal.
12. The method of claim 10 , wherein the syntax element is a first syntax element and the indicator comprises one or more bits of a value of a second syntax element for the current frame, the value of the second syntax element for the current frame indicating a quantization mode used when compressing the vector.
In the method of processing an audio bitstream through re-using vector quantization codebooks, the syntax element used to indicate the codebook index is denoted as a "first syntax element." The indicator for codebook re-use is embedded within one or more bits of a "second syntax element" for the current frame. The value of this "second syntax element" represents the quantization mode used in compressing the vector.
13. The method of claim 12 , wherein: the indicator comprises a value of a third syntax element for the current frame and a value of a fourth syntax element for the current frame, and the value of the third syntax element for the current frame plus the value of the fourth syntax element for the current frame being equal to zero indicates the bitstream does not include the value of the first syntax element for the current frame and the value of the first syntax element for the current frame is equal to the value of the first syntax element for the previous frame.
In the method where an indicator determines re-use based on a second syntax element, the indicator utilizes two additional syntax elements: a "third syntax element" and a "fourth syntax element." When the sum of the values of the third and fourth syntax elements for the current frame equals zero, this signifies that the first syntax element (codebook index) is absent from the current frame's bitstream and that the codebook index from the previous frame must be re-used.
14. The method of claim 12 , wherein the indicator includes a most significant bit of the value of the second syntax element for the current frame and a second most significant bit of the value of the second syntax element for the current frame.
Within the method of determining codebook reuse based on a second syntax element, the indicator consists of the *most significant bit* (MSB) and the *second most significant bit* (2nd MSB) of that second syntax element. The specific bit values act as the signal determining whether the first syntax element (codebook index) should be re-used from the previous frame.
15. The method of claim 10 , further comprising: decomposing higher-order ambisonic audio data to obtain the vector; and specifying the vector in the bitstream to obtain the bitstream.
In the method of processing audio bitstreams with codebook reuse, the method also incorporates decomposing higher-order ambisonic (HOA) audio data to obtain a vector representing the sound field's spatial component, and specifying that vector within the bitstream, resulting in a compressed representation of the HOA audio.
16. The method of claim 10 , further comprising: obtaining, from the bitstream, an audio object that corresponds to the vector; and combining the audio object with the vector to reconstruct higher-order ambisonic audio data.
The method for processing audio bitstreams also involves obtaining an audio object, representing the non-spatial data, from the bitstream that corresponds to the spatial vector. The method combines the audio object with the spatial vector to reconstruct complete higher-order ambisonic (HOA) audio data, after vector dequantization.
17. The method of claim 10 , further comprising: decoding the bitstream to obtain Higher Order Ambisonics (HOA) coefficients; and rendering the HOA coefficients to output one or more loudspeaker feeds, a device rendering the HOA coefficients to output the one or more loudspeaker feeds is coupled to one or more loudspeakers, wherein the one or more loudspeaker feeds drive the one or more loudspeakers.
This audio processing method produces loudspeaker feeds to recreate HOA audio. The method first decodes the bitstream to derive Higher Order Ambisonics (HOA) coefficients. Next, the HOA coefficients are rendered to generate one or more loudspeaker feeds. These feeds are sent to loudspeakers that generate the audio signal.
18. The method of claim 10 , further comprising: obtaining the value of the syntax element for the current frame from the bitstream when the indicator does not have the particular value.
When using the re-use indicator in the audio bitstream processing method, and the indicator's value does not signal re-use of the quantization codebook index, then the method involves reading the value of the syntax element from the current frame data in the bitstream. The codebook index is updated with the newly-read value.
19. A device for processing a bitstream, the device comprising: means for obtaining the bitstream, the bitstream comprising a compressed version of a spatial component of a sound field, the spatial component of the sound field being represented by a vector in a spherical harmonics domain, wherein a value of a syntax element for a current frame is indicative of a vector quantization codebook used when compressing the vector, the bitstream further comprising an indicator, the indicator having a particular value indicating the bitstream does not include the value of the syntax element for the current frame and the value of the syntax element for the current frame is equal to a value of the syntax element for a previous frame; and means for storing the bitstream.
A device for processing a bitstream uses means for obtaining the bitstream, where the bitstream contains compressed spatial components of a sound field (represented as a vector in the spherical harmonics domain), and a syntax element indicating which vector quantization codebook was used in compression. An indicator signals whether to reuse the syntax element from the previous frame. The device also includes means for storing the bitstream.
20. The device of claim 19 , further comprising: means for using the vector quantization codebook to reconstruct the vector.
A device for processing a bitstream using means for obtaining and storing also has the capability, or means, to use the indicated vector quantization codebook to reconstruct the spatial vector.
21. The device of claim 19 , wherein the syntax element is a first syntax element and the indicator comprises one or more bits of a value of a second syntax element for the current frame, the value of the second syntax element for the current frame indicating a quantization mode used when compressing the vector.
The audio bitstream processing device indicates the syntax element (that contains the quantization codebook index) as a "first syntax element". The indicator showing whether to reuse the previous codebook index, is located within a "second syntax element". The second syntax element's value shows the vector quantization mode.
22. The device of claim 19 , further comprising: means for decomposing higher-order ambisonic audio data to obtain the vector; and means for specifying the vector in the bitstream to obtain the bitstream.
A device processing an audio bitstream also incorporates means for decomposing higher-order ambisonic (HOA) audio data to obtain a spatial vector, and means for specifying that vector within the created bitstream.
23. The device of claim 19 , the device further comprising: means for obtaining the value of the syntax element for the current frame from the bitstream when the indicator does not have the particular value.
When processing audio bitstreams with codebook re-use, and the codebook re-use indicator shows re-use is not desired, the device has means for directly obtaining the value of the syntax element from the current frame in the bitstream.
24. A non-transitory computer-readable storage medium having instructions stored thereon that, when executed, configure a device to: obtain a bitstream, the bitstream comprising a compressed version of a spatial component of a sound field, the spatial component of the sound field being represented by a vector in a spherical harmonics domain, wherein a value of a syntax element for a current frame is indicative of a vector quantization codebook used when compressing the vector, the bitstream further comprising an indicator, the indicator having a particular value indicating the bitstream does not include the value of the syntax element for the current frame and the value of the syntax element for the current frame is equal to a value of the syntax element for a previous frame; and store the bitstream.
A non-transitory computer-readable medium stores instructions for audio processing. These instructions cause a device to obtain a bitstream, the bitstream including a spatial component of a sound field (represented by a vector in the spherical harmonics domain), and a syntax element to indicate quantization codebook. There is an indicator to signal whether to reuse from a previous frame. There is a means to store the bitstream.
25. The non-transitory computer-readable storage medium of claim 24 , wherein the instructions, when executed, configure the device to use the vector quantization codebook to reconstruct the vector.
The instructions stored on the non-transitory medium for audio processing, which obtains, indicates reuse, and stores, also include using the vector quantization codebook indicated to reconstruct the spatial vector from the compressed bitstream.
26. The non-transitory computer-readable storage medium of claim 24 , wherein the syntax element is a first syntax element and the indicator comprises one or more bits of a value of a second syntax element for the current frame, the value of the second syntax element for the current frame indicating a quantization mode used when compressing the vector.
The non-transitory storage medium stores instructions to designate the syntax element (containing the quantization codebook index) as a "first syntax element". The codebook re-use indicator is embedded as part of a "second syntax element" value, which also indicates the vector's quantization mode.
27. The non-transitory computer-readable storage medium of claim 24 , wherein the instructions, when executed, cause the device to: decompose higher-order ambisonic audio data to obtain the vector; and specify the vector in the bitstream to obtain the bitstream.
A non-transitory medium also has instructions to first decompose higher-order ambisonic audio to obtain a spatial vector. The vector is then specified within the compressed audio bitstream.
28. The non-transitory computer-readable storage medium of claim 24 , wherein the instructions, when executed, cause the device to: obtain the value of the syntax element for the current frame from the bitstream when the indicator does not have the particular value.
The non-transitory medium has instructions for an indicator for codebook reuse. When the indicator shows that the codebook is not to be reused, instructions cause direct reading the value of the syntax element from the current frame.
Unknown
August 29, 2017
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.