US-11239859

Methods and devices for vector segmentation for coding

Published: February 1, 2022

Assignee: not available in the USPTO data we have

Inventors: not available in the USPTO data we have

Technical Abstract

A method for partitioning input vectors for coding is presented. The method comprises obtaining an input vector. The input vector is segmented, in a non-recursive manner, into an integer number, N_SEG, of input vector segments. A representation of a respective relative energy difference between parts of the input vector on each side of each boundary between the input vector segments is determined in a recursive manner. The input vector segments and the representations of the relative energy differences are provided for individual coding. Partitioning units and computer programs for partitioning input vectors for coding, as well as positional encoders, are also presented.

Patent Claims

14 claims

Legal claims defining the scope of protection, as filed with the USPTO.

1. An audio coding method, the method comprising: obtaining an input vector of coefficients originating from an audio signal; segmenting the input vector into an integer number (N_SEG) of input vector segments according to a ratio between a total bit-budget for quantizing the input vector and a maximum number of bits allowed for quantizing a vector segment, wherein the maximum number of bits is constrained by a vector quantizer; determining a representation of a respective relative energy difference between parts of the input vector on each side of each boundary between the input vector segments, wherein the determining comprises: a) setting the input vector as an upper level input vector; b) splitting the upper level input vector into left and right parts, each part comprising one or more input vector segments, wherein the upper level input vector is split at the segment boundary between the left and the right part into two lower level input vectors; c) calculating a representation of a relative energy difference between the two lower level input vectors according to an energy ratio between the lower level input vectors; and d) repeating the steps b) and c) of splitting and calculating by re-setting the lower level input vectors as a respective upper level input vector, until all boundaries between input vector segments are provided with an associated representation of a relative energy difference; allocating bits for encoding the shape of each of the input vector segment and for encoding of the representations of the relative energy differences between the input vector segments, wherein bits for encoding the input vector segments are distributed between segments according to relative energy differences between parts of the input vector; and providing each the input vector segment, the representations of the relative energy differences, and allocation information to the vector quantizer for individual encoding of the input vector segments.
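Steps a) through d) of claim 1 can be sketched in Python as a recursion over a list of segments. This is only an illustrative sketch, not the patented implementation: the function names are hypothetical, the split point is assumed to be the middle segment boundary, and the "representation of a relative energy difference" is assumed here to be an angle derived from the energy ratio, which the claim itself does not specify.

```python
import math

def energy(vector):
    """Sum of squared coefficients of one vector."""
    return sum(x * x for x in vector)

def boundary_energy_angles(segments):
    """Map each segment boundary index to an angle encoding the energy ratio.

    Boundary k lies between segments[k - 1] and segments[k].
    ASSUMPTION: the representation is atan2(sqrt(E_right), sqrt(E_left));
    the claim only requires "a representation ... according to an energy ratio".
    """
    angles = {}

    def recurse(lo, hi):
        # segments[lo:hi] plays the role of the "upper level input vector".
        count = hi - lo
        if count < 2:
            return  # a single segment has no internal boundary
        # ASSUMPTION: split at the middle boundary; for an odd count the
        # right (last) part receives the extra segment.
        mid = lo + count // 2
        e_left = sum(energy(s) for s in segments[lo:mid])
        e_right = sum(energy(s) for s in segments[mid:hi])
        angles[mid] = math.atan2(math.sqrt(e_right), math.sqrt(e_left))
        # Step d): each lower level vector becomes a new upper level vector.
        recurse(lo, mid)
        recurse(mid, hi)

    recurse(0, len(segments))
    return angles

# Three segments yield two boundaries, hence two representations.
example = boundary_energy_angles([[1.0, 0.0], [0.0, 1.0], [2.0, 0.0]])
print(sorted(example))
```

With this choice an angle of pi/4 means equal energy on both sides of the boundary, and N_SEG segments always produce N_SEG - 1 representations.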

2. The method of claim 1, wherein N_SEG is the smallest integer number by which each the input vector segment fulfils constraints associated with a quantizer for the encoding.
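One plausible reading of claims 1 and 2 is that N_SEG is the bit budget divided by the per-segment bit limit, rounded up. The sketch below assumes that reading; the function names are hypothetical, and the near-equal split of coefficients (with any extra coefficients placed in the first segments) is an assumption, since the claims do not state how coefficients are divided among segments.

```python
def num_segments(total_bits, max_bits_per_segment):
    """Smallest segment count whose average bits per segment fits the limit.

    ASSUMPTION: N_SEG = ceil(total_bits / max_bits_per_segment), minimum 1.
    """
    if max_bits_per_segment <= 0:
        raise ValueError("max_bits_per_segment must be positive")
    # Integer ceiling division without floating point.
    return max(1, -(-total_bits // max_bits_per_segment))

def split_even(vector, n_seg):
    """Cut a vector into n_seg contiguous, near-equal segments.

    ASSUMPTION: leftover coefficients go to the first segments.
    """
    base, extra = divmod(len(vector), n_seg)
    segments, start = [], 0
    for i in range(n_seg):
        size = base + (1 if i < extra else 0)
        segments.append(vector[start:start + size])
        start += size
    return segments

n_seg = num_segments(total_bits=100, max_bits_per_segment=32)
print(n_seg, split_even(list(range(10)), n_seg))
```

Note that an average within the limit does not by itself guarantee that every segment stays within the limit once bits are redistributed by energy, so a real encoder would presumably need an additional check.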

3. The method of claim 1, wherein if the upper level input vector has to be divided into non-equally sized lower level input vectors, selecting the segment boundary as the boundary closest to the center of the upper level input vector giving a larger last lower level input vector than first lower level input vector.
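The boundary rule of claim 3 can be illustrated as follows. This sketch assumes one interpretation of the claim: pick the segment boundary that minimizes the size difference between the two parts, and when two boundaries are equally close to the center, prefer the one that leaves the last (right) part larger. The function name `choose_boundary` is hypothetical.

```python
def choose_boundary(seg_lengths):
    """Return the index k of the chosen boundary (between segments k-1 and k).

    ASSUMPTION: "closest to the center" is measured in coefficients, and ties
    are broken in favour of a larger last (right) part.
    """
    if len(seg_lengths) < 2:
        raise ValueError("need at least two segments to split")
    total = sum(seg_lengths)
    best_k, best_key = None, None
    left = 0
    for k in range(1, len(seg_lengths)):
        left += seg_lengths[k - 1]
        right = total - left
        # Primary key: imbalance; secondary key: 0 if the right part is larger.
        key = (abs(right - left), 0 if right >= left else 1)
        if best_key is None or key < best_key:
            best_k, best_key = k, key
    return best_k

# Three equal segments cannot be halved; the right part gets two of them.
print(choose_boundary([4, 4, 4]))
```

For an even number of equal segments the rule reduces to a plain midpoint split.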

4. The method of claim 1, wherein the step of allocating bits is performed in connection to the step of determining.

5. The method of claim 1, wherein the step of allocating bits for encoding of each the input vector segments performed in connection to the step d) calculating a representation of a relative energy difference.

6. The method of claim 5, wherein the step of allocating bits allocates bits for the lower level input vectors in dependence of a ratio between lengths of the lower level input vectors and a ratio between the energies in the lower level input vectors.
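Claim 6 states that the bit split depends on a length ratio and an energy ratio but gives no formula. The sketch below substitutes a textbook high-rate heuristic (the bits-per-coefficient difference equals half the base-2 logarithm of the per-coefficient energy ratio); it is an assumption chosen for illustration, not the rule used in the patent, and `split_bits` and `eps` are hypothetical names.

```python
import math

def split_bits(total_bits, len_left, len_right, e_left, e_right, eps=1e-12):
    """Divide total_bits between two lower level vectors.

    ASSUMPTION: rate difference per coefficient
        d = 0.5 * log2((e_left / e_right) * (len_right / len_left)),
    solved together with bits_left + bits_right == total_bits.
    eps guards against division by zero for silent parts.
    """
    n = len_left + len_right
    ratio = ((e_left + eps) / (e_right + eps)) * (len_right / len_left)
    d = 0.5 * math.log2(ratio)
    bits_left = len_left * (total_bits + len_right * d) / n
    # Clamp to the available budget and round to whole bits.
    bits_left = int(round(min(max(bits_left, 0), total_bits)))
    return bits_left, total_bits - bits_left

# Equal lengths: the part with more energy receives more bits.
print(split_bits(40, 8, 8, 4.0, 1.0))
```

Applied at each level of the recursion, such a split would distribute the total budget down to the individual segments while always conserving the bit total.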

8. An audio encoder for positional encoding, the audio encoding comprising: an input unit configured to receive an input vector representing an audio signal; a partitioning unit configured to partition input vectors of coefficients originating from the audio signal for positional encoding of shapes of the input vectors; a vector quantizer configured to vector quantize segments of an input vector individually, wherein a maximum number of bits allowed for quantizing a vector segment is constrained by the vector quantizer; and an output unit for an encoded signal, wherein the partitioning unit is configured to segment the input vector into an integer number (N_SEG) of input vector segments according to a ratio between a total bit-budget for quantizing the input vector and the maximum number of bits allowed, the partitioning unit is configured to determine a representation of a respective relative energy difference between parts of the input vector on each side of each boundary between the input vector segments by performing a process that includes: a) setting the input vector as an upper level input vector; b) splitting the upper level input vector into left and right parts, each part comprising one or more input vector segments, wherein the upper level input vector is split at the segment boundary between the left and the right part into two lower level input vectors; c) calculating a representation of a relative energy difference between the two lower level input vectors according to an energy ratio between the lower level input vectors; and d) repeating the steps b) and c) of splitting and calculating by re-setting the lower level input vectors as a respective upper level input vector, until all boundaries between input vector segments are provided with an associated representation of a relative energy difference, the partitioning unit is configured to allocate bits for encoding the shape of each of the input vector segment and for encoding of the representations of the relative energy differences between the input vector segments, wherein bits for encoding the input vector segments are distributed between segments according to relative energy differences between parts of the input vector, and the partitioning unit is configured to provide each the input vector segment, the representations of the relative energy differences and allocation information to the quantizer for individual encoding of the input vector segments.

9. The audio encoder of claim 8, wherein N_SEG is the smallest integer number by which each the input vector segment fulfils constraints associated with a quantizer for the encoding.

10. The audio encoder of claim 8, wherein the partitioning unit is configured to, if the upper level input vector has to be divided into non-equally sized lower level input vectors, select the segment boundary as the boundary closest to the center of the upper level input vector giving a larger last lower level input vector than first lower level input vector.

11. The audio encoder of claim 8, wherein the partitioning unit is configured to perform the allocating of bits in connection to the determining, in a recursive manner, of a representation of a respective relative energy difference.

12. The audio encoder of claim 8, wherein the partitioning unit is configured to allocate bits for encoding of each the input vector segments performed concurrently to the d) calculating of a representation of a relative energy difference.

13. The audio encoder of claim 12, wherein the partitioning unit is configured to perform the allocating of bits by allocating bits for the lower level input vectors in dependence of a ratio between lengths of the lower level input vectors and a ratio between the energies in the lower level input vectors.

15. A computer program product comprising a non-transitory computer readable medium storing a computer program comprising instructions, which when executed by at least one processor, cause the at least one processor to perform the method of claim 1.

16. An audio encoder for positional encoding, the audio encoding comprising: a memory; and processing circuitry coupled to the memory, wherein the audio encoder is configured to: segment an input vector representing an audio signal into an integer number (N_SEG) of input vector segments according to a ratio between a total bit-budget for quantizing the input vector and a maximum number of bits allowed for quantizing a vector segment; determine a representation of a respective relative energy difference between parts of the input vector on each side of each boundary between the input vector segments by performing a process that includes: a) setting the input vector as an upper level input vector; b) splitting the upper level input vector into left and right parts, each part comprising one or more input vector segments, wherein the upper level input vector is split at the segment boundary between the left and the right part into two lower level input vectors; c) calculating a representation of a relative energy difference between the two lower level input vectors according to an energy ratio between the lower level input vectors; and d) repeating the steps b) and c) of splitting and calculating by re-setting the lower level input vectors as a respective upper level input vector, until all boundaries between input vector segments are provided with an associated representation of a relative energy difference, allocate bits for encoding the shape of each of the input vector segment and for encoding of the representations of the relative energy differences between the input vector segments, wherein bits for encoding the input vector segments are distributed between segments according to relative energy differences between parts of the input vector, and provide each the input vector segment, the representations of the relative energy differences and allocation information to a quantizer for individual encoding of the input vector segments.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention.

G06F G10L H04N

Patent Metadata

Filing Date

June 5, 2020

Publication Date

February 1, 2022
