Disclosed herein includes a system, a method, and a device for receiving input data to generate a plurality of outputs for a layer of a neural network. The plurality of outputs are arranged in a first array. Dimensions of the first array may be compared with dimensions of a processing unit (PE) array including a plurality of PEs. According to a result of the comparing, the first array is partitioned into subarrays by the processor. Each of the subarrays has dimensions less than or equal to the dimensions of the PE array. A first group of PEs in the PE array is assigned to a first one of the subarrays. A corresponding output of the plurality of outputs is generated using a portion of the input data by each PE of the first group of PEs assigned to the first one of the subarrays.
Legal claims defining the scope of protection, as filed with the USPTO.
6. The method according to claim 1, wherein the plurality of outputs are outputs of convolution operations for the layer of the neural network.
15. The device according to claim 10, wherein the plurality of outputs are outputs of convolution operations for the layer of a neural network.
16. The device according to claim 10, wherein said each PE is configured to output the first plurality of dot products as outputs of convolution operations for the layer of the neural network.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
July 15, 2019
June 13, 2023
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.