A DNN hardware accelerator and an operation method of the DNN hardware accelerator are provided. The DNN hardware accelerator includes: a network distributor for receiving an input data and distributing respective bandwidth of a plurality of data types of a target data amount based on a plurality of bandwidth ratios of the target data amount; and a processing element array coupled to the network distributor, for communicating data of the data types of the target data amount between the network distributor based on the distributed bandwidth of the data types.
Legal claims defining the scope of protection, as filed with the USPTO.
2. The DNN hardware accelerator according to claim 1, wherein the bandwidth proportions of the target data size are obtained from dynamically analyzing the target data size by a microprocessor and sent to the network distributor.
3. The DNN hardware accelerator according to claim 1, wherein the bandwidth proportions of the target data size are associated with individual weights of the data types of the target data size.
6. The DNN hardware accelerator according to claim 1, wherein the input data received by the network distributor is from a buffer or from a memory connected through a system bus.
7. The DNN hardware accelerator according to claim 1, wherein when the target data size comprises a plurality of data layers, the data layers have different bandwidth proportions, and the network distributor allocates different individual bandwidths for the data layers.
8. The DNN hardware accelerator according to claim 1, wherein when the target data size comprises a plurality of data layers, the data layers have the same bandwidth proportions, and the network distributor allocates the same individual bandwidths for the data layers.
10. The operation method according to claim 9, wherein the step of analyzing, the step of configuring the bandwidth proportions and the step of determining are dynamically performed by a microprocessor.
11. The operation method according to claim 9, wherein the step of analyzing, the step of configuring the bandwidth proportions and the step of determining are achieved in an offline manner.
13. The operation method according to claim 9, wherein when the target data size comprises a plurality of data layers, different bandwidth proportions are configured for the data layers.
14. The operation method according to claim 9, wherein when the target data size comprises a plurality of data layers, the same bandwidth proportions are configured for the data layers.
17. The DNN hardware accelerator according to claim 15, wherein when the target data size comprises a plurality of data layers, the bandwidth and utilization analysis unit configures different individual transmission bandwidths for the data layers.
18. The DNN hardware accelerator according to claim 15, wherein when the target data size comprises a plurality of data layers, the bandwidth and utilization analysis unit configures the same individual transmission bandwidths for the data layers.
20. The operation method according to claim 19, wherein the bandwidth proportions of the target data size are obtained by dynamically analyzing the target data size and are sent to the network distributor.
21. The operation method according to claim 19, wherein the bandwidth proportions of the target data size are associated with individual weights of the data types of the target data size.
24. The operation method according to claim 19, wherein the input data received by the network distributor is from a buffer or from a memory connected through a system bus.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
January 15, 2019
January 10, 2023
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.