Apparatus and method for multi-phase pruning for neural network with multi-sparsity levels

PublishedMay 28, 2024

Assigneenot available in USPTO data we have

Inventorsnot available in USPTO data we have

Technical Abstract

Disclosed are an apparatus and a method of multi-phase pruning a neural network with multi-sparsity levels and an SIMD-based neural network pruning method, and the SIMD-based neural network pruning method according to an exemplary embodiment of the present disclosure includes GEMM-transforming an internode weight kernel applied to a layer in a neural network; and pruning the GEMM-transformed weight kernel with a predetermined SIMD width as a unit.

Patent Claims

2 claims

Legal claims defining the scope of protection, as filed with the USPTO.

5. The neural network multi-phase pruning method according to claim 4, wherein in the performing of coarse-grain pruning, at least some continuous regions of an original weight kernel which is not GEMNI-transformed are removed from the original weight kernel.

9. The multi-phase pruning apparatus according to claim 8, wherein the processor is further configured to remove at least some continuous regions of an original weight kernel which is not GEMM-transformed from the original weight kernel.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

G06N G06F

Patent Metadata

Filing Date

November 18, 2020

Publication Date

May 28, 2024

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search