Fetching non-zero data

PublishedJune 11, 2024

Assigneenot available in USPTO data we have

Inventorsnot available in USPTO data we have

Technical Abstract

Embodiments of the present disclosure include techniques storing and retrieving data. In one embodiment, sub-matrices of data are stored as row slices and column slices. A fetch circuit determines if particular slices of one sub-matrix, when combined with corresponding slices of another sub-matrix, produce a zero result and need not be retrieved. In another embodiment, the present disclosure includes a memory circuit comprising memory banks and sub-banks. The sub-banks store slices of sub-matrices. A request moves between serially configured memory banks and slices in different sub-banks may be retrieved at the same time.

Patent Claims

17 claims

Legal claims defining the scope of protection, as filed with the USPTO.

2. The circuit of claim 1, wherein the fetch circuit determines row slices of the first sub-matrix of data that produce a non-zero result when multiplied by a plurality of corresponding column slices of the second sub-matrix of data while row slices of a third sub-matrix of data that produce a non-zero result when multiplied by a plurality of corresponding column slices of the fourth sub-matrix of data are being retrieved.

3. The circuit of claim 2, wherein the first sub-matrix of data and the third sub-matrix of data are from a first matrix of data, and wherein the second sub-matrix of data and the fourth sub-matrix of data are from a second matrix of data.

4. The circuit of claim 1, wherein the at least one memory circuit stores a first mask corresponding to the first sub-matrix, wherein the first mask specifies row slices having at least one non-zero value.

5. The circuit of claim 4, wherein the fetch circuit eliminates row slices having all zero values from being retrieved based on the first mask.

6. The circuit of claim 1, wherein the fetch circuit analyzes the first sub-matrix in said at least one memory to determine the row slices that produce non-zero results.

7. The circuit of claim 6, wherein the fetch circuit determines, for a plurality of row slices, whether a particular row slice produces a zero or non-zero result when multiplied by a plurality of corresponding column slices.

8. The circuit of claim 6, wherein the fetch circuit receives a bit mask comprising 1 bit per row slice of the first sub-matrix, wherein particular row slices of the first sub-matrix having a first bit mask value indicating the particular row slices comprise all zeros are eliminated from the determined row slices to be retrieved.

9. The circuit of claim 8, further wherein, after eliminating row slices comprising all zeros, the fetch circuit determines first row slices of the first sub-matrix of data that produce a zero result when multiplied by a plurality of corresponding column slices of the second sub-matrix of data to eliminate the first row slices from the determined row slices to be retrieved.

10. The circuit of claim 8, further wherein the fetch circuit logically ANDs values of remaining non-all zero row slices of the first sub-matrix with corresponding values of the column slices of the second sub-matrix to produce a plurality of results and logically ORs the plurality of results to eliminate a plurality of non-all zero row slices producing a zero result from the determined row slices to be retrieved.

11. The circuit of claim 1, wherein the first sub-matrix is stored in row major order and the second sub-matrix is stored in column major order.

12. The circuit of claim 1, wherein the first sub-matrix is a portion of a first matrix stored in row major order and the second sub-matrix is a portion of a second matrix stored in column major order.

13. The circuit of claim 1, wherein the fetch circuit generates at least one data structure specifying said row slices that produce a non-zero result when multiplied by corresponding column slices.

14. The circuit of claim 13, wherein the fetch circuit generates a first data structure specifying addresses of a plurality of sub-matrices and a mask specifying the location of said row slices that produce a non-zero result when multiplied by corresponding column slices across within the plurality of sub-matrices.

15. The circuit of claim 14, wherein the fetch circuit generates a second data structure storing retrieved row slices and a mask specifying the location of said row slices within the plurality of sub-matrices.

16. The circuit of claim 1, wherein the at least one memory circuit is a static random access memory.

17. The circuit of claim 1, wherein the determined row slices retrieved from the at least one memory circuit are loaded into a multiplier circuit.

18. The circuit of claim 1, wherein the fetch circuit determines column slices of the second sub-matrix of data that produce a non-zero result when multiplied by a plurality of corresponding row slices of the first sub-matrix of data, and the determined column slices are retrieved from the at least one memory circuit.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

G06F

Patent Metadata

Filing Date

April 26, 2022

Publication Date

June 11, 2024

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search