Designed a Generic NxN systolic array consists of a grid of N2 processing elements (PEs), arranged in N rows and N columns. Each PE performs multiply-accumulate operations for corresponding elements of the input matrices, which is widely used in AI Accelerators -
View it on GitHub