High-performance computational framework combining float32/64 arithmetic for optimal speed-accuracy trade-offs. Features V/W-cycle algorithms, communication-avoiding optimizations, CUDA acceleration, and comprehensive validation on Poisson/heat equations with proven convergence bounds. -
View it on GitHub