COOLJAPAN

Posts tagged #nvidia

1 post

Jul 1, 2026 · 12 min

OxiCUDA 0.4.0 Released — On-Device GPU Validation Catches What CPU Parity Tests Never Could

OxiCUDA 0.4.0 is an on-device validation pass: for the first time, hand-written PTX across 60+ crates was JIT-compiled and run on real NVIDIA hardware (RTX A4000, sm_86, CUDA 12.4) rather than only checked for CPU-logic parity. The pass caught register-shadowing bugs, base-2/base-e math errors, invalid PTX, and literal stub kernels. Current totals: 38,093 tests passing, ~1.27M SLoC, 73 crates.

#release #oxicuda #cuda