COOLJAPAN

Posts tagged #speech-synthesis

1 posts

Mar 26, 2026 · 4 min

VoiRS 0.1.0 Release Candidate 1 — Pure Rust Neural TTS, Voice Recognition & Sound Framework

Production-grade pure Rust Text-to-Speech (TTS), Voice Recognition, and Sound framework. VITS + HiFi-GAN/DiffWave vocoders, real-time ≤0.05× RTF on GPU, streaming synthesis, SSML, 20+ languages, ONNX/Kokoro-82M support, SafeTensors checkpoints. Full integration with SciRS2/NumRS2. WASM, GPU (CUDA/Metal), Python/FFI bindings. The sovereign speech AI layer for the entire COOLJAPAN ecosystem (now 21M+ SLoC total).

releasevoirstts