In brief
- StepAudio 2.5 Realtime is an end-to-end real-time speech model with fully customizable personas in Chinese and English.
- StepFun claims first place across all five voice AI benchmarks tested in April 2026, beating GPT Realtime 1.5 and Gemini Live.
- The model was trained on a million-scale…