Alibaba AI voice model cracks top 5 globally, outperforming US rivals in regional accents
Alibaba's new AI voice model, Fun-Realtime-TTS-Preview, developed by Tongyi Lab, has achieved fifth place globally on the Artificial Analysis Speech Arena leaderboard with a score of 1,190. This marks the first time a Chinese-engineered voice system has entered the top five, outperforming Western competitors like OpenAI and xAI.

Briefing Summary
Fun-Realtime-TTS-Preview is particularly strong at rendering complex Chinese dialects and regional accents: it supports more than 30 languages, seven major Chinese dialects, and more than 20 regional accents. The ranking highlights Alibaba's technical progress in voice AI, especially in handling diverse linguistic variation.
Article analysis
Key claims
Fun-Realtime-TTS-Preview is the only Chinese-engineered voice system in the global top five.
The model supports more than 30 languages, seven major Chinese dialects, and over 20 regional accents.
The Alibaba AI voice model outperformed US rivals OpenAI and xAI on a major global benchmark.
Alibaba's AI voice model, Fun-Realtime-TTS-Preview, secured fifth place on the Artificial Analysis Speech Arena leaderboard with a score of 1,190.