Pushing the Boundaries of Large Speech Models : Dataocean AI’s Thousand-Person Multilingual Speech Synthesis Datasets
Recently, a text-to-speech project named ChatTTS goes viral and has garnered 28k stars on GitHub. Designed specifically for dialogue scenarios, this speech generation model supports both English and Chinese languages. It has been optimized for conversational tasks, achieving natural and fluent speech synthesis. Highlights of ChatTTS Dialog-based TTS: ChatTTS is optimized for dialog-based tasks, achieving […]
Dataocean AI New Datasets – July
Dataocean AI has launched new high quality datasets including minor language smart voice dataset, telephoto landscape image dataset, and multi-skin tone cabin video dataset. These resources aim to help enterprises develop more extensive and higher-quality large models and AI applications to meet the diverse needs of global users. Arabic Speech Recognition Dataset Product Features: Arabic, […]
Open Datasets: GigaSpeech 2 – 30,000 Hours of Southeast Asian Multilingual Speech Recognition Open Source Dataset Released
The term “Giga” originates from “gigantic,” reflecting the vast audio resources available on the internet. However, the quality of these audio resources varies significantly, and high-quality audio-text pairs are particularly scarce and expensive to annotate, especially for low-resource languages. GigaSpeech, a highly successful open-source English dataset, addresses this issue by providing thousands of hours of […]
Unlocking the Emotional Data Behind GPT-4o
GPT-4o can already be considered an emotionally rich and human-like intelligent voice assistant, or more accurately, a “new species” that is increasingly approaching human interaction. This powerful model also has the ability to understand and synthesize text, images, videos, and voice, and can even be seen as an unfinished version of GPT-5. Click here […]
Key Data for Humanlike Text-to-Speech Systems
As numerous tech companies race to enhance the multimodal capabilities of large models and strive to integrate functions like text summarization and image editing into mobile devices, OpenAI has launched a new product! CEO Samuel Harris Altman expressed his state with three letters: her (just like the movie “Her”). In the early morning of May […]
Dataocean AI New Datasets – May
In the field of artificial intelligence, the technology of large models is continuously driving innovation and development across various industries. Dataocean AI has introduced new multilingual, multi-emotional, and multi-scenario intelligent voice data, as well as image data with Chinese element styles, to help companies develop more diverse and high-quality models and products to meet the […]