Open Datasets: GigaSpeech 2 – 30,000 Hours of Southeast Asian Multilingual Speech Recognition Open Source Dataset Released

The term “Giga” originates from “gigantic,” reflecting the vast audio resources available on the internet. However, the quality of these audio resources varies significantly, and high-quality audio-text pairs are particularly scarce and expensive to annotate, especially for low-resource languages.  GigaSpeech, a highly successful open-source English dataset, addresses this issue by providing thousands of hours of […]

Dataocean AI New Datasets – September

We are excited to announce our NEW arrivals, including Accented English Speech Recognition Corpus from 60+ Countries, Accented Spanish Speech Recognition Corpus from 8 Countries, India Multilingual Speech Recognition Corpus, Spontaneous Dialogue Speech Synthesis Corpus, Multi-Style, Multi-Tone Speech Synthesis Corpus. These high-quality AI training datasets will help enhance model performance, allowing AI products to meet […]

Chinese Continuous Visual Speech Recognition Challenge Workshop 2024 Has Concluded Successfully

On the morning of August 16th, the Chinese Continuous Visual Speech Recognition Challenge Workshop 2024 (CNVSRC Workshop 2024) was held at the 19th National Conference on Man-Machine Speech Communication (NCMMSC 2024) in Urumqi,China. The workshop includes CNVSRC 2024 introduction, address, rank announcement, technical report and system description sharing. The workshop is a forum to exchange ideas regarding Chinese large […]

How AthleteGPT is Perfectly Prepared for the Paris Olympics: The Technology Behind the Success

At this year’s Paris Olympics, approximately 10,000 athletes from over 200 countries and regions will compete with determination for the Olympic spirit and their dreams. “AthleteGPT,” a powerful intelligent voice assistant, will assist athletes and staff from different countries and regions, enabling smoother communication and interaction throughout this summer event. How do I get to […]

Pushing the Boundaries of Large Speech Models : Dataocean AI’s Thousand-Person Multilingual Speech Synthesis Datasets

Recently, a text-to-speech project named ChatTTS  goes viral  and has garnered 28k stars on GitHub. Designed specifically for dialogue scenarios, this speech generation model supports both English and Chinese languages. It has been optimized for conversational tasks, achieving natural and fluent speech synthesis. Highlights of ChatTTS Dialog-based TTS: ChatTTS is optimized for dialog-based tasks, achieving […]

Dataocean AI New Datasets – July

Dataocean AI has launched new high quality datasets including minor language smart voice dataset, telephoto landscape image dataset, and multi-skin tone cabin video dataset. These resources aim to help enterprises develop more extensive and higher-quality large models and AI applications to meet the diverse needs of global users.   Arabic Speech Recognition Dataset Product Features: Arabic, […]