Open Datasets: GigaSpeech 2 – 30,000 Hours of Southeast Asian Multilingual Speech Recognition Open Source Dataset Released

The term “Giga” originates from “gigantic,” reflecting the vast audio resources available on the internet. However, the quality of these audio resources varies significantly, and high-quality audio-text pairs are particularly scarce and expensive to annotate, especially for low-resource languages.  GigaSpeech, a highly successful open-source English dataset, addresses this issue by providing thousands of hours of […]

Dataocean AI New Datasets – December

The new dataset from Dataocean AI for December is here! This release includes datasets in speech recognition, speech synthesis, multimodal learning, and more, designed to support the training of multimodal large models. Developers can easily overcome data bottlenecks and efficiently improve model performance.   Professional Scenario Text-Image Pair Dataset 🔥 Product Features: Includes images taken […]

Dataocean AI: An Expert in Content Moderation for a Safe and Reliable Network Environment

With the popularization and development of technology, fake information and illegal content are more easily disseminated, and harmful content has become more complex and subtle, while the demand for content safety is more urgent than ever. A comprehensive compliance solution includes reviewing various content modalities such as text, images, audio, and video, and identifying deceptive […]

Dataocean AI New Datasets – September

We are excited to announce our NEW arrivals, including Accented English Speech Recognition Corpus from 60+ Countries, Accented Spanish Speech Recognition Corpus from 8 Countries, India Multilingual Speech Recognition Corpus, Spontaneous Dialogue Speech Synthesis Corpus, Multi-Style, Multi-Tone Speech Synthesis Corpus. These high-quality AI training datasets will help enhance model performance, allowing AI products to meet […]

Chinese Continuous Visual Speech Recognition Challenge Workshop 2024 Has Concluded Successfully

On the morning of August 16th, the Chinese Continuous Visual Speech Recognition Challenge Workshop 2024 (CNVSRC Workshop 2024) was held at the 19th National Conference on Man-Machine Speech Communication (NCMMSC 2024) in Urumqi,China. The workshop includes CNVSRC 2024 introduction, address, rank announcement, technical report and system description sharing. The workshop is a forum to exchange ideas regarding Chinese large […]

How AthleteGPT is Perfectly Prepared for the Paris Olympics: The Technology Behind the Success

At this year’s Paris Olympics, approximately 10,000 athletes from over 200 countries and regions will compete with determination for the Olympic spirit and their dreams. “AthleteGPT,” a powerful intelligent voice assistant, will assist athletes and staff from different countries and regions, enabling smoother communication and interaction throughout this summer event. How do I get to […]