Skip to content
  • Datasets
  • Services
  • Industry Solutions
  • Company
  • Shopping Cart
  • Contact Us
Speech
icon-asr
ASR
icon-tts
TTS
Text
icon-nlp
NLP
icon-lexicon
Lexicon
icon-machine-translation
Machine Translation
Image
icon-cv
CV
icon-ocr
OCR
Multimodal
icon-multimodal
Multimodal
View all datasets
Data services
  • Large Language Model
  • Multimodal
  • Automatic Speech Recognition
  • Text to Speech
data services
  • Natural Language Processing
  • Computer Vision
  • Autonomous Driving
  • Lexicon
Model services
  • Model Training and Evaluation
  • DOTS Platform
View all services
  • Agentic AI
  • Autonomous Driving
  • Smart Home
  • Smart Finance
  • LLMs
  • Retail
  • Internet
  • Smart Healthcare
View all Solutions
  • About Us
  • Resources
Contact Us

归档

Enhancing Voice Assistant Intelligence through LLM

“Hi Siri, what do you think of the iPhone 15?&#82 […]

Data Cleaning – Warm-up Before Training Large Language Models

In the competition of AI large language models, dataset […]

ChatGPT Goes Multimodal: Excelling in Audio, Text, and Image Interpretation

Recently, OpenAI released a multimodal voice and image […]

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels

Audio-visual speech recognition has received a lot of a […]

Chinese Continuous Visual Speech Recognition Challenge 2023

Visual speech recognition, also known as lip reading, i […]

SeamLessM4T: A Multi-Modal Model Beyond the Constraints of LLM

On August 23, Meta released a new large speech recognit […]

← older
newer →
datasets
Speech
  • ASR
  • TTS
  • ASR
  • TTS
Image
  • CV
  • OCR
  • CV
  • OCR
Text
  • NLP
  • Lexicon
  • Machine Translation
  • NLP
  • Lexicon
  • Machine Translation
Multimodal
  • Multimodal
  • Multimodal
Services
Data services
  • Large Language Model
  • Multimodal
  • Automatic Speech Recognition
  • Text to speech
  • Natural Language Processing
  • Computer Vision
  • Autonomous Driving
  • Lexicon
  • Large Language Model
  • Multimodal
  • Automatic Speech Recognition
  • Text to speech
  • Natural Language Processing
  • Computer Vision
  • Autonomous Driving
  • Lexicon
Model services
  • Model Training and Evaluation
  • DOTS Platform
  • Model Training and Evaluation
  • DOTS Platform
Industry Solutions
  • Autonomous Driving
  • Smart Home
  • AR/VR
  • Intelligent Finance
  • Retail
  • Internet
  • Intelligent Health Care
  • LLMs
  • Autonomous Driving
  • Smart Home
  • AR/VR
  • Intelligent Finance
  • Retail
  • Internet
  • Intelligent Health Care
  • LLMs
Company
  • About Us
  • Resources
  • Contact Us
  • About Us
  • Resources
  • Contact Us
Datasets
Speech
  • ASR
  • TTS
  • ASR
  • TTS
Image
  • CV
  • OCR
  • CV
  • OCR
Text
  • NLP
  • Lexicon
  • Machine Translation
  • NLP
  • Lexicon
  • Machine Translation
Multimodal
  • Multimodal
  • Multimodal
Services
Data services
  • Large Language Model
  • Multimodal
  • Automatic Speech Recognition
  • Text to speech
  • Natural Language Processing
  • Computer Vision
  • Autonomous Driving
  • Lexicon
  • Large Language Model
  • Multimodal
  • Automatic Speech Recognition
  • Text to speech
  • Natural Language Processing
  • Computer Vision
  • Autonomous Driving
  • Lexicon
Model services
  • Model Training and Evaluation
  • DOTS Platform
  • Model Training and Evaluation
  • DOTS Platform
Industry Solutions
  • Autonomous Driving
  • Smart Home
  • AR/VR
  • Intelligent Finance
  • Retail
  • Internet
  • Intelligent Health Care
  • LLMs
  • Autonomous Driving
  • Smart Home
  • AR/VR
  • Intelligent Finance
  • Retail
  • Internet
  • Intelligent Health Care
  • LLMs
Company
  • About Us
  • Resources
  • Contact Us
  • About Us
  • Resources
  • Contact Us
© 2025 DATAOCEAN AI. All rights reserved.
  • Privacy Policy
  • Terms of Service
  • Cookie Policy
  • Cookie Preferences
  • Privacy Policy
  • Terms of Service
  • Cookie Policy
  • Cookie Preferences
X
Welcome to
Welcome to Dataocean AI!
×