Chinese-English Short Message Translation Corpus

This entry describes the contents of the Chinese-English machine translation corpus of short messages.
English, Chinese
The corpus contains 77,000 Chinese short messages and their matching English translations.
Data format
Data content
Chinese-English Parallel Corpus dataset
Accuracy Rate
The accuracy of the labeling results is 95%.

Chinese-English-Korean-Japanese Parallel Corpus
This dataset contains 200,027 high-frequency sentence pairs covering rich linguistic phenomena, intended mainly for machine translation research and development across Chinese, English, Korean, and Japanese.
Chinese-English-Japanese-Korean Parallel Corpus

