Name: Modern Standard Arabic Pronunciation+POS+Vowel Lexicon - DataoceanAI
SKU: King-Lexicon-036
Availability: InStock

Lexicon

Modern Standard Arabic Pronunciation+POS+Vowel Lexicon

Speech recognition speech synthesis Arabic

This Modern Standard Arabic Pronunciation+POS+Vowel Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Modern Standard Arabic language as spoken in Arab World. With 59,837 meticulously crafted entries and an impressive 97.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular SAMPA phonemic system. Additionally, the lexicon is POS annotated for each entry, providing added value to the dataset. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

Specifications:

ID:

King-Lexicon-036

Language:

Arabic

COUNTRY

Arab World

SIZE

59,837 entries

Format

TSV

CONTENT

word form, diacriticized form, phonemic transcription and POS annotation for each entry

PHONEME SET

ar-msa_sampa

Accuracy Rate

The accuracy of the labeling results is 97%

People also searched for

Korean Stem and Suffix Lexicon

This Korean Stem and Suffix Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Stem and Suffix in Korean. With 208,581 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate Korean Stem and Suffix. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

Spain Basque Pronunciation Lexicon

This Spain Basque Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Basque language as spoken in Spain. With 50,023 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular XSAMPA phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

Chinese Changshahua Pronunciation Lexicon

This Chinese Changshahua Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Changshahua as spoken in Changsha City, Hunan Province of China. With 41,736 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular PINYIN phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.

Chinese Tibetan Pronunciation Lexicon (lhasa)

This Chinese Tibetan Pronunciation Lexicon, curated by DataoceanAI Inc., offers a wealth of linguistic resources tailored specifically for the Tibetan language as spoken in Lhasa, China. With 106,811 meticulously crafted entries and an impressive 95.00% entry accuracy rate, this lexicon provides accurate pronunciation transcription in the popular PINYIN phonemic system. It serves as indispensable training data for speech recognition, speech synthesis, and other language processing applications.