Speech Dataset

Our Advantages
REAL SCENARIO
REAL SCENARIO
We only provide real world and truth ground data.
HIGH VALUE
HIGH VALUE
Algorithm research and data design make our data more valuable for machine.
DIVERSITY
DIVERSITY
We use different devices in different situations to get data from all over the world.
EXCEPTIONAL QUALITY
EXCEPTIONAL QUALITY
After 3 rounds of quality inspection, the accuracy is over 95%.
SAFETY COMPLIANCE
SAFETY COMPLIANCE
Our professional legal counsel delivered norms to comply with the laws of every single country.
AFTER SALE SUPPORT
AFTER SALE SUPPORT
Once you become our client, you will be guaranteed at least one year.
Speech Recognition Dataset
Mandarin-China Children Speech Dataset
Mandarin-China
1,105 Hours
10,060 Speaker Number
view detail
Chinese-Mandarin-LiveStream Speech Datasets
Natural Language
5079 Hours
Scene: Live
view detail
English-US Call Center Speech Dataset
Scene: Live
Age: >16 years old
287 Hours
view detail
Chinese-Mandarin-English Speech Dataset Co-Switch
Reading
8477 Participants
4089 Hours
view detail
English-US Speech Dataset
Reading
1935 Speakers
865 Hours
view detail
Arabic-Saudi Arabic Speech Dataset-2
Reading
131 Speakers
82 Hours
view detail
French-Algeria Speech Datasets
Reading
330 Speakers
211 Hours
view detail
English-China Children Speech Dataset
English-China
217 Hours
1,000 Speaker Number
view detail
Chinese-Mandarin Speech Datasets-Children
Reading
10060 Speaker
1105.2 Hours
view detail
Russian Speech Data -Russia
Speech Style : Reading
Speakers : 1932
Speech Hours : 1208
view detail
Malaysia Speech Data- 10000 hours
Speech Style: Reading
Age >16 years old
Speech Hours: 10000 hours
view detail
Arabic Speech Data- 10000 hours
Speech Style: Reading
Age >16 years old
Speech Hours: 10000 hours
view detail
Portuguese Speech Data- 10000 hours
Speech Style: Reading
Age >16 years old
Speech Hours: 10000 hours
view detail
Indonesian Speech Data- 10000 hours
Speech Style: Reading
Age >16 years old
10000 hours
view detail
French Speech Data-Algeria
Speech Style : Reading
Speakers : 330
Speech Hours : 211
view detail
Chinese Mandarin Multimodel Generic Speech Dataset
Speech Style : Multimodel Spech Data
Speakers : 500
Speech Hours : Each speaker shot 100 scripts,about 6-10minutes
view detail