+86 -18210599182
contact@dataoceanai.com
Speechocean: AI Data Resource and Data Service Provider
To date, Speechocean offers 34,000+ hours of off-the-shelf accented English speech recognition corpora, 100+ hours of speech synthesis corpora, and English pronunciation lexica for 3 accents, all available for licensing.
Please see the tables below:
Speech Recognition Corpus
Language | Speakers | Total Hours |
Chinese English | 8,000+ | 5,500+ |
American English | 7,000+ | 5,500+ |
European English | 2,000+ | 4,000+ |
Indian English | 2,000+ | 3,500+ |
British English | 1,500+ | 2,000+ |
Australian English | 1,000+ | 1,500+ |
Multi-regional American English | 1,200+ | 2,000+ |
Canadian English | 1,500+ | 1,000+ |
Japanese English | 1,000+ | 1,000+ |
Indonesian English | 800+ | 350+ |
Singapore English | 400+ | 700+ |
Hong Kong English | 400+ | 800+ |
Korean English | 100+ | 200+ |
Multilingual mixed English | 7,000+ | 6,500+ |
Speech Synthesis Corpus
Language | Hours |
British English | 35 |
American English | 90 |
Lexicon
Language | Entries |
British English | 200,000+ |
American English | 500,000+ |
Indian English | 100,000+ |
Speechocean is dedicated to providing engineering data products and services to enterprises and scientific research institutions across the entire AI industry chain. Our business spans domains including speech recognition, speech synthesis, computer vision, lexicons, and natural language processing, and we provide services covering data design, collection, transcription, and annotation.
If you have any further inquiries, please do not hesitate to contact us.
Email: marketing@speechocean.com