Recently, we extracted a 10-hour Chinese Mandarin speech recognition corpus for free use. It is a subset of King-ASR-009, which is one of the star products of speechocean and has been used for many AI products in the market. King-ASR-009 contents 159 hours recorded by 260 people in a quiet environment.
Information of free data
Language: Chinese Mandarin
Serial number: King-ASR-L-009
Speakers: 20 (10 males and 10 females)
Channels: 4 channels
Total utterance: 9600
Environment: quiet office
Transcription files: included
Purpose: academic usage only
Steps to get free data
Step 1: Click this link http://en.speechocean.com/register.html, use your email ID to register as a member and activate your account successfully.
Step 2: Search speechocean on LinkedIn and follow us.
Step 3: Send your email ID and your LinkedIn account to email@example.com. After we ensure you to meet the above two criteria, we will send you a FTP link of free data in 3 working days.
Oriental Language Recognition Challenge (OLR2020)
We will hold the fifth "Oriental Language Recognition Challenge (OLR2020)" and will let you know the details in recent days. Welcome to consult and attend this event!
Speechocean always devoted itself to providing specialized engineering data products and services to enterprises and scientific research institutions in the whole industry chain of AI. Our business involves various domains such as speech recognition, speech synthesis, computer vision, lexicon, and natural language processing and provides relevant services for the design, collection, transcription, annotation, etc. of data.
If you have any further inquiries, please do not hesitate to contact us.