
Multi-language NLP Databases Overview

2021.06.15

Speechocean: AI Data Resource and Data Service Provider


To date, Speechocean offers more than 6,000,000 sentences of off-the-shelf NLP data in Chinese, English, and Japanese, all available for licensing.


Please see the table below:


NLP Corpus

 

Language & Content                            Number of sentences
Chinese (Email/SMS/Chatting/Prosodic etc.)    4,000,000
Traditional Chinese (Email/SMS etc.)          1,500,000
US English (SMS)                              200,000
UK English (Names/POI)                        200,000
Hong Kong Cantonese                           200,000
Japanese (POI)                                200,000

 

Speechocean has always been devoted to providing data products and services to enterprises and scientific research institutions across the entire AI industry chain. Our business spans domains including speech recognition, speech synthesis, computer vision, lexicons, and natural language processing, and we provide data design, collection, transcription, and annotation services.


If you have any further inquiries, please do not hesitate to contact us.

Email: marketing@speechocean.com

