Industry recognized certification enables you to add this credential to your resume upon completion of all courses

Need Custom Training for Your Team?
Get Quote
Call Us

Toll Free (844) 397-3739

Inquire About This Course
Jacky Ma, Instructor - Chinese Natural Language Processing in Practice

Jacky Ma

Jacky has worked in the finance industry for over 10 years with experience including quantitative research and FinTech management. He also led a team of engineers to build a data science driven marketing platform which serves a number of international brands. Jacky holds BSc & M.Phil in Computer Science and Engineering from The Chinese University of Hong Kong. He is now moving to apply data science (and his empathy) in an area of humanity - matchmaking.

Instructor: Jacky Ma

Understand machine learning techniques that are specifically applied to Chinese language

  • Gain essential knowledge and practical skills to work with Chinese textual content. Understand machine learning techniques that are specifically applied to Chinese language.
  • Instructor: Data Scientist with substantive expertise in digital marketing and fintech. Led a team of engineers and built a data science driven marketing platform which serves a number of international brands. 

Duration: 1h 17m

Course Description

Text mining is one of the prospering areas in data science that allows data scientist to work with textual contents – however, some common practices around text mining, such as stopwords and stemming, are not applicable to Chinese texts due to the difference in language structures. On the other hand, a study from InternetWorld Stats showed that Chinese Language Internet users accounted for 23.2% of the World Internet users (as of December 31, 2013), which is the second largest group of users (native English users if the largest group at 28.6%). No doubt that the business world has a strong demand on text-mining skills for Chinese texts. It is important to provide knowledge and necessary tools to extend data scientist text-mining capacity to include Chinese text contents.

What am I going to get from this course?

  • Know the basics of Chinese text structures: characters, vocabulary types, sentences
  • Understand the computer representations of Chinese text encoding and convention: Unicode, GB, HZ, Big5
  • Understand the theory for Chinese text segmentation and applying Chinese segmentation using the Jieba library

Prerequisites and Target Audience

What will students need to know or do before starting this course?

  • Basic knowledge on Python development
  • Basic knowledge on text mining
  • Knowledge on machine learning and statistics
  • Interest in learning to apply their data science skills to Chinese text documents

Who should take this course? Who should not?

This course targets data scientists who is working on natural language processing
and would like to extend into textual contents in Chinese. Students are assumed
to have basic knowledge in Python and text mining. Knowledge in Chinese
language is not a must but having interest in it will make the course easier.


Module 1: Introduction of Basic Structures of Chinese

Lecture 1 Course Overview and Objectives
Lecture 2 Chinese Grammar
Lecture 3 Traditional and Simplified Chinese
Lecture 4 Jain-Fan Conversion
Lecture 5 Chinese Vocabulary
Lecture 6 Chinese Pinyin
Lecture 7 History of Chinese Characters

Module 2: Deep Dive into Text Segmentation

Lecture 8 Chinese Text Simulation
Lecture 9 Jieba Part-of-Speech Tagging
Lecture 10 Chinese NLP in Action
Lecture 11 Jieba Text Segmenation


7 Reviews

Sharon V

May, 2017

It is a nice course for those interested in working on natural language processing in Chinese, as it can help further career, as Chinese is the second largest used language. It can help in dealing with Chinese business opportunities. The course is organised in a good manner and made easy to understand even if you do not know the Chinese language. The instructor has command over the subject, which helped me to learn machine learning techniques in the Chinese. It really helped me understand the basics of Chinese text structures etc and the computer representations of Chinese text encoding and convention.

garry E

May, 2017

As I was interested to understand machine-learning techniques applied to Chinese language, this course helped me gain knowledge and skills to work with Chinese textual content. Understand machine-learning techniques. A great course indeed.

Harry M

May, 2017

A fantastic and interesting course. It greatly helped me learn to work on machine learning techniques to Chinese. The instructor patiently explained many important natural language processing techniques in understandable manner, though I was not familiar with Chinese language.

Bob R

July, 2017

The instructor has control over the subject matter, which worked for me to acquire machine learning methods in the Chinese. It is a delightful subject for those concerned with pursuing on natural language processing in Chinese, as it can support further occupation, as Chinese is the next largest used language.

Sebastien P

July, 2017

Made sense of the fundamentals of Chinese text structures etc and the computer descriptions of Chinese text encoding and practice. A comprehensive course.

Venkata Balaji N

July, 2017

The course was extremely fascinating. The learnings from the course can be used with Chinese business affairs. Very well organized and in an efficient manner.

Juan Camilo M

July, 2017

This is a very fine course. It is quite useful and explanatory. As I was obsessed with to find out machine-learning approaches used in the Chinese language, this study encouraged me to pick up insight and techniques to perform with Chinese textual composition.