• Online, Self-Paced
Course Description

This course explores how natural language processing (NLP) is used for machine learning, and examines the benefits and challenges of NLP when creating an application that can essentially understand human language. In its 13 videos, learners will be shown the essential components of NLP, including parsers, corpus, and corpus linguistic, as well as how to implement regular expressions. This course goes on to examine tokenization, a way to separate a piece of text into smaller units, and then illustrates different tokenization use cases with NLTK (Natural Language Toolkit). You will learn to use stop words using libraries and the NLTK. This course demonstrates how to implement regular expressions in Python to build NLP-powered applications. Learners will examine the list of Python NLP libraries along with their essential capabilities, including NLTK, Gensim, CoreNLP, spaCy and PyNLPl. You will learn to set up and configure an NLTK environment to illustrate how to process raw text. Finally, this course demonstrates the use of filtering stopwords in a tokenized sentence using NLTK.

Learning Objectives

{"discover the key concepts covered in this course"}

Framework Connections

The materials within this course focus on the NICE Framework Task, Knowledge, and Skill statements identified within the indicated NICE Framework component(s):

Specialty Areas

  • Systems Architecture

Feedback

If you would like to provide feedback for this course, please e-mail the NICCS SO at NICCS@hq.dhs.gov.