• Online, Self-Paced
Course Description

MongoDB is a powerful NoSQL database that can use Spark to perform real-time analytics on its data. In this course, you will learn how to integrate and use MongoDB and Spark together using Java and Python.

Learning Objectives

Spark Connector

  • start the course
  • download and install the MongoDB Spark Connector
  • describe the configuration options and configure the MongoDB Spark connector

MongoDB and Spark with Java

  • read and use ReadConfig to read data from MongoDB
  • use datasets and schemas to retrieve data
  • use SQL queries to return data
  • write RDD data from Spark into MongoDB
  • use an aggregation pipeline to filter and perform aggregation in MongoDB

Practice: Using Spark for MongoDB

  • use the MongoDB Connector for Spark with Java and Python

Framework Connections

The materials within this course focus on the NICE Framework Task, Knowledge, and Skill statements identified within the indicated NICE Framework component(s):

Specialty Areas

  • Data Administration
  • Systems Administration

Feedback

If you would like to provide feedback for this course, please e-mail the NICCS SO at NICCS@hq.dhs.gov.