• Online, Self-Paced
Course Description

MongoDB is a NoSQL database whose data can be analyzed in real time with Spark. In this course, you will learn how to integrate MongoDB with Spark and use the two together from Java and Python.

Learning Objectives

Spark Connector

  • start the course
  • download and install the MongoDB Spark Connector
  • describe the configuration options and configure the MongoDB Spark Connector

MongoDB and Spark with Java

  • use ReadConfig to read data from MongoDB
  • use datasets and schemas to retrieve data
  • use SQL queries to return data
  • write RDD data from Spark into MongoDB
  • use an aggregation pipeline to filter and perform aggregation in MongoDB

Practice: Using Spark for MongoDB

  • use the MongoDB Connector for Spark with Java and Python

Framework Connections

The materials within this course focus on the Knowledge, Skills, and Abilities (KSAs) identified within the Specialty Areas of the National Cybersecurity Workforce Framework listed below.