Welcome to the Big Data Engineering Foundations Resource Page

This page contains the lesson notes, examples, and data used in the Part 1 and Part 2 videos.

Updated: 02-Dec-2022

Lesson Notes and Zeppelin Notebooks:

The lesson notes for both Part 1 and Part 2 can be found in either tgz or zip format (click to download).

PySpark-with-CSV-Files-and-Hive-Tables Zeppelin Notebook for Lesson 6.4 PySpark examples using Zeppelin. Use "Save As" to download the file.

Linux Hadoop® Minimal Virtual Machine (LHM-VM):

This virtual machine can be used for many of the examples in the video. See the installation instructions for more information.

Command for Downloading Lesson Notes into the LHM-VM:

Once you have started and logged into the LHM-VM (as user hands-on), cut and paste the following lines to download and extract get the lesson notes into the LHM-VM. Preform the following from inside the LHM-VM:

  $ wget  --no-check-certificate https://www.clustermonkey.net/download/LiveLessons/Big_Data_Engineering_Foundations/Big_Data_Engineering_Foundations-V2.tgz

Extract the archive file using the "tar" command:

  $ tar xvzf Big_Data_Engineering_Foundations-V2.tgz

Other Resources:

Questions or Problems

Please contact Douglas Eadline at deadline(you know what goes here)eadline(and here)org


Unless otherwise noted, all supplementary content, notes, and examples © Copyright Douglas Eadline 2021,2022, All rights reserved.