<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="FeedCreator 1.8" -->
<?xml-stylesheet href="https://www.clustermonkey.net/scalable-analytics/lib/exe/css.php?s=feed" type="text/css"?>
<rdf:RDF
    xmlns="http://purl.org/rss/1.0/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel rdf:about="https://www.clustermonkey.net/scalable-analytics/feed.php">
        <title>Live On-Line Training:&lt;BR&gt; Scalable Data Pipelines with Hadoop, Spark, and Kafka</title>
        <description></description>
        <link>https://www.clustermonkey.net/scalable-analytics/</link>
        <image rdf:resource="https://www.clustermonkey.net/scalable-analytics/lib/tpl/dokuwiki/images/favicon.ico" />
       <dc:date>2026-04-24T19:54:46+00:00</dc:date>
        <items>
            <rdf:Seq>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_bash_programming_training&amp;rev=1738256767&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_command_line_class&amp;rev=1599679613&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_data_engineering_class&amp;rev=1608306144&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_getting_started_with_kafka&amp;rev=1664973392&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_hands-on_class&amp;rev=1635949718&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_kafka_methods_and_administration&amp;rev=1679317118&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_linux_command_line_training&amp;rev=1626288995&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_scalable_pyspark_for_data_science&amp;rev=1704668919&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions&amp;rev=1590086803&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_0.42&amp;rev=1591205240&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_2&amp;rev=1771442891&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_3&amp;rev=1773259205&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=old_version_0.42&amp;rev=1608336312&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=sign_up_form&amp;rev=1571060541&amp;do=diff"/>
                <rdf:li rdf:resource="https://www.clustermonkey.net/scalable-analytics/doku.php?id=start&amp;rev=1770672721&amp;do=diff"/>
            </rdf:Seq>
        </items>
    </channel>
    <image rdf:about="https://www.clustermonkey.net/scalable-analytics/lib/tpl/dokuwiki/images/favicon.ico">
        <title>Live On-Line Training:<BR> Scalable Data Pipelines with Hadoop, Spark, and Kafka</title>
        <link>https://www.clustermonkey.net/scalable-analytics/</link>
        <url>https://www.clustermonkey.net/scalable-analytics/lib/tpl/dokuwiki/images/favicon.ico</url>
    </image>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_bash_programming_training&amp;rev=1738256767&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2025-01-30T17:06:07+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_bash_programming_training</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_bash_programming_training&amp;rev=1738256767&amp;do=diff</link>
        <description>When the VM is Started

Open a terminal (using Putty or MobaXterm on Windows) and enter the following to log in to the LHM-VM as user “hands-on”  (password=“minimal”)


  ssh hands-on@127.0.0.1 -p 2222


Once you are logged in to the LHM-VM, you should see the following prompt string:</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_command_line_class&amp;rev=1599679613&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2020-09-09T19:26:53+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_command_line_class</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_command_line_class&amp;rev=1599679613&amp;do=diff</link>
        <description>Beginning/Intermediate Linux Command Line Quick Start

The following steps explain how load and start the Linux Hadoop Minimal Virtual Machine (LHM-VM) and download the course notes files. A full and expanded explanation is provided as part of the class. The following steps are a</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_data_engineering_class&amp;rev=1608306144&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2020-12-18T15:42:24+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_data_engineering_class</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_data_engineering_class&amp;rev=1608306144&amp;do=diff</link>
        <description>Data Engineering Quick Start

The following steps explain how load and start the Linux Hadoop Minimal Virtual Machine (LHM-VM) and download the course notes files. A full and expanded explanation is provided as part of the class. The following steps are a</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_getting_started_with_kafka&amp;rev=1664973392&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2022-10-05T12:36:32+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_getting_started_with_kafka</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_getting_started_with_kafka&amp;rev=1664973392&amp;do=diff</link>
        <description>Getting Started with Kafka Quick Start

The following steps explain how load and start the Linux Hadoop Minimal Virtual Machine (LHM-VM) and download the course notes files. A full and expanded explanation is provided as part of the class. The following steps are a</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_hands-on_class&amp;rev=1635949718&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2021-11-03T14:28:38+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_hands-on_class</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_hands-on_class&amp;rev=1635949718&amp;do=diff</link>
        <description>Hands-on Introduction to Apache Hadoop and Spark Programming Quick Start

The following steps explain how load and start the Linux Hadoop Minimal Virtual Machine (LHM-VM) and download the course notes files. A full and expanded explanation is provided as part of the class. The following steps are a</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_kafka_methods_and_administration&amp;rev=1679317118&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2023-03-20T12:58:38+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_kafka_methods_and_administration</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_kafka_methods_and_administration&amp;rev=1679317118&amp;do=diff</link>
        <description>Kafka Methods and Administration Quick Start

The following steps explain how load and start the Linux Hadoop Minimal Virtual Machine (LHM-VM) and download the course notes files. A full and expanded explanation is provided as part of the class. The following steps are a</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_linux_command_line_training&amp;rev=1626288995&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2021-07-14T18:56:35+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_linux_command_line_training</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_linux_command_line_training&amp;rev=1626288995&amp;do=diff</link>
        <description>Linux Command Line Quick Start

The following steps explain how load and start the Linux Hadoop Minimal Virtual Machine (LHM-VM) and download the training notes files. A full and expanded explanation is provided as part of the class. 

If you are using Linux or Mac, a terminal application is available that includes and</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_scalable_pyspark_for_data_science&amp;rev=1704668919&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2024-01-07T23:08:39+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>first_steps_for_scalable_pyspark_for_data_science</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=first_steps_for_scalable_pyspark_for_data_science&amp;rev=1704668919&amp;do=diff</link>
        <description>Scalable PySpark for Data Science

The following steps explain how load and start the Linux Hadoop Minimal Virtual Machine (LHM-VM) and download the course notes files. A full and expanded explanation is provided as part of the class. The following steps are a</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions&amp;rev=1590086803&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2020-05-21T18:46:43+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>linux_hadoop_minimal_installation_instructions</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions&amp;rev=1590086803&amp;do=diff</link>
        <description>Linux Hadoop Minimal VM Notes

Version: .42

Date: June 3, 2019

Author: Douglas Eadline

Email: deadline(you know what goes here)basement-supercomputing.com

Unless otherwise noted, all course content, notes, and examples are
(c) Copyright Basement Supercomputing 2019, All rights reserved.</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_0.42&amp;rev=1591205240&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2020-06-03T17:27:20+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>linux_hadoop_minimal_installation_instructions_version_0.42</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_0.42&amp;rev=1591205240&amp;do=diff</link>
        <description>This version is DEPRECATED as of June 1, 2020. It will still work for most of class examples, but does not have Kafka or Python 3

Linux Hadoop Minimal VM Notes

Version: .42

Date: June 3, 2019

Author: Douglas Eadline

Email: d...@b...g.com

Unless otherwise noted, all course content, notes, and examples are</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_2&amp;rev=1771442891&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2026-02-18T19:28:11+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>linux_hadoop_minimal_installation_instructions_version_2</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_2&amp;rev=1771442891&amp;do=diff</link>
        <description>Linux Hadoop Minimal VM Notes VERSION 2

NOTE: The LHM uses CentOS7 which has reached EOL. An update to Rocky Linux 9 is planned. The LHM is generally safe to use because services do not connect to the Internet. 

There are now two versions of the LHM (Intel-x86_64 and Apple-M)</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_3&amp;rev=1773259205&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2026-03-11T20:00:05+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>linux_hadoop_minimal_installation_instructions_version_3</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=linux_hadoop_minimal_installation_instructions_version_3&amp;rev=1773259205&amp;do=diff</link>
        <description>Linux Hadoop Minimal VM Notes VERSION 3

All Versions Run on Oracle Virtual Box (Intel-x86_64 and Apple-Arm)

	*  LHM Version 3+ for both x86 and Apple Arm run on VirtualBox version 7.2 or higher. 
	*  There is a separate machines for each architecture.</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=old_version_0.42&amp;rev=1608336312&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2020-12-19T00:05:12+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>old_version_0.42</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=old_version_0.42&amp;rev=1608336312&amp;do=diff</link>
        <description>VERSION 0.42: (Deprecated)

CentOS Linux 6.9, Apache Hadoop 2.8.1, Pig 0.17.0, Hive 2.3.2, Spark 1.6.3, Derby 10.13.1.1, Zeppelin 0.7.3, Sqoop 1.4.7, Flume-1.8.0. Used in previous classes.

	*  Linux Hadoop Minimal Installation Instructions VERSION 0.42 (Read First)  
	*  Linux Hadoop Minimal V0.42 MD5
	*  Linux Hadoop Minimal Virtual Machine V0.42 file</description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=sign_up_form&amp;rev=1571060541&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2019-10-14T13:42:21+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>sign_up_form</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=sign_up_form&amp;rev=1571060541&amp;do=diff</link>
        <description></description>
    </item>
    <item rdf:about="https://www.clustermonkey.net/scalable-analytics/doku.php?id=start&amp;rev=1770672721&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2026-02-09T21:32:01+00:00</dc:date>
        <dc:creator>Anonymous (anonymous@undisclosed.example.com)</dc:creator>
        <title>start</title>
        <link>https://www.clustermonkey.net/scalable-analytics/doku.php?id=start&amp;rev=1770672721&amp;do=diff</link>
        <description>Welcome to the Effective Data Pipelines Series

This page provides many of the resources for books, videos and on-line trainings.

You can find more information on all current  video and book titles and upcoming on-line trainings from O'Reilly.

Training Descriptions

Many of the trainings are run on a regular basis. Check the</description>
    </item>
</rdf:RDF>
