Differences

This shows you the differences between two versions of the page.

start [2020/06/24 21:50] deadline: updated LHM to V2-beta2
start [2020/07/10 03:15] deadline: added v2 zeppelin notebook
Line 44: Line 44:
  
 ====Class Notes for Data Engineering at Scale with Apache Hadoop and Spark====
-(Updated 06-Mar-2020)
+(Updated 23-Jun-2020)
 
   * [[First Steps for Data Engineering Class]]
Line 62: Line 62:
  
 ====Zeppelin Notebook for Scalable Data Science with Hadoop and Spark====
-(Updated 20-Aug-2019)
+(Updated 09-Jul-2020)
 
-  * [[https://www.clustermonkey.net/download/Hands-on_Hadoop_Spark/Scalable-Analytics.json|Scalable-Analytics.json]]
+  * [[https://www.clustermonkey.net/download/Hands-on_Hadoop_Spark/Scalable-Analytics-V2.json|Scalable-Analytics-V2.json]] New version that uses Hive, Python, and PySpark
+  * [[https://www.clustermonkey.net/download/Hands-on_Hadoop_Spark/Scalable-Analytics.json|Scalable-Analytics.json]] Old Version that uses Pig, Python, and PySpark
 
 ----
Line 82: Line 83:
 ===VERSION 2: (Current)===
 (Updated 24-Jun-2020)
-CentOS Linux 7.6, Anaconda 3:Python 3.7.4, Apache Hadoop 3.2.1, Hive 3.1.2, Apache Spark 2.4.5, Derby 10.14.2.0, Zeppelin 0.8.2, Sqoop 1.4.7, Kafka 2.5.0. **Used in all current classes as of June 1, 2020.**
+CentOS Linux 7.6, Anaconda 3:Python 3.7.4, R 3.6.0, Hadoop 3.2.1, Hive 3.1.2, Apache Spark 2.4.5, Derby 10.14.2.0, Zeppelin 0.8.2, Sqoop 1.4.7, Kafka 2.5.0. **Used in all current classes as of June 1, 2020.**
 
   * [[Linux Hadoop Minimal Installation Instructions VERSION 2]] (Read First)
start.txt · Last modified: 2024/01/29 21:19 by deadline