User Tools

Site Tools


start

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
start [2020/06/24 21:55]
deadline
start [2020/07/10 03:15]
deadline added v2 zeppelin notebook
Line 62: Line 62:
  
 ====Zeppelin Notebook for Scalable Data Science with Hadoop and Spark==== ====Zeppelin Notebook for Scalable Data Science with Hadoop and Spark====
-(Updated 20-Aug-2019)+(Updated 09-Jul-2020)
  
-  * [[https://www.clustermonkey.net/download/Hands-on_Hadoop_Spark/Scalable-Analytics.json|Scalable-Analytics.json]]+  * [[https://www.clustermonkey.net/download/Hands-on_Hadoop_Spark/Scalable-Analytics-V2.json|Scalable-Analytics-V2.json]] New version that uses Hive, Python, and PySpark 
 +  * [[https://www.clustermonkey.net/download/Hands-on_Hadoop_Spark/Scalable-Analytics.json|Scalable-Analytics.json]] Old Version that uses Pig, Python, and PySpark
  
 ---- ----
Line 82: Line 83:
 ===VERSION 2: (Current)===  ===VERSION 2: (Current)=== 
 (Updated 24-Jun-2020) (Updated 24-Jun-2020)
-CentOS Linux 7.6, Anaconda 3:Python 3.7.4, Apache Hadoop 3.2.1, Hive 3.1.2, Apache Spark 2.4.5, Derby 10.14.2.0, Zeppelin 0.8.2, Sqoop 1.4.7, Kafka 2.5.0. **Used in all current classes as of June 1, 2020.**+CentOS Linux 7.6, Anaconda 3:Python 3.7.4, R 3.6.0, Hadoop 3.2.1, Hive 3.1.2, Apache Spark 2.4.5, Derby 10.14.2.0, Zeppelin 0.8.2, Sqoop 1.4.7, Kafka 2.5.0. **Used in all current classes as of June 1, 2020.**
  
   * [[Linux Hadoop Minimal Installation Instructions VERSION 2]] (Read First)    * [[Linux Hadoop Minimal Installation Instructions VERSION 2]] (Read First) 
start.txt · Last modified: 2024/01/29 21:19 by deadline