CIS 612 Lab4 2 Spark HDFS Installation 2020


Posted on 12-Jun-2022



Set Up Guide of Spark on HDFS

CIS612 Big Data and Parallel Database Processing Systems

Download and install Spark on Ubuntu, then create a collection on a Spark RDD from JSON files.

Move the extracted directory to /opt.
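The original screenshot for this step is not available, so the following is only a minimal sketch of how the move could be done. The function name move_spark and both of its arguments are hypothetical; the guide does not state the Spark version or the exact target directory name under /opt, so substitute the directory that extraction actually produced.

```shell
# Sketch only: move_spark is a hypothetical helper, not part of Spark.
# $1 = the directory produced by extracting the Spark archive
# $2 = the target location under /opt (for example /opt/spark, an assumed name)
move_spark() {
    src="$1"
    dest="$2"

    # Refuse to continue if the extracted directory is missing.
    if [ ! -d "$src" ]; then
        echo "source directory not found: $src" >&2
        return 1
    fi

    # Avoid nesting the directory inside an existing installation.
    if [ -e "$dest" ]; then
        echo "destination already exists: $dest" >&2
        return 1
    fi

    # Writing to /opt normally requires root, so use sudo only when the
    # parent of the destination is not writable by the current user.
    if [ -w "$(dirname "$dest")" ]; then
        mv "$src" "$dest"
    else
        sudo mv "$src" "$dest"
    fi
}
```

A call would then look like move_spark "$HOME/Downloads/your-extracted-spark-directory" /opt/spark, where both paths are placeholders for your own.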


Configure the Environment
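The screenshots for this step did not survive, so the lines below are a hedged sketch of a typical configuration rather than the guide's exact settings. It assumes Spark was moved to /opt/spark (an assumed path; the guide only says /opt) and that the lines are appended to a shell startup file such as ~/.bashrc.

```shell
# Assumption: Spark lives in /opt/spark; change this if you used another path.
export SPARK_HOME=/opt/spark

# Make the Spark launch scripts (bin) and cluster scripts (sbin) available
# on the command line.
export PATH="$PATH:$SPARK_HOME/bin:$SPARK_HOME/sbin"
```

After saving the file, reload it in the current terminal (for example with source ~/.bashrc) so the new variables take effect.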


References:

https://www.edureka.co/blog/apache-hive-installation-on-ubuntu/comment-page-2/#comments

https://bigdataprogrammers.com/load-csv-file-in-hive/

https://www.liquidweb.com/kb/how-to-install-apache-spark-on-ubuntu/