Spark for Big Data
Overview
Instructor(s): Rick McMullen, Jian Tao
Time: Friday, March 8, 2019 1:30PM-4:00PM CT
Location: SCC 102.B
Prerequisite(s): Active HPRC account, Python
This class will introduce the Spark Big Data computing environment and how to use it on HPRC clusters.
A Registration button will appear here when registration has been opened.
Course Materials
- Spark for Big Data (Spring 2019): PDF
Participation
During the training, attendees are expected to log in to an HPRC cluster using their own computer and complete the instructor-led examples and exercises.
Agenda
The course agenda will be available soon.
- What Spark is and what it is good for
- Using Spark on the Ada cluster using the Open OnDemand portal
- Running Jupyter+Spark
- Learning some Spark basics with Jupyter notebooks
