Skip to content

Navigate your cloud native world with training that matures your DevOps practices

Learn how to put the latest open source technology into practice with hands-on training, delivered by industry experts, aligned to your desired business outcomes

rx-m-cloud.png
KTP-logo.svg
KCSP-logo.svg
cncf-member-silver.svg
lf-atp-logo.svg
apache-bronze-sponsor-logo.svg

Apache Spark Programming

2 Days

Available On-Site

Available Virtually

Open Enrollments Available

Customizable


This Apache Spark training course is for data engineers, analysts, architects, software engineers, IT operations and technical managers interested in a thorough, hands-on overview of Apache Spark and SparkSQL. The course covers the core APIs for using Spark; fundamental mechanisms and basic internals of the platform; SQL and other high-level data access tools; as well as Spark’s streaming capabilities and machine learning APIs.

Each topic includes lecture content along with hands-on use of Spark in lab exercises. Attendees will code jobs and perform data analysis queries, and visualizations using their own Spark cluster. All class code is directly usable with pure open-source Spark or any commercial Spark distribution.

Delivery

Available for Instructor-Led (ILT) in-person/onsite training or Virtual Instructor-Led training (VILT) delivery; Open Enrollment options may be available.

Who Should Attend?

Application Developers, Analysts and Data Scientists

What Attendees will learn

This course is designed to provide attendees with a comprehensive, hands-on overview of Apache Spark and SparkSQL. Learning modules include:

  • Internals of the Apache Spark Platform
  • Manipulating and Analysis of Resislient Distributed Datasets and DataFrames
  • Working with key/value data and file based data
  • Querying Hadoop and JDBC/ODBC sources with SparkSQL
  • Coding User Defined Functions

Prerequisites

Each attendee will require the ability to run a 64 bit virtual machine (provided with the course). Basic Linux command line skills are helpful. The coding examples use Python and PySpark so some experience with Python is important.


Apache Spark Programming Open Enrollments

November 2020
February 2021
March 2021
May 2021
No event found!

Contact us to request more information about enrolling in the Spark Programming course or to inquire about booking a custom in-house course for your team.

Frequently Asked Questions about Open Enrollment Courses

RX-M's Cloud Native & DevOps enablement philosophy

Bring a neutral perspective

We bring a market neutral perspective to every engagement, taking no stake in any of the competing cloud native platforms, components or solutions so we can offer unbiased insights to our clients

Practice what we teach

We are a multi-cloud company consisting of prominent open source contributors with large-scale software engineering experience, actively contributing to the evolution of next-gen software architectures, application management, and platforms

Be solution focused

RX-M has the unique ability to deliver purpose-built, solution-based training in the form of custom curriculum that aligns with each of our client's specific desired outcomes so your team has the skills needed to accelerate the business

Our team has been trusted to work alongside Cloud Native and DevOps teams at some of the most exciting companies

grey-client-logos-16-mar-2020.svg