Skip to content

Navigate your cloud native world with training that matures your DevOps practices

Learn how to put the latest open source technology into practice with hands-on training, delivered by industry experts, aligned to your desired business outcomes

rx-m-cloud.png
CNCF-Logo.svg
KTP-logo.svg
KCSP-logo.svg
lf-atp-logo.svg
apache-bronze-sponsor-logo.svg

Spark Overview

1 Day

Available On-Site

Available Virtually

Open Enrollments Available

Customizable


This one-day Spark training course is for data engineers, analysts, architects, software engineers, IT operations and technical managers interested in a brief hands-on overview of the Apache Spark platform. The course covers core APIs for using Spark, basic internals of the platform, SQL and other high-level data access tools, as well as Spark’s streaming capabilities and machine learning APIs.

Each topic includes lecture content along with hands-on use of a Spark cluster in lab exercises. After attending the training, students will be able to: communicate with team members using appropriate terminology; identify and experiment with use cases for Spark and Databricks appropriate to business needs; build data pipelines and query large data sets using Spark SQL and DataFrames; execute and modify ETL jobs using the Spark API, DataFrames and RDDs; and analyze Spark jobs using the UIs and logs.

Delivery

Available for Instructor-Led (ILT) in-person/onsite training or Virtual Instructor-Led training (VILT) delivery; Open Enrollment options may be available.

Who Should Attend?

Application Developers, Analysts and Data Scientists

What Attendees will learn

This course is designed to provide attendees with a comprehensive introduction to Apache Spark and SparkSQL. Learning modules include:

  • Internals of the Apache Spark Platform
  • Working with key/value data and file based data
  • Querying Hadoop and JDBC/ODBC sources with SparkSQL

Prerequisites

Each attendee will require the ability to run a 64 bit virtual machine (provided with the course). Basic Linux command line skills are helpful. The coding examples use Python and PySpark so some experience with Python is important.


Contact us to request more information about enrolling in the Spark Overview course or to inquire about booking a custom in-house course for your team.

Other Open Enrollments from RX-M

May 2020
June 2020
No event found!
Load More

Frequently Asked Questions about Open Enrollment Courses

RX-M's Cloud Native & DevOps enablement philosophy

Bring a neutral perspective

We bring a market neutral perspective to every engagement, taking no stake in any of the competing cloud native platforms, components or solutions so we can offer unbiased insights to our clients

Practice what we teach

We are a multi-cloud company consisting of prominent open source contributors with large-scale software engineering experience, actively contributing to the evolution of next-gen software architectures, application management, and platforms

Be solution focused

RX-M has the unique ability to deliver purpose-built, solution-based training in the form of custom curriculum that aligns with each of our client's specific desired outcomes so your team has the skills needed to accelerate the business

Our team has been trusted to work alongside Cloud Native and DevOps teams at some of the most exciting companies

grey-client-logos-16-mar-2020.svg