Advanced Spark for Data Science and Data Engineering - University of California

edX
Online

Kostenlos

Wichtige informationen

  • Kurs
  • Online
  • Dauer:
    4 Weeks
  • Wann:
    Freie Auswahl
Beschreibung

Learn common Spark use cases and take a deeper dive into Spark’s architecture and APIs. With an apprenticeship you earn while you learn, you gain recognized qualifications, job specific skills and knowledge and this helps you stand out in the job market.With this course you earn while you learn, you gain recognized qualifications, job specific skills and knowledge and this helps you stand out in the job market.

Wichtige informationen

Voraussetzungen: Programming background and experience with Python required. All exercises will use PySpark (part of Spark). Previous experience with Spark equivalent to CS110x: Big Data Analysis with Spark.

Veranstaltungsort(e)

Wo und wann

Beginn Lage
Freie Auswahl
Online

Was lernen Sie in diesem Kurs?

Engineering
Data science
APIs
Spark
Data Engineering

Themenkreis

Gain a deeper understanding of Spark by learning about its APIs, architecture, and common use cases.  This statistics and data analysis course will cover material relevant to both data engineers and data scientists.  You’ll learn how Spark efficiently transfers data across the network via its shuffle, details of memory management, optimizations to reduce compute costs, and more.  Learners will see several use cases for Spark and will work to solve a variety of real-world problems using public datasets.  After taking this course, you should have a thorough understanding of how Spark works and how you can best utilize its APIs to write efficient, scalable code.  You’ll also learn about a wide variety of Spark’s APIs, including the APIs in Spark Streaming.