Quality Seal Emagister EMAGISTER CUM LAUDE

Big Data Analysis with Spark - University of California

edX
Online
4 Meinungen

Kostenlos

Wichtige informationen

  • Kurs
  • Online
  • Dauer:
    4 Weeks
  • Wann:
    Freie Auswahl
Beschreibung

Learn how to apply data science techniques using parallel programming in Spark to explore big data. With an apprenticeship you earn while you learn, you gain recognized qualifications, job specific skills and knowledge and this helps you stand out in the job market.With this course you earn while you learn, you gain recognized qualifications, job specific skills and knowledge and this helps you stand out in the job market.

Wichtige informationen

Voraussetzungen: Programming background and experience with Python required. All exercises will use PySpark (part of Apache Spark). Previous experience with Spark equivalent to CS105x: Introduction to Spark required.

Veranstaltungsort(e)

Wo und wann

Beginn Lage
Freie Auswahl
Online

Meinungen

X

14.09.2016
Das Beste good hands-on lab to get you started quickly. But the lecture is not so related to the lab. Better take it with a book on Spark.

Zu verbessern No negative aspects.

Kurs abgeschlossen: September 2016 | Recomendarías este centro? Sí.
E

07.10.2016
Das Beste Great course organization, especially the balance between theory and practice. Some tasks were too easy and some were not clear at first, but piazza search usually helped. I consider this is a very good pyspark tutorial with explanation of spark key features.

Zu verbessern N/A.

Kurs abgeschlossen: Oktober 2016 | Recomendarías este centro? Sí.
E

09.11.2015
Das Beste A lot of overlapping with the 2 other courses of the xSerie. I would definitely not advise taking this course if you took them. The last of the 4 weeks consists of only 20 minutes of video explaining very basic statistic concepts.

Zu verbessern Nothing.

Kurs abgeschlossen: November 2015 | Recomendarías este centro? Sí.

Was lernen Sie in diesem Kurs?

Data analysis
Programming
Big Data
Spark
Science Techniques

Themenkreis

Organizations use their data to support and influence decisions and build data-intensive products and services, such as recommendation, prediction, and diagnostic systems. The collection of skills required by organizations to support these functions has been grouped under the term ‘data science’.

This statistics and data analysis course will attempt to articulate the expected output of data scientists and then teach students how to use PySpark (part of Spark) to deliver against these expectations. The course assignments include log mining, textual entity recognition, and collaborative filtering exercises that teach students how to manipulate data sets using parallel processing with PySpark.

This course covers advanced undergraduate-level material. It requires a programming background and experience with Python (or the ability to learn it quickly). All exercises will use PySpark (the Python API for Spark), and previous experience with Spark equivalent to Introduction to Spark, is required.