Instructional Video5:18
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Rectifying the Error

Higher Ed
In this session, we will rectify the errors we faced while creating DataFrames from RDD. This clip is from the chapter "Spark DFs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on Spark...
Instructional Video8:03
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - RDD Filter

Higher Ed
In this lesson, you will learn about Filter() function. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on PySpark RDDs.
Instructional Video1:55
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Quiz (Sort, OrderBy)

Higher Ed
It's time for a short quiz on sorting. This clip is from the chapter "Spark DFs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on Spark DataFrames in depth.
Instructional Video1:29
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Quiz (Average)

Higher Ed
It's time for a short quiz on average. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on PySpark RDDs.
Instructional Video8:20
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Project (Filtering)

Higher Ed
In this lesson, we will print the names of employees working in NY state under finance department whose bonuses are greater than the average bonuses of employees in NY state. This clip is from the chapter "Spark DFs" of the series...
Instructional Video8:30
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Glue Job (Full Load)

Higher Ed
This video covers how to use the right glue job (full load). This clip is from the chapter "Project - Change Data Capture / Replication Ongoing" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section covers...
Instructional Video2:20
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - ETL Pipeline Flow

Higher Ed
In this video, we will discuss the pipeline we will be working on. This clip is from the chapter "ETL Pipeline" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section covers Extract Transform Load (ETL)...
Instructional Video5:06
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Total Enrollments per Course

Higher Ed
In this lesson, we will work on the 4th step, which is to show the total number of students enrolled per course. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This...
Instructional Video5:19
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Spark Streaming Reading Data

Higher Ed
In this session, you will read data from file and get it in the Spark streaming context. This clip is from the chapter "Spark Streaming" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".The section primarily focuses...
Instructional Video5:15
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Spark Streaming Display

Higher Ed
In this lesson, we will work on Spark streaming display for better data visualization. This clip is from the chapter "Spark Streaming" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".The section primarily focuses on...
Instructional Video5:10
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Spark Streaming Context

Higher Ed
In this video, you will learn how to read data and specify the directory in the Spark streaming context with RDD. This clip is from the chapter "Spark Streaming" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".The...
Instructional Video8:28
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Spark RDDs

Higher Ed
Learn about spark RDDs (Resilient Distributed Dataset) in this session. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on PySpark RDDs.
Instructional Video6:13
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Spark DF with Column Renamed and Alias

Higher Ed
In this video, we will learn how we can rename the column in Spark DataFrames. This clip is from the chapter "Spark DFs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on Spark DataFrames in...
Instructional Video6:25
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Spark DF (Sort, OrderBy)

Higher Ed
In this lesson, we will explore sorting in Spark DataFrame using sort and orderBy. This clip is from the chapter "Spark DFs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on Spark DataFrames...
Instructional Video11:08
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Spark DF (Group By - Filtering)

Higher Ed
Let's explore filtering for group by in this video. This clip is from the chapter "Spark DFs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on Spark DataFrames in depth.
Instructional Video11:10
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Solution (Average)

Higher Ed
Let's discuss a solution for the quiz on average. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on PySpark RDDs.
Instructional Video11:50
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Select DF Columns

Higher Ed
Learn how to select multiple columns from the DataFrames. This clip is from the chapter "Spark DFs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on Spark DataFrames in depth.
Instructional Video6:24
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - RDD Distinct

Higher Ed
In this lesson, you will learn about the Distinct() RDD function. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on PySpark RDDs.
Instructional Video15:14
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - RDD (saveAsTextFile)

Higher Ed
In this session, you will learn about how to save RDD to a text file using the saveAsTextFile() function. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section...
Instructional Video18:06
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - RDD (Partition)

Higher Ed
In this lesson, we will explore the concepts of partition. This clip is from the chapter "Spark RDDs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on PySpark RDDs.
Instructional Video1:30
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Quiz (UDFs)

Higher Ed
It's time for a short quiz on UDFs. This clip is from the chapter "Spark DFs" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section focuses on Spark DataFrames in depth.
Instructional Video7:03
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Change Data Capture Pipeline

Higher Ed
In this lesson, we will perform the last step of the project, which is the Change Data Capture (CDC) pipeline. Let's wind up. This clip is from the chapter "Project - Change Data Capture / Replication Ongoing" of the series "PySpark and...
Instructional Video2:50
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - Running Spark

Higher Ed
In this session, we will run Spark and confirm if it was configured successfully. This clip is from the chapter "Introduction to Hadoop, Spark Ecosystems and Architectures" of the series "PySpark and AWS: Master Big Data with PySpark and...
Instructional Video5:31
Curated Video

PySpark and AWS: Master Big Data with PySpark and AWS - RDS Networking

Higher Ed
Now that our database is ready, we will go ahead and work on RDS networking. This clip is from the chapter "ETL Pipeline" of the series "PySpark and AWS: Master Big Data with PySpark and AWS".This section covers Extract Transform Load...