What you'll get

  • Job Credibility
  • Certification Valid for Life
  • Live Classes
  • Certificate of Completion

Exam details

  • Mode of Exam : Online
  • Duration : 1 Hour
  • Question Type : Multiple Choice
  • No. of Questions : 50
  • Passing Marks : 25 (50%)
  • There is no negative marking

The "Certified Apache Spark with Scala" course teaches Apache Spark, a recent technology in big data and a valuable skill to have. Large organizations such as Amazon, eBay, NASA JPL, and Yahoo all use Spark, so the skill is directly relevant if your dream company is among them. The course covers many of Spark's features and techniques.

The course begins with an introduction to Spark, then shows how to install Java and Git, set up a Spark project with IntelliJ IDEA, run your first Apache Spark job, and troubleshoot that first job.

You will also learn RDD basics in Apache Spark: how to create RDDs, the map and filter transformations, the flatMap transformation, set operations, and actions. Worked solutions to the Airports by Latitude problem and the Same Hosts problem are covered in detail.

Other topics covered in this course include:

  • Spark Architecture
  • Spark Components
  • Introduction to Pair RDD in Spark
  • Create Pair RDDs in Spark
  • Filter and MapValue Transformations on Pair RDD
  • Data Partitioning in Apache Spark
  • Join Operations in Spark
  • Use Dataset or RDD
  • Dataset and RDD Conversion
  • Performance Tuning of Spark SQL
  • Run Spark Application on Amazon EMR (Elastic MapReduce) cluster

Everything is taught from scratch, from basic to advanced level.

Some previous programming or scripting experience is helpful but not compulsory, as the course is designed to cover everything you need. All you need is a desktop PC and an Internet connection.

This course is for software engineers, programming enthusiasts, IT professionals, engineers, and students pursuing computer science.

The course is to the point, comprehensive yet concise, and includes real-life examples.

Enroll now, and enjoy the course!

Course Content

Total: 43 lectures
  • Introduction to Spark
  • Install Java and Git
  • Set up Spark project with IntelliJ IDEA
  • Run our first Apache Spark job
  • Troubleshooting: Run our first Apache Spark job
  • RDD Basics in Apache Spark
  • Create RDDs
  • Map and Filter Transformation in Apache Spark
  • Solution to Airports by Latitude Problem
  • FlatMap Transformation in Apache Spark
  • Set Operation in Apache Spark
  • Solution for the Same Hosts Problem
  • Actions in Apache Spark
  • Solution to Sum of Numbers Problem
  • Important Aspects about RDD
  • Summary of RDD Operations in Apache Spark
  • Caching and Persistence in Apache Spark
  • Spark Architecture
  • Spark Components
  • Introduction to Pair RDD in Spark
  • Create Pair RDDs in Spark
  • Filter and MapValue Transformations on Pair RDD
  • Reduce By Key Aggregation in Apache Spark
  • Sample solution for the Average House problem
  • GroupBy Key Transformation in Spark
  • SortBy Key Transformation in Spark
  • Sample Solution for the Sorted Word Count Problem
  • Data Partitioning in Apache Spark
  • Join Operations in Spark
  • Accumulators
  • Solution to StackOverflow Survey Follow-up Problem
  • Broadcast Variables
  • Introduction to Apache Spark SQL
  • Spark SQL in Action
  • Spark SQL practice: House Price Problem
  • Spark SQL Joins
  • Strongly Typed Dataset
  • Use Dataset or RDD
  • Dataset and RDD Conversion
  • Performance Tuning of Spark SQL
  • Introduction to Running Spark in a Cluster
  • Package Spark Application and Use spark-submit
  • Run Spark Application on Amazon EMR (Elastic MapReduce) cluster
