This course gives an overview of Big Data, i.e., the storage, retrieval, and processing of big data.
It also focuses on the “technologies”, i.e., the tools and algorithms available for storing and processing Big Data.
It helps students gain an in-depth understanding of, and practical experience with, Apache Spark and the Spark ecosystem, covering Spark RDD, Spark SQL, Spark MLlib, and Spark Streaming.
It helps students perform various “analytics” on different data sets and draw meaningful conclusions.
Syllabus
Unit 1: Introduction to Big Data Analytics
Introduction to Big Data: Types of data - Evolution of big data - Definition of big data - Characteristics of big data - Challenges with big data - Introduction to big data analytics - Technologies that help meet the challenges posed by big data. The big data technology landscape: Introduction to Hadoop - Hadoop architecture - Hadoop Distributed File System (HDFS) - Processing data with Hadoop - The Hadoop ecosystem.
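As a small taste of the “Processing data with Hadoop” topic, the sketch below shows a word-count job written for Hadoop Streaming, which lets the mapper and reducer be plain Python scripts that read lines from stdin and emit tab-separated key/value pairs on stdout. The file names (mapper.py, reducer.py) and the sample data are illustrative, not part of the course material.

```python
#!/usr/bin/env python3
# mapper.py -- emits a (word, 1) pair for every word on every input line.
# Hadoop Streaming feeds input splits to this script via stdin.
import sys

for line in sys.stdin:
    for word in line.strip().split():
        print(f"{word}\t1")
```

```python
#!/usr/bin/env python3
# reducer.py -- sums the counts per word. Hadoop sorts the mapper output by key,
# so all occurrences of a word arrive on consecutive lines.
import sys

current_word, current_count = None, 0
for line in sys.stdin:
    word, count = line.rstrip("\n").split("\t", 1)
    if word == current_word:
        current_count += int(count)
    else:
        if current_word is not None:
            print(f"{current_word}\t{current_count}")
        current_word, current_count = word, int(count)
if current_word is not None:
    print(f"{current_word}\t{current_count}")
```

A job like this is submitted with the hadoop-streaming JAR, passing the two scripts as -mapper and -reducer together with HDFS -input and -output paths; the exact JAR location and options depend on the Hadoop installation.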
Unit 2: PySpark
Introduction to Spark - Why Spark with Python - Spark core concepts - Spark core components - Spark architecture - How Spark works - Environment setup - Spark RDD - Programming with RDDs: Creating RDDs - Common RDD transformations and actions - Key-value pairs - RDD vs DataFrame - Aggregate and group-by operations - Filters - Joins - Programming with DataFrames - Data preprocessing methods - Data exploration - Data manipulation - Machine learning using Spark - Data analysis use cases with real-world applications.
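The topics above map directly onto a few lines of PySpark. Below is a minimal, illustrative sketch, assuming a local Spark installation: it creates an RDD, applies common transformations and an action on key-value pairs, and then demonstrates filter, join, group-by, and aggregation on DataFrames. Column names and sample values are made up for the example.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("unit2-sketch").getOrCreate()
sc = spark.sparkContext

# --- RDD: create, transform, act ---
rdd = sc.parallelize(["spark makes big data simple", "big data needs big tools"])
word_counts = (rdd.flatMap(lambda line: line.split())   # transformation: line -> words
                  .map(lambda w: (w, 1))                # key-value pairs
                  .reduceByKey(lambda a, b: a + b))     # transformation: sum per key
print(word_counts.collect())                            # action: triggers execution

# --- DataFrames: filter, join, group by, aggregate ---
sales = spark.createDataFrame(
    [(1, "north", 120.0), (2, "south", 80.0), (1, "north", 45.5)],
    ["product_id", "region", "amount"],
)
products = spark.createDataFrame(
    [(1, "keyboard"), (2, "mouse")],
    ["product_id", "name"],
)

report = (sales.filter(F.col("amount") > 50)                  # filter rows
               .join(products, on="product_id", how="inner")  # join two DataFrames
               .groupBy("region", "name")                     # group by
               .agg(F.sum("amount").alias("total_amount")))   # aggregate
report.show()

spark.stop()
```

Running the script with spark-submit (or inside the pyspark shell) prints the word counts and a small grouped report; the same filter/join/groupBy pattern applies unchanged to cluster-sized data sets.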
NOTE:
Python is a prerequisite for this course.