Course 1: Understanding Fundamentals of Big Data
Duration: 1 day
Level: Basic
Fundamentals of Big Data
- Understanding Big Data
- Big Data Drivers
- Big Data Use Cases
- Understanding Big Data Dimensions
- Characteristics of Big Data
- Big Data Governance
Big Data Tools
- Hadoop: A Brief Introduction
- Hadoop Distributed File System (HDFS): A Technical Introduction
- Hadoop Ecosystem
- Using Data Processing Tools: Pig, Hive, Sqoop, and HBase
- YARN Architecture
Big Data Application Development
- Big Data Patterns
- Application Development Framework
- Execution Model
- Comparing MapReduce and other Frameworks
Using NoSQL for Big Data Management
- Understanding NoSQL
- Schema-less Model Benefits
- Data Modelling Using Key/Value, Document-Driven, Column-Oriented, and Graph Models
- NoSQL Product Comparison
Big Data Analytics
- Understanding the Concept of Analytics
- Building a POC to Implement a Big Data Lake
- Evaluating Platforms for Big Data Implementation
- The Concept of Data Science and Its Driving Forces
Course 2: Hadoop Fundamentals
Duration: 1 day
Level: Basic
Hadoop Fundamentals
- Big Data Introduction
- Hadoop in the Making: History
- Hadoop Capabilities and Scenarios
- Building a Data Lake Using Hadoop
- Analytical and Operational Big Data Frameworks and Hadoop
Components of Hadoop
- Understanding the Components of the File System
- HDFS: Features and Benefits
- Cluster Management (YARN and MapReduce)
- Building a Hadoop Distribution Using Open Source
- Installing and Configuring Hadoop for Big Data
Data Ingestion in Hadoop
- Understanding Data in Motion and Data at Rest
- Basic Data Copy
- Using Flume
- Sqoop
- Fundamentals of Kafka
Application Development in Hadoop
- Using Java/Python to Implement MapReduce
- Working with Pig and Pig Latin
- Hive and Impala
- Oozie
Critical Components of Hadoop
- Zookeeper
- HBase
- Lucene and Solr
- Avro
Course 3: NoSQL Fundamentals
Duration: 1 day
Fundamentals of NoSQL
- NoSQL: Why?
- Overview of NoSQL
- Characteristics of NoSQL
- NoSQL Storage Types
- Advantages and Drawbacks
- Comparing NoSQL Products
Understanding Storage Architecture
- Understanding Key/Value Stores Using Memcached and Redis
- Document-Driven and Column-Oriented Architectures (MongoDB/Cassandra)
- Graph-Oriented Databases
- Performing CRUD Operations
- Querying NoSQL Stores
Understanding NoSQL Principles
- Understanding CAP Theorem
- Understanding BASE and Comparing It with ACID
- Concept of Scalability
- Indexing and Ordering Data Sets (MongoDB)
NoSQL Internals
- MongoDB
- Membase
- Apache Cassandra
- Berkeley DB
Using NoSQL in the Cloud
- Benefits of Cloud Architectures
- Working with Google App Engine: Workshop (2 Hours)
- Amazon SimpleDB: Workshop (2 Hours)