NEXPERTS ACADEMY 

GET THE RIGHT CERTIFICATION

Live Online Training 
Classroom Training 

Online Training 

Learning is now made easy with the advancement of technology. In Nexperts Academy, we engage and collaborate with industry experts and committed trainers to provide quality training with best user experience.

Learn from anywhere with our most flexible training hours

BIG DATA ANALYTICS

COURSE OUTLINE

APACHE SPARK & SCALA

What is Spark
Comparison with Hadoop
Components of Spark 

Apache Spark- Introduction, Consistency, Availability, Partition
Unified Stack Spark
Spark Components
Comparison with Hadoop – Scalding example, mahout, storm, graph 

Explain python example
Show installing a spark
Explain driver program
Explaining spark context with example
Define weakly typed variable
Combine scala and java seamlessly
Explain concurrency and distribution
Explain what is trait
Explain higher order function with example
Define OFI scheduler
Advantages of Spark
Example of Lamda using spark
Explain Mapreduce with example 

Difference between RISC and CISC
Define Apache Mesos
Cartesian product between two RDD
Define count
Define Filter
Define Fold
Define API Operations
Define Factors

How hadoop cluster is different from spark
Define writing data
Explain sequence file and its usefulness
Define protocol buffers
Define text file, CSV, Object Files and File System
Define sparse metrics
Explain RDD and Compression
Explain data stores and its usefulness

Define Elastic Search
Explain Streaming and its usefulness
Apache bookeeper
Define Dstream
Define mapreduce word count
Explain Paraquet
Scala ORM
Define Mlib
Explain multi graphix and its usefulness
Define property graph 

Persistence
Motivation
Example
Transformation
Scala and Python
Examples – K-means
Latent Dirichlet Allocation (LDA)

Motivation
Broadcast Variables
Example: Join
Alternative if one table is small
Better version with broadcast
How to create a Broadcast
Accumulators motivation
Accumulator Rules
Custom accumulators
Another common use
Creating an accumulator using spark context object 

Spark SQL main capabilities
Spark SQL usage diagram
Spark SQL
Important topics in Spark SQL- Data frames
Twitter language analysis 

Advantages of Scala
REPL (Read Evaluate print loop)
Language Features
Type Interface
Higher order function
Option
Pattern Matching
Collection
Currying
Traits
Application Space 

Uses of scala interpreter
Example of static object timer in scala
Testing of String equality in scala
Implicit classes in scala with examples
Recursion in scala
Currying in scala with examples
Classes in scala 

Constructor
Constructor overloading
Prop erties
Abstract classes
Type hierarchy in Scala
Object equality
Val and var methods  

Sealed traits
Case classes
Constant pattern in case classes
Wild card pattern
Variable pattern
Constructor pattern
Tuple pattern 

Java equivalents
Advantages of traits
Avoiding boilerplate code
Linearization of traits
Modelling a real world example

How traits are implemented in scala and java
How extending multiple traits is handled

Classification of scala collections
Iterable
Iterator and iterable
List sequence example in scala 

Array in scala
List in scala
Difference between list and list buffer
Array buffer
Queue in scala
Dequeue in scala
Mutable queue in scala
Stacks in scala
Sets and maps in scala
Tuples

Different import types
Selective imports
Testing-Assertions
Scala test case- scala test fun. Suite
Junit test in scala
Interface for Junit via Junit 3 suite in scala test
SBT
Directory structure for packaging scala application