Apache Spark

Home / Apache Spark

Apache Spark & Scala

Best Apache Spark and Scala Training institute in Chennai  are used to develop Spark Applications operating Scala programming and that helps user to become a Spark developer. Apache Spark can operate data from a different type of data repositories, such as includs the Hadoop Distributed File System (HDFS), NoSQL databases and relational data stores, such as Apache Hive. Spark Streaming is a one of spark module that starts stream processing of live data streamer. All data can also be taken from varies sources as such Kinesis, Twitter, TCPsockets, Kafka and also including WebSockets. Then the active data can be pushed out of the pipleline to file systems, databases and dashboards. Here stream data may proceed with high level functions like map, join or reduce.

APACHE SPARK:
1.INTRODUCTION TO SPARK

2.SPARK INSTALLATION DEMO

3.OVERVIEW OF SPARK ON A CLUSTER

4.SPARK STANDALONE CLUSTER

5.SPARK RDD

6.TRANSFORMATIONS IC RDD

7.ACTIONS IN RDD

8.PERSISTENCE IN RDD

9.LOADING DATA IN RDD
10.SAVING DATA THROUGH RDD
11.KEY-VALUE PAIR RDD

12.MAP REDUCE AND PAIR RDD OPERATIONS
13.SCALA AND HADOOP INTEGRATION

14.SPARK SQL

15.DATA FRAME CONCEPT

16.SQL CONTEXT WITH EXAMPLE – JSON