b/training4all edited 2 years ago by Foreverloving

Big Data Analytics with Hadoop and Apache Spark

This post was published 2 years ago. Download links are most likely obsolete. If that's the case, try asking the uploader to re-upload.

Big Data Analytics with Hadoop and Apache Spark

MP4 | Video: AVC, 1280x720 30 fps | Audio: AAC, 48 KHz, 2 Ch | Duration: 1h 1m
Skill Level: Intermediate | Genre: eLearning | Language: English + Subtitles | Size: 143 MB

Apache Hadoop was a pioneer in the world of big data technologies, and it continues to be a leader in enterprise big data storage. Apache Spark is the top big data processing engine and provides an impressive array of features and capabilities. When used together, the Hadoop Distributed File System (HDFS) and Spark can provide a truly scalable big data analytics setup. In this course, learn how to leverage these two technologies to build scalable and optimized data analytics pipelines. Instructor Kumaran Ponnambalam explores ways to optimize data modeling and storage on HDFS; discusses scalable data ingestion and extraction using Spark; and provides tips for optimizing data processing in Spark. Plus, he provides a use case project that allows you to practice your new techniques.

Topics include

Data modeling for analytics
Best practices for HDFS data storage
Ingesting and extracting data with Spark
Effectively managing partitions
Improving the join process
Best practices for optimizing Spark processing

Homepage

Screenshots

Big Data Analytics with Hadoop and Apache Spark