 | BIG DATA ANALYTICS WITH APACHE HADOOP AND SPARK: A guide to using Hadoop and Spark frameworks for big data analytics and real-time processing (2025) by Sloane, Renata |
 | Ultimate Big Data Analytics with Apache Hadoop: Master Big Data Analytics with Apache Hadoop Using Apache Spark, Hive, and Python (English Edition) (2024) by Simhadri Govindappa |
 | Data Engineering with Databricks Cookbook: Build effective data and AI solutions using Apache Spark, Databricks, and Delta Lake (2024) by Chadha, Pulkit |
 | Mastering Apache Hadoop: A Comprehensive Guide to Learn Apache Hadoop (2023) by Ltd, Cybellium, Hermans, Kris |
 | Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark (2022) by Parsian, Mahmoud |
 | Learning Spark: Lightning-Fast Data Analytics (2020) by Damji, Jules S., Wenig, Brooke, Das, Tathagata, Lee, Denny |
 | Stream Processing with Apache Spark: Mastering Structured Streaming and Spark Streaming (2019) by Maas, Gerard, Garillot, Francois |
 | Mastering Hadoop 3: Big data processing at scale to unlock unique business insights (2019) by Singh, Chanchal, Kumar, Manish |
 | Spark: The Definitive Guide: Big Data Processing Made Simple (2018) by Chambers, Bill, Zaharia, Matei |
 | Big Data from Scratch: Building a 4-nodes Hadoop cluster and use of the Map-Reduce Simple Skyline Algorithm (MR-SSA) based on the R Language (Big Data & MR-SSA) (2017) by Leliopoulos, Panagiotis |
 | Apache Spark 2.x Machine Learning Cookbook: Over 100 recipes to simplify machine learning model implementations with Spark (2017) by Amirghodsi, Siamak, Rajendran, Meenakshi, Hall, Broderick, Mei, Shuen |
 | Moving Hadoop to the Cloud: Harnessing Cloud Features and Flexibility for Hadoop Clusters (2017) by Havanki, Bill |
 | Advanced Analytics with Spark: Patterns for Learning from Data at Scale (2017) by Ryza, Sandy, Laserson, Uri, Owen, Sean, Wills, Josh |
 | Mastering Apache Spark 2.x - Second Edition: Scale your machine learning and deep learning systems with SparkML, DeepLearning4j and H2O (2017) by Kienzler, Romeo |
 | Apache Spark 2.x for Java Developers: Explore big data at scale using Apache Spark 2.x Java APIs (2017) by Gulati, Sourav, Kumar, Sumit |
 | Moving Hadoop to the Cloud: Harnessing Cloud Features and Flexibility for Hadoop Clusters (2017) by Havanki, Bill |
 | High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark (2017) by Karau, Holden, Warren, Rachel |
 | Hadoop in 24 Hours, Sams Teach Yourself (2017) by Aven, Jeffrey |
 | Learning Apache Spark 2 (2017) by Abbasi, Muhammad Asif |
 | Top 50 Apache Spark Interview Questions & Answers (2017) by Powerhouse, Knowledge |
 | 99 Apache Spark Interview Questions for Professionals: A GUIDE TO PREPARE FOR APACHE SPARK INTERVIEW QUESTIONS (2017) by Kumar, Yogesh, Kumar, Hitesh |
 | Top 50 Apache Hadoop Interview Questions and Answers (2016) by Powerhouse, Knowledge |
 | Pro Hadoop Data Analytics: Designing and Building Big Data Systems using the Hadoop Ecosystem (2016) by Koitzsch, Kerry |
 | Programming Pig: Dataflow Scripting with Hadoop (2016) by Gates, Alan, Dai, Daniel |
 | Apache Spark for Data Science Cookbook (2016) by Chitturi, Padma Priya |
 | Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale (Addison-wesley Data & Analytics) (2016) by Mendelevitch, Ofer, Stella, Casey, Eadline, Douglas |
 | Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS (Addison-Wesley Data & Analytics Series) (2016) by Alapati, Sam |
 | Spark in Action (2016) by Zecevic, Petar, Bonaci, Marko |
 | Apache Spark 2 for Beginners (2016) by Thottuvaikkatumana, Rajanarayanan |
 | Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools (2016) by Vohra, Deepak |
 | Big Data SMACK: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka (2016) by Estrada, Raul, Ruiz, Isaac |
 | Apache Spark in 24 Hours, Sams Teach Yourself (2016) by Aven, Jeffrey |
 | Apache Spark Interview Question & Answers (2016) by Goel, Naman |
 | Spark GraphX in Action (2016) by Malak, Michael, East, Robin |
 | Spark Tutorials with Scala: The Beginner's Guide (2016) by McGrath, Todd |
 | Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark (2016) by Nabi, Zubair |
 | Apache Spark Machine Learning Blueprints (2016) by Liu, Alex |
 | Professional Hadoop (2016) by Antony, Benoy, Boudnik, Konstantin, Adams, Cheryl, Shao, Branky, Lee, Cazen, Sasaki, Kai |
 | Spark: Big Data Cluster Computing in Production (2016) by Ganelin, Ilya, Orhian, Ema, Sasaki, Kai, York, Brennon |
 | Learning Spark: Analytics With Spark Framework (2016) by Moore, Joseph |
 | Apache Spark Scala Interview Questions: Shyam Mallesh (2016) by Mallesh, Shyam |
 | Kick Start Hadoop: Apache Pig: Getting started with Data Science on Hadoop (2015) by Meir-Huber, Mario |
 | Big Data Analytics with Spark: A Practitioner's Guide to Using Spark for Large Scale Data Analysis (2015) by Guller, Mohammed |
 | Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2 Ecosystem (Addison-wesley Data & Analytics Series) (2015) by Eadline, Douglas |
 | Mastering Apache Spark: Gain expertise in processing and storing data by using advanced techniques with Apache Spark (2015) by Frampton, Mike |
 | Apache Spark Graph Processing (2015) by Ramamonjison, Rindra |
 | Data Algorithms: Recipes for Scaling Up with Hadoop and Spark (2015) by Parsian, Mahmoud |
 | Hadoop Security: Protecting Your Big Data Platform (2015) by Spivey, Ben, Echeverria, Joey |
 | Spark Cookbook: Over 60 recipes on Spark, covering Spark Core, Spark SQL, Spark Streaming, MLlib, and GraphX libraries (2015) by Yadav, Rishi |
 | Apache Oozie: The Workflow Scheduler for Hadoop (2015) by Islam, Mohammad Kamrul, Srinivasan, Aravind |