An introduction to the big data ecosystem, covering concepts of HDFS, MapReduce, and a deeper look at Apache Spark for large-scale data processing.