Apache Spark is an open-source, distributed, general-purpose cluster-computing framework maintained by the Apache Software Foundation. It is a unified analytics engine for big data workloads, built around speed and ease of use, with built-in modules for streaming, SQL, machine learning, and graph processing. Often described as fast, flexible, and developer-friendly, Spark is a leading platform for large-scale SQL, batch processing, and stream processing, among other workloads. Mike Olson, Chief Strategy Officer and Co-Founder at Cloudera, provides an overview of Apache.