Accelerating Batch Processing with Apache Spark
Duration: 60 minutes
Track: BI Platform Architecture, Development & Administration
The Apache Spark framework can accelerate processing 100x or more on Hadoop from diverse data sources such as Cassandra, Hive, or HBase. We’re going to look at the architecture and go step by step through examples using R and Python against public data sources.