January 10, 2017

 

Notes on Apache Spark

Based on Datastax intro.

1. Distributed computation engine (aim on low latency)
2. Could be used both batch-mode or interactive
3. In-memory
4. Faster than Hadoop
5. Fault-tolerance out-of-the box

Labels: , , ,

Comments: Post a Comment



<< Home

This page is powered by Blogger. Isn't yours?