On-Demand: Big Data Analytics with Cassandra & Spark
Special Presentation From DataStax and KPI Partners.
Learn how to combine real-time data collection with deep analytical insight through Cassandra and Spark.
Combining Spark with Cassandra pairs real-time data collection with powerful analytics. In this online event, DataStax and KPI Partners explore how to combine these technologies for advanced analytics and share best practices for implementation.
What Is Cassandra?
Apache Cassandra is the leading distributed database in use at thousands of sites with the world's most demanding scalability and availability requirements. Cassandra's bread and butter is serving millions of concurrent transactions (reads, writes, updates) while providing zero downtime and linear scalability. Not all data needs are transactional, however: the data captured and served by this online transactional system also needs to be analyzed.
What Is Spark?
Apache Spark is a distributed data analytics computing framework that has gained a lot of traction for processing large amounts of data in an efficient and user-friendly manner. It comes with a suite of tools covering bulk analytics, SQL support, machine learning, graph analytics, and streaming. All it needs is data to process.