Sequel for NoSQL

NoSQL leader Apache Cassandra gets big update in Cassandra v1.1

Chris Mayer

Already gaining heavyweight admirers for its linear scalability, the Apache Foundation releases the next iteration of its NoSQL powerhouse database

Since Apache Cassandra 1.0 emerged in October, the Big Data tectonic plates have shifted somewhat. There’s been a lot of traction across the board from enterprises – some choosing Apache Hadoop, some Cassandra or even MongoDB as their selection for embracing a highly scalable and distributed solution. Whether they need it to distribute data quickly or analyse the incoming data, there’s a choice for all.

NoSQL leading light Cassandra has certainly been one to watch with some heavyweights picking that solution for their needs. The enterprises putting their faith in Cassandra cover a broad spectrum such as Adobe, Cisco, Digg, Expedia, Formspring, IBM, Netflix, Rackspace, Reddit, Spotify, Twitter and even the US Government. The first major version was a signal that Cassandra was ready to lead the Big Data charge. The largest Cassandra production cluster so far exceeds 300 terabytes of data over 400 machines which definitely supports that claim.

Now just six months on, the Apache Foundation released the next iteration of Apache Cassandra after testing the water. The highly-scalable, distributed database which can handle thousands of requests across several hundred clusters, has received plaudits for its efficiency and flexibility as it can be used in the Cloud or partially on-premise in a hybrid data store.

Jonathan Ellis, Vice President of Apache Cassandra proclaimed the first steps in 1.0 were a ‘huge milestone’ for Cassandra, but is looking to the future now. He speaks highly of the next version saying “v1.1 improves on that foundation with many features and enhancements that developers and administrators have been asking for.”

His statement holds water. Aside from improved caching, Apache Cassandra v1.1 features a revised query language (CQL –Cassandra Query Language– a subset of SQL), storage control, schema/structure, Hadoop integration/output , data directory control, and scalability. The linkup with fellow Apache Big Data stablemate Hadoop is hardly surprising and seeing the two in tandem could indeed make a Big Data goliath.

This release signals Cassandra’s intentions for the long-run as it enters a stage of maturity. We can’t wait to see what’s next – can it get even better? Probably.

Apache Cassandra software is released under the Apache License v2.0, and is overseen by a Project Management Committee (PMC) that guides the Project’s day-to-day operations, community development, and product releases. Apache Cassandra source code, downloads, documentation, mailing lists, and related resources are available at
Inline Feedbacks
View all comments