search

#Big Data

They share a closely knitted future

Understanding the relationship between IoT and Big Data

Internet of Things(IoT) and big data are closely intertwined and although they are not the same thing, it is very hard to talk about one without the other. Before we analyze their connection, let us take a much closer look at these two practices.

Starring: Java!

What role will Java play in the future of Big Data and IoT?

Java’s not going anywhere, no doubt about that. But why are people choosing to use Java? And what sort of role will it play in the future development of Big Data and IoT? In this article, Jane Reyes explores the relationship between this old favorite of a programming language and the newest tech in the field.

Still the “sexiest job of the 21t century”

Understanding data engineering and what it means for the future

The idea of collecting and analyzing data to gather insights isn’t really new. However, the specific roles involved in the collection and analysis of data have grown and evolved considerably over the last decade as the amount of data being created has increased at a staggering rate. In this article, Cher Zavala explains why data engineers are so important.

SMACK — Next generation Big Data

Big Data becomes Fast Data

Big Data is changing. Buzzwords such as Hadoop, Storm, Pig and Hive are not the darlings of the industry anymore —they are being replaced by a powerful duo: Fast Data and SMACK. Such a fast change in such a (relatively) young ecosystem begs the following question: What is wrong with the current approach? What is the difference between Fast and Big Data? And what is SMACK?

“If you can cache everything in a very efficient way, you can often change the game”

Netflix OSS: Change the game with Hollow

Netflix Hollow is a Java library and comprehensive toolset for harnessing small to moderately sized in-memory datasets which are disseminated from a single producer to many consumers for read-only access. It is built with servers busily serving requests at or near maximum capacity in mind and its aim is to address the scaling challenges of in-memory datasets. Let’s see the advantages that come from using Netflix Hollow.

For a good cause

IBM joins R Consortium, aims to make analytics easier

As a (new) member of the R Consortium, IBM will work side by side with the R user community and support the project’s mission to pinpoint, create and implement infrastructure projects that drive standards and best practices for R code.

Open-source streaming analytics

Big Data with Apache Apex

It’s touted as the industry’s only open-source enterprise grad unified stream and batch processing platform. Apache Apex community manager Desmond Chan show’s us what exactly that means and how this open-source engine handles big data.

Data analysis tool in a new version

Apache Spark 1.6 with Dataset API

After a preview version had been published at the end of November 2015, the final version of Apache Spark 1.6 is at long last ready for download. The update contains a total of over 1,000 changes; release highlights include a variety of performance improvements, the new Dataset API and expanded data science functions.