The new Flink stable release is here and makes streaming and batch data processing super smooth. The 1.5.0 release comes stuffed with a number of new features and major improvements.
The GDPR provides for the privacy of EU citizens regardless of where in the world their data is stored and used, so today’s CDO will need to have the ability to manage data collection, storage, and usage accordingly. Maggie Kimberl explains how this regulation complicates data collection.
Couchbase and the dynamics of the NoSQL market: “There is life beyond analytical and relational databases”
2017 has been an interesting year in the NoSQL space. The big data market is full of opportunities but there are also a lot of changes happening right now. We talked with Perry Krug, Principal Architect at Couchbase about the future of the NoSQL space, the market dynamics and what’s next for Couchbase.
Hadoop is back! The latest version [3.0.0] of the Open Source software framework for reliable, scalable, distributed computing brings a lot of new features, including an early preview (alpha 2) of a major revision of YARN Timeline Service, shell script rewrite and more.
What does the future of tech look like? We asked four experts about their predictions for the next year. Expect to hear lots about DevOps, artificial intelligence, cloud technologies, security, and more.
Instaclustr, an open source as-a-service company, delivering reliability at scale wants to improve and maintain Apache Cassandra. Their intent is to build a team of committers that are actively involved and can provide some real operational experience to the community. We talked with Ben Bromhead, CTO and Co-Founder at Instaclustr about all this and more.
Our first ML Conference will debut in December in Berlin. Until then, we’d like to give you a taste of what’s to come. We talked with, Markus Ehrenmüller-Jensen, Business Intelligence Architect at Runtastic about how the company involves machine learning into their daily business, the benefits, the battle scars and everything in between. Also, you’ll get a sneak peek at his talk.
Internet of Things(IoT) and big data are closely intertwined and although they are not the same thing, it is very hard to talk about one without the other. Before we analyze their connection, let us take a much closer look at these two practices.
Lightbend recently revealed the findings of a new survey of 2,457 global developers. We talked with Mark Brewer, the CEO of Lightbend about the key findings, differences between fast data and big data, misconceptions and more.
Java’s not going anywhere, no doubt about that. But why are people choosing to use Java? And what sort of role will it play in the future development of Big Data and IoT? In this article, Jane Reyes explores the relationship between this old favorite of a programming language and the newest tech in the field.
The idea of collecting and analyzing data to gather insights isn’t really new. However, the specific roles involved in the collection and analysis of data have grown and evolved considerably over the last decade as the amount of data being created has increased at a staggering rate. In this article, Cher Zavala explains why data engineers are so important.
Containers revolutionize the way modern software is being developed and operated. We talked to Johannes Unterstein, Distributed Applications Engineer at Mesosphere and JAX DevOps speaker about container tools and technologies and containers’ usefulness in a DevOps context.
It’s been one year since Yahoo open-sourced CaffeOnSpark so the tech giant has found a way to celebrate it — by open-sourcing TensorFlowOnSpark, its latest open source framework for distributed deep learning on big data clusters.
Apache Beam has successfully graduated from incubation, becoming a new Top-Level Project at the Apache Software Foundation. We invited the Apache Software Foundation’s Davor Bonaci and Jean-Baptiste Onofré to talk about the project’s journey to becoming a Top-Level Project and concrete plans for its future.