It’s touted as the industry’s only open-source enterprise grad unified stream and batch processing platform. Apache Apex community manager Desmond Chan show’s us what exactly that means and how this open-source engine handles big data.
After a preview version had been published at the end of November 2015, the final version of Apache Spark 1.6 is at long last ready for download. The update contains a total of over 1,000 changes; release highlights include a variety of performance improvements, the new Dataset API and expanded data science functions.
VMTurbo founders Yechiam Yemini and Yuri Rabover, as well as Principal Solutions Engineer Eric Wright have braved a look into the future and identified a few trends for the upcoming year.
If you search Google Scholar for “machine learning”, it returns over 1,800,000 publications. As the buzz around this technology grows, so too does its complexity. Sebastian Raschka, author of Packt’s “Python Machine Learning”, introduces us to the three types of machine learning.
At the SAP TechEd in Barcelona, SAP brought its new technology down to developer level, showcasing the latest in SAP Hana, such as the Hana Cloud Platform’s usage of Cloud Foundry, while calling on IT to innovate and build a ‘digital core’, rather than just integrate.
It’s been 10 years since Big Data made the rounds for the first time as a mainstream concept and many questions are still unanswered. Emmanuel Letouzé, in his W-JAX Keynote, looks at the relationship between data, ethics, politics and human rights.
Do you think with the mind of a young developer? How do you keep re-educating yourself? What is Flux? And why should programmers be wary of copying patterns? Five IT wisdoms from day #1 at the W-JAX 2015.
With a plethora of logging tools available at a range of price points, it might be hard to decide on what to use. Rather than diving into a tonne of research, we’ve done it for you – a host of popular approaches to data processing have been fact checked and outlined for your convenience.
Every employee and every end user should have the right to find answers using data analytics. But the current reliance on IT for key information is creating an unnecessary bottleneck, says DataHero’s Chris Neumann.
Nowhere else are business decisions as hype-oriented as in IT. And while NoSQL is all well and good, MySQL is often the sensible choice in terms of operational cost and scalability, says JAX London speaker Aviran Mordo.
Considering a change in your architecture? If you’re looking at Apache Spark, it might be worth seeing what Alex Zhitnitsky has to say about the top 5 things you should consider before the jump. Software architecture is hard.
Data crunchers can rejoice at the sight of Spark 1.4 – support for R, Python 3 plus a load of clustering and container management improvements all make their way to the top of the highlights reel for this cluster computing framework.
A total of approximately 480 JIRA tickets is what it takes to update Apache Hive to Version 1.2. The data warehouse software for Apache Hadoop has already reached its third release of the year, with the Hive community continuing its growth.
Although Facebook famously ditched Cassandra to use HBase for its messenger service, the NoSQL database remains largely overlooked. Ubeeko CEO Ghislain Mazars takes a look under the hood of HBase features.