Eagle eye on Big Data

Talend eyes up Big Data and NoSQL advances in Open Studio 5.2

Chris Mayer
talend-2

Eclipse Strategic Developer Member and JCP member Talend yesterday unveiled the latest version of their data integration hub Open Studio, honing in on helping enterprises understand their data woes

Eclipse Strategic Developer Member and JCP member Talend yesterday unveiled the latest
version of their data integration hub Open Studio, honing in on
helping enterprises understand their data woes.

Like so many other vendors at the moment, Talend appear keen on
keeping their software solution open to as many avenues as
possible. Talend Open Studio 5.2 bolsters the offering somewhat
with new capabilities such as Hadoop data profiling and support to
a plethora of popular NoSQL databases.

One common complaint in the world of Big Data is the inability to
obtain value from amassed data. Storage isn’t really the problem
but working out what to do next definitely is. Talend even go
further saying “data quality issues are endemic in most
organizations.”

“Data profiling is also an emerging, but critical, big data best
practice. Without the ability to assess big data and its relevance
to their needs, organizations are likely to accumulate data that is
at best irrelevant or at worst detrimental to their use of Hadoop
and other big data platforms,” they say in a press release.

To solve that, they’ve brought in Hadoop data profiling allowing
users to get out of “neutral” to analyse data in Hive databases on
Hadoop, scale up to additional servers for volume and crucially,
produce custom graphical reports of this information.

Elsewhere, a number of NoSQL database connectors have been ushered
into the fold for Open Studio 5.2. Connectors for MongoDB,
Cassandra and HBase have arrived for Talend Platform for Big Data
and Talend Open Studio for Big Data. There’s also native connectors
for Hadoop Distributed File System (HDFS), HCatalog, Hive, Oozie,
Pig and Sqoop.

“Talend version 5.2 delivers on our vision of simplifying the
development, integration and management of big data so that
businesses can focus on using that data to make faster and more
informed decisions,” said Fabrice Bonan, co-founder and chief
technical officer at Talend.

“We provide the most powerful and versatile open source, big data
solution to help organizations load, extract and improve disparate
data while leveraging the massively parallel processing power of
big data technologies including Apache Hadoop and leading NoSQL
databases.”

Talend’s collection of 5.2 commercial products are set to arrive
before the end of 2012, but you can grab the community
version
right now.

Author
Comments
comments powered by Disqus