Strata Standards Stories: Different Stores For Different Chores

Has HDFS joined MapReduce in the emerging “legacy Hadoop project” category, continuing the swap-out of components that formerly answered the question “what is Hadoop?” Stores for data were certainly a focus at Strata/Hadoop World in NY, O’Reilly’s well-run, well-attended, and always impactful fall event. The limitations of HDFS, including its append-only nature, have become inconvenient enough to push the community to “invent” something DBMS vendors like Oracle did decades ago: a bypass. After some pre-event leaks about its arrival, Cloudera chose its Strata keynote to announce Kudu, a new columnstore written in C++, bypassing HDFS entirely. Kudu will use an Apache license and will be submitted to the Apache process at some undetermined future time.


Now, What is Hadoop?

This perennial question resurfaced recently in a thoughtful blog post by Andreas Neumann, Chief Architect of Cask, called What is Hadoop, anyway?. Ultimately, after a careful deconstruction of the terms in the question, Andreas concludes with

“Does it really matter to agree on the answer to that question? In the end, everybody who builds an application or solution on Hadoop must pick the technologies that are right for the use case.”

We’ve agreed from the beginning – that is the only answer that really matters. Still, the question continues to come up for  end users of the stack and for vendors like Cask (it helps them think about what to support in their application development offering Cask Data App Platform (CDAP).

Analysts too: I’ve discussed it several times, including a post a year ago called What Is Hadoop….Now? tracking the path from 6 commonly supported projects in 2012 to 15 in June 2014, across a set of distributors that included Cloudera, Hortonworks, MapR and IBM. “Support” here means you pay for subscription that explicitly includes the named project.

This year, the expansion process has continued – and it does matter.

–more on Gartner blog–



Perspectives on Hadoop Part Two: Pausing Plans

By Merv Adrian and Nick Heudecker 

In the first post in this series , I looked at the size of revenue streams for RDBMS software and maintenance/support and noted that they amount to $33B, pointing out that pure play Hadoop vendors had a high hill to climb. (I didn’t say so specifically, but in 2014, Gartner estimates that the three leading vendors generated less than $150M.)

In this post, Nick and I turn from Procurement to Plans and examine the buying intentions uncovered in Gartner surveys.


–more in Gartner blog–

Perspectives on Hadoop: Procurement, Plans, and Positioning

I have the privilege of working for the world’s leading information technology research and advisory company, covering information management with a strong focus for the past few years on an emerging software stack called Hadoop. In the early part of 2015, that particular technology is moving from early adopter status to early majority in its marketplace adoption. The discussions and published work around it have been exciting and controversial, so in this post (and a couple to follow) I describe three interlocking research perspectives on Hadoop: procurement (counting real money actually spent); plans (surveys of intentions to invest) and positioning (subjective interpretations of what the first two mean.)

Procurement Perspective: Hadoop is a (Very) Small Market Today

–more on Gartner blog–



Hadoop Questions from Recent Webinar Span Spectrum

This is a joint post authored with Nick Heudecker
There were many questions asked after the last quarterly Hadoop webinar, and Nick and I have picked a few that were asked several times to respond to here.

–More on my Gartner blog

Which SQL on Hadoop? Poll Still Says “Whatever” But DBMS Providers Gain

Since Nick Heudecker and I began our quarterly Hadoop webinars, we have asked our audiences what they expected to do about SQL several times, first in January 2014. With 164 respondents in that survey, 32% said “we’ll use what our existing BI tool provider gives us,” reflecting the fact that most adopters seem not to want to concern themselves overmuch with the details.

–More on my Gartner blog

Who Asked for an Open Data Platform?

This is a joint blog post between Nick Heudecker and Merv Adrian.

It’s Strata week here in San Jose, and with that comes a flood of new announcements on products, partners and funding. Today’s big announcement came in the form of the Open Data Platform (ODP). A number of companies have signed on, but in short, it’s got some Hadoopers, some service providers and systems integrators, as well as some analytics apps vendors.

–more on my Gartner blog

Hadoop Adoption? Moving, But Not Necessarily Forward

Gartner’s quarterly Hadoop webinar in February 2015 showed that adoption of Hadoop is not rising quite as dramatically as some might believe. It’s flat compared to Q42014. Of nearly 1200 attendees, 465 shared their thinking with us via the usual polling, and the Deployed percentage was the same. Not that surprising for only 3 months between polls. And Q1 is not a big month for most software, especially a category that is at best generating a few hundred million dollars in revenues.

–more on my Gartner blog

Prediction Is Hard – Especially About the Future

OK, I admit it – I stole the title from a much smarter man. I thought that man was Yogi Berra, but maybe not – more about that at the end of this post.

Every year, Gartner issues a series of Predicts documents. This year I had the pleasure of doing one for my team on Information Infrastructure Technology. Now, I’m a software guy, and the team I’m on is all software people, so a document assigned to our team would typically be about – well, information software technology. But that would have missed the point rather dramatically, so I connected with a few colleagues and got their OK to use some of their predictions in the small set any document can include.

— more on Gartner blog —

Hadoop Deployments – Slow to Grow So Far

How have Hadoop deployments grown this year? Slowly.

Here’s a little anecdata for you:

During 2014, my colleague Nick Heudecker and I conducted quarterly webinars on the State of Hadoop, and in the Q2, Q3 and Q4 sessions we asked our (steadily growing) audience about their deployments via online polls. These results should not be considered definitive (they’re unqualified – though attendees do have to jump through a hoop or two to attend, we don’t keep extensive firmographics, titles, etc.)

See the data on my Gartner blog here.


Get every new post delivered to your Inbox.

Join 21,684 other followers