August 2020 Hadoop Apache Project Tracker

Welcome to my co-author, Gartner analyst Sanjeev Mohan It’s been an eventful 6 months since Merv published the last of these trackers. The Hadoop ecosystem is far from dead, as many pundits predicted. Cloudera Data Platform (CDP) has begun to ship in bare metal, public cloud and private cloud versions. MapR is now HP EzmeralContinue reading “August 2020 Hadoop Apache Project Tracker”

December 2017 Tracker – Where’s Hadoop?

The leading 2017 story of Hadoop distributions is that nobody seems to want to be accused of being in the business of providing them. Some former champions are expanding their shiny new positioning: Cloudera is selling Enterprise Data Hubs and Analytic DBs; Hortonworks offers DataPlanes and Next-Gen Data Platforms; MapR touts the Converged Data Platform. In the cloud world, Amazon’s EMR is at least designed to “run andContinue reading “December 2017 Tracker – Where’s Hadoop?”

Hadoop Commercial Support Component Tracker – March 2017

Stack expansion has ground to a halt. The last time an Apache project was added to the list of those most supported by leading Hadoop distribution vendors was July 2016, when Kafka joined the other 14 then commonly included. Since then, no broad support for new projects has emerged. The only project that does seem successfulContinue reading “Hadoop Commercial Support Component Tracker – March 2017”

Hadoop Project Commercial Support Tracker July 2016

There are now 15 projects supported by all 5 distributors I track, and several have had new releases since April. Kafka is the newest addition, and I believe the remaining 4-supporter offerings, Mahout and Hue, will remain unsupported by IBM, who has its own alternatives. –More–

Hadoop Apache Project Commercial Support Tracker April 2016

There are now 19 commonly supported projects: Avro, Flume and Solr join the group supported by all 5 distributors and other changes appear as well. For this version of the tracker (last updated in December), I’ve made one sizable change: Pivotal has been dropped as a “leading distributor,” dropping the number to five. Pivotal relies on Hortonworks’ distro (asContinue reading “Hadoop Apache Project Commercial Support Tracker April 2016”

Strata Standards Stories: Different Stores For Different Chores

Has HDFS joined MapReduce in the emerging “legacy Hadoop project” category, continuing the swap-out of components that formerly answered the question “what is Hadoop?” Stores for data were certainly a focus at Strata/Hadoop World in NY, O’Reilly’s well-run, well-attended, and always impactful fall event. The limitations of HDFS, including its append-only nature, have become inconvenient enough toContinue reading “Strata Standards Stories: Different Stores For Different Chores”

Now, What is Hadoop?

This perennial question resurfaced recently in a thoughtful blog post by Andreas Neumann, Chief Architect of Cask, called What is Hadoop, anyway?. Ultimately, after a careful deconstruction of the terms in the question, Andreas concludes with “Does it really matter to agree on the answer to that question? In the end, everybody who builds an application or solutionContinue reading “Now, What is Hadoop?”

Perspectives on Hadoop Part Two: Pausing Plans

By Merv Adrian and Nick Heudecker  In the first post in this series , I looked at the size of revenue streams for RDBMS software and maintenance/support and noted that they amount to $33B, pointing out that pure play Hadoop vendors had a high hill to climb. (I didn’t say so specifically, but in 2014, Gartner estimates thatContinue reading “Perspectives on Hadoop Part Two: Pausing Plans”

Hadoop Questions from Recent Webinar Span Spectrum

This is a joint post authored with Nick Heudecker There were many questions asked after the last quarterly Hadoop webinar, and Nick and I have picked a few that were asked several times to respond to here. –More on my Gartner blog—

Which SQL on Hadoop? Poll Still Says “Whatever” But DBMS Providers Gain

Since Nick Heudecker and I began our quarterly Hadoop webinars, we have asked our audiences what they expected to do about SQL several times, first in January 2014. With 164 respondents in that survey, 32% said “we’ll use what our existing BI tool provider gives us,” reflecting the fact that most adopters seem not to wantContinue reading “Which SQL on Hadoop? Poll Still Says “Whatever” But DBMS Providers Gain”