IBM and Ahana – A Lakehouse Down Payment

IBM has taken another open source technology provider into its portfolio, acquiring Ahana, one of the leading vendors behind the massively parallel distributed in-memory SQL query engine Presto. Presto, first created and still actively developed at Meta, has attracted a broad array of open source contributors, and is championed by several vendors. It has alsoContinue reading “IBM and Ahana – A Lakehouse Down Payment”

August 2020 Hadoop Apache Project Tracker

Welcome to my co-author, Gartner analyst Sanjeev Mohan It’s been an eventful 6 months since Merv published the last of these trackers. The Hadoop ecosystem is far from dead, as many pundits predicted. Cloudera Data Platform (CDP) has begun to ship in bare metal, public cloud and private cloud versions. MapR is now HP EzmeralContinue reading “August 2020 Hadoop Apache Project Tracker”

Open – For Business – At the ASF

The Apache Software Foundation is about to celebrate an anniversary, and its extraordinary contribution to the economic refactoring of software stacks seems to be gaining more momentum with every passing year. After three Gartner Data and Analytics events on 3 continents with thousands of attendees in the past 4 weeks, I find myself more impressedContinue reading “Open – For Business – At the ASF”

January 2018 Hadoop Tracker

Last month’s update was obsolete before it published. This often happens because of multiple moving parts and my extended gestation period. I needed to correct entries for both AWS and Hortonworks. The new Tracker is correct as far as I know as of January 2, 2018. Enjoy. –more–

December 2017 Tracker – Where’s Hadoop?

The leading 2017 story of Hadoop distributions is that nobody seems to want to be accused of being in the business of providing them. Some former champions are expanding their shiny new positioning: Cloudera is selling Enterprise Data Hubs and Analytic DBs; Hortonworks offers DataPlanes and Next-Gen Data Platforms; MapR touts the Converged Data Platform. In the cloud world, Amazon’s EMR is at least designed to “run andContinue reading “December 2017 Tracker – Where’s Hadoop?”

Symposium Notes – Day Four Returns to Data Security, and to Hadoop

Thursday, the final day, reinforced a theme for the week: data security is heating up, and organizations are not ready. It came up in half of today’s final 10 meetings. “Is my data more secure, or less, in the cloud?” “Does using open source software for data management compromise how well I can protect it?”Continue reading “Symposium Notes – Day Four Returns to Data Security, and to Hadoop”

Symposium Notes – Day Three Features Data Assembly

With 24 meetings under my belt from the first two days at Orlando Symposium, Wednesday’s 13 (and a presentation) didn’t look quite as daunting. It began well, with enough time for a muffin and some tea at 730 AM in the analyst workroom near to the cubicle I’d spend the day in. Then I launched rightContinue reading “Symposium Notes – Day Three Features Data Assembly”

Symposium Notes – Day Two Jumps in the (Data) Lake

My second day of Symposium 1:1 meetings continued the “security of big data” theme (4 of the day’s 15 conversations – usually, but not always, about HDFS-based data), with a data lake flavor. The concerns were retroactive – often driven by an internal audit. “We built it, now how do we secure it?” is aContinue reading “Symposium Notes – Day Two Jumps in the (Data) Lake”

Symposium Notes – Day One Features Hadoop

Gartner Symposium is always exciting, challenging and stimulating for analysts; we get to interact with many organizations in a brief time during 1on1 meetings scheduled based on our coverage. It offers an fascinating snapshot of what is on people’s minds – enough so that they have traveled to a conference in part to have that discussion.Continue reading “Symposium Notes – Day One Features Hadoop”

Hadoop 2013 – Part Two: Projects

In Part One of this series, I pointed out that how significant attention is being lavished on performance in 2013. In this installment, the topic is projects, which are proliferating precipitously. One of my most frequent client inquiries is “which of these pieces make Hadoop?” As recently as a year ago, the question was pretty simple for most people:Continue reading “Hadoop 2013 – Part Two: Projects”