Last month’s update was obsolete before it was published. This often happens because of multiple moving parts and my extended gestation period. I needed to correct entries for both AWS and Hortonworks. The new Tracker is correct as far as I know as of January 2, 2018. Enjoy.
Tag Archives: Apache
December 2017 Tracker – Where’s Hadoop?
The leading 2017 story of Hadoop distributions is that nobody seems to want to be accused of being in the business of providing them. Some former champions are expanding their shiny new positioning: Cloudera is selling Enterprise Data Hubs and Analytic DBs; Hortonworks offers DataPlanes and Next-Gen Data Platforms; MapR touts the Converged Data Platform. In the cloud world, Amazon’s EMR is at least designed to “run and…
Strata Standards Stories: Different Stores For Different Chores
Has HDFS joined MapReduce in the emerging “legacy Hadoop project” category, continuing the swap-out of components that formerly answered the question “what is Hadoop?” Stores for data were certainly a focus at Strata/Hadoop World in NY, O’Reilly’s well-run, well-attended, and always impactful fall event. The limitations of HDFS, including its append-only nature, have become inconvenient enough to…
Hadoop Projects Supported By Only One Distribution
The Apache Software Foundation has succeeded admirably in becoming a place where new software ideas are developed: today over 350 projects are underway. The challenges for the Hadoop user are twofold: trying to decide which projects might be useful in big data-related cases, and determining which are supported by commercial distributors. In Now, What is Hadoop? And What’s Supported? I list 10 supported…
Hadoop Summit Recap Part Two – SELECT FROM hdfs WHERE bigdatavendor USING SQL
Probably the most widespread, and commercially imminent, theme at the Summit was “SQL on Hadoop.” Since last year, many offerings have been touted, debated, and some have even shipped. In this post, I offer a brief look at where things stood at the Summit and how we got there. To net it out: offerings today…
Hadoop Distributions And Kids’ Soccer
The big players are moving in for a piece of the big data action. IBM, EMC, and NetApp have stepped up their messaging, in part to prevent startup upstarts like Cloudera from cornering the Apache Hadoop distribution market. They are all elbowing one another to get closest to “pure Apache” while still “adding value.” Numerous…
At Oracle, Closed May be the New Open. Whither MySQL?
I hope I can be forgiven the cute headline. It speaks to a series of events that were heard in Oracle Open World messaging, where the word “open” appeared much less frequently than in years past. Oracle is fortifying its borders, opening new fronts in its market battles, and slowly closing itself off from some…