In February 2012, Gartner published How to Choose The Right Apache Hadoop Distribution (available to clients). At the time, the leading distributors were Cloudera, EMC (now Pivotal), Hortonworks (pre-GA), IBM, and MapR. These players all supported six Apache projects: HDFS, MapReduce, Pig, Hive, HBase, and Zookeeper. Things have changed. –more–
Tag Archives: Hadoop
Hadoop is in the Mind of the Beholder
This post was jointly authored by Merv Adrian (@merv) and Nick Heudecker (@nheudecker) and appears on both of our Gartner blogs. In the early days of Hadoop (versions up through 1.x), the project consisted of two primary components: HDFS and MapReduce. One thing to store the data in an append-only file model, distributed across anContinue reading “Hadoop is in the Mind of the Beholder”
Data Security for Hadoop – Add-on Choices Proliferating
In my post about the BYOH market last October, I noted that increasing numbers of existing players are connecting their offerings to Apache Hadoop, even as upstarts enter their markets with a singular focus. And last month, I pointed out that Nick Heudecker and I detected a surprising lack of concern about security in a recent Hadoop webinar. Clearly,Continue reading “Data Security for Hadoop – Add-on Choices Proliferating”
Aspirational Marketing and Enterprise Data Hubs
In the Hadoop community there is a great deal of talk of late about its positioning as an Enterprise Data Hub. My description of this is “aspirational marketing;” it addresses the ambition its advocates have for how Hadoop will be used, when it realizes the vision of capabilities currently in early development. There’s nothing wrongContinue reading “Aspirational Marketing and Enterprise Data Hubs”
AAA is Not Enough Security in the Big Data Era
Talk to security folks, especially network ones, and AAA will likely come up. It stands for authentication, authorization and accounting (sometimes audit). There are even protocols such as Radius (Remote Authentication Dial In User Service, much evolved from its first uses) and Diameter, its significantly expanded (and punnily named) newer cousin, implemented in commercial andContinue reading “AAA is Not Enough Security in the Big Data Era”
BYOH – Hadoop’s a Platform. Get Used To It.
When is a technology offering a platform? Arguably, when people build products assuming it will be there. Or extend their existing products to support it, or add versions designed to run on it. Hadoop is there. The age of Bring Your Own Hadoop (BYOH) is clearly upon us. Specific support for components such as PigContinue reading “BYOH – Hadoop’s a Platform. Get Used To It.”
What, Exactly, Is “Proprietary Hadoop”? Proposed: “distribution-specific.”
Many things have changed in the software industry in an era when the use of open source software has pervaded the mainstream IT shop. One of them is the significance – and descriptive adequacy – of the word “proprietary.” Merriam-Webster defines it as “something that is used, produced, or marketed under exclusive legal right of theContinue reading “What, Exactly, Is “Proprietary Hadoop”? Proposed: “distribution-specific.””
Hadoop Summit Recap Part Two – SELECT FROM hdfs WHERE bigdatavendor USING SQL
Probably the most widespread, and commercially imminent, theme at the Summit was “SQL on Hadoop.” Since last year, many offerings have been touted, debated, and some have even shipped. In this post, I offer a brief look at where things stood at the Summit and how we got there. To net it out: offerings todayContinue reading “Hadoop Summit Recap Part Two – SELECT FROM hdfs WHERE bigdatavendor USING SQL”
That Exciting New Stuff? Yeah… Wait Till It Ships.
A brief rant here: I am asked with great frequency how this RDBMS will hold off that big data play, how data warehouses will survive in a world where Hadoop exists, or whether Apple is done now that Android is doing well. There is a fundamental fallacy implicit in these questions. –more–
Hadoop Summit Recap Part One – A Ripping YARN
I had the privilege of keynoting this year’s Hadoop Summit, so I may be a bit prejudiced when I say the event confirmed my assertion that we have arrived at a turning point in Hadoop’s maturation. The large number of attendees (2500, a big increase – and more “suits”) and sponsors (70, also a significant uptick) madeContinue reading “Hadoop Summit Recap Part One – A Ripping YARN”