big data – IT Market Strategy

December 2017 Tracker – Where’s Hadoop?

The leading 2017 story of Hadoop distributions is that nobody seems to want to be accused of being in the business of providing them. Some former champions are expanding their shiny new positioning: Cloudera is selling Enterprise Data Hubs and Analytic DBs; Hortonworks offers DataPlanes and Next-Gen Data Platforms; MapR touts the Converged Data Platform. In the cloud world, Amazon’s EMR is at least designed to “run andContinue reading “December 2017 Tracker – Where’s Hadoop?”

Strata Standards Stories: Different Stores For Different Chores

Has HDFS joined MapReduce in the emerging “legacy Hadoop project” category, continuing the swap-out of components that formerly answered the question “what is Hadoop?” Stores for data were certainly a focus at Strata/Hadoop World in NY, O’Reilly’s well-run, well-attended, and always impactful fall event. The limitations of HDFS, including its append-only nature, have become inconvenient enough toContinue reading “Strata Standards Stories: Different Stores For Different Chores”

Hadoop Projects Supported By Only One Distribution

The Apache Software Foundation has succeeded admirably in becoming a place where new software ideas are developed: today over 350 projects are underway. The challenges for the Hadoop user are twofold: trying to decide which projects might be useful in big data-related cases, and determining which are supported by commercial distributors. In Now, What is Hadoop? And What’s Supported? I list 10 supportedContinue reading “Hadoop Projects Supported By Only One Distribution”

Hadoop Summit Recap Part Two – SELECT FROM hdfs WHERE bigdatavendor USING SQL

Probably the most widespread, and commercially imminent, theme at the Summit was “SQL on Hadoop.” Since last year, many offerings have been touted, debated, and some have even shipped. In this post, I offer a brief look at where things stood at the Summit and how we got there. To net it out: offerings todayContinue reading “Hadoop Summit Recap Part Two – SELECT FROM hdfs WHERE bigdatavendor USING SQL”

Diary of an Asian Swing: Day 5

Second day of Singapore meetings. APJ market conversations with IT vendors, watching the emergence of big data here. Seeing activity in the field is always a fascinating counterpoint to the briefings and conferences back home. But the big data phenomenon is surprisingly rapid. Certainly the user conversations have been similar in some ways to thoseContinue reading “Diary of an Asian Swing: Day 5”

Apache Hadoop 1.0 Doesn’t Clear Up Trunks and Branches Questions. Do Distributions?

In early January 2012, the world of big data was treated to an interesting series of product releases, press announcements, and blog posts about Hadoop versions. To begin with, we had the announcement of Apache version 1.0 at long last, in a press release. Although there were grumblings here and there in the twittersphere thatContinue reading “Apache Hadoop 1.0 Doesn’t Clear Up Trunks and Branches Questions. Do Distributions?”

Cloudera Convenes Colleagues to Crunch Content (Make Mine Membase)

Over the past two years, Cloudera has demonstrated the power of surrounding emerging open source software with support services, expertise and its own IP. The firm has racked up over 30 customers since its founding in late 2008, and emerged as the leading source of Apache Hadoop. Cloudera’s recent C round of financing brought itsContinue reading “Cloudera Convenes Colleagues to Crunch Content (Make Mine Membase)”

IBM Showcases Software Vision and Hadoop Research

At IBM’s 8th annual Connect meeting with analysts, Steve Mills, Senior VP and Group Executive, had much to crow about. Software is the engine driving IBM’s profitability, anchoring its customer relationships, and enabling the vaulting ambition to drive the company’s Smarter Planet theme into the boardroom. Mills’ assets are formidable: 36 labs worldwide have more than 100Continue reading “IBM Showcases Software Vision and Hadoop Research”