Hadoop Summit Recap Part Two – SELECT FROM hdfs WHERE bigdatavendor USING SQL

Probably the most widespread, and commercially imminent, theme at the Summit was “SQL on Hadoop.” Since last year, many offerings have been touted, debated, and some have even shipped. In this post, I offer a brief look at where things stood at the Summit and how we got there. To net it out: offerings today range from the not-even-submitted to GA – if you’re interested, a bit of familiarity will help. Even more useful: patience.


Hadoop 2013 – Part Four: Players

The first three posts in this series talked about performance, projects, and platforms as key themes in what is beginning to feel like a watershed year for Hadoop. All three are reflected in the surprising emergence of a number of new players on the scene, as well as some new offerings from additional ones, which I’ll cover in another post. Intel, WANdisco, and Data Delivery Networks recently entered the distribution game, making it clear that capitalizing on potential differentiators (real or perceived) in a hot market is still a powerful magnet. And in a space where much of the IP in the stack is open source, why not go for it? These introductions could all fall into the performance theme as well – they are all driven by innovations intended to improve Hadoop speed.


Hadoop 2013 – Part Three: Platforms

In the first two posts in this series, I talked about performance and projects as key themes in Hadoop’s watershed year. As it moves squarely into the mainstream, organizations making their first move to experiment will have to make a choice of platform. And – arguably for the first time in the early mainstreaming of an information technology wave – that choice is about more than who made the box where the software will run, and the spinning metal platters the bits will be stored on. There are three options, and choosing among them will have dramatically different implications for the budget, for the available capabilities, and for the fortunes of some vendors seeking to carve out a place in the IT landscape with their offerings.


Hadoop 2013 – Part One: Performance

It’s no surprise that we’ve been treated to many year-end lists and predictions for Hadoop (and everything else IT) in 2013. I’ve never been that much of a fan of those exercises, but I’ve been asked so much lately that I’ve succumbed. Herewith, the first of a series of posts on what I see as the 4 Ps of Hadoop in the year ahead: performance, projects, platforms and players.


IBM STG Trip Report: Hardware-Software Synergy Yielding Dividends

Every year in the fourth quarter, IBM assembles its Systems & Technology Group (STG – the hardware guys) executives for discussions with the analyst community to review results and discuss the year ahead. STG’s Senior VP Rod Adkins teed up this year’s meeting with a reminder that STG and Software Group (SWG) both now report to Steve Mills, SVP and Group Executive – Software & Systems. This change naturally suggests the possibilities for increased synergies between the two parts of IBM, and although much collaboration has been in place over the years, IBM’s attention to leveraging the opportunity has clearly come into sharper focus. The interaction was a recurrent theme.

Microsoft Leaps Late, Lags with SQL Server PDW

Microsoft chose a user group meeting, Professional Association for SQL Server (PASS), for the rollout of its long-awaited, and late, SQL Server 2008 R2 Parallel Data Warehouse (note, yet again, how foolish it is for vendors to trap themselves with dates in product names). PDW is late to market; there are other MPP DBMS players there already, and Microsoft is behind in functionality compared to some of them. Some of the most eagerly awaited features are evidently not slated for the first release. It’s also far behind its originally planned ship date following the acquisition of DatAllegro in 2008.

At Oracle, Closed May be the New Open. Whither MySQL?

I hope I can be forgiven the cute headline. It speaks to a series of events that were heard in Oracle Open World messaging, where the word “open” appeared much less frequently than in years past. Oracle is fortifying its borders, opening new fronts in its market battles, and slowly closing itself off from some former partners and community relationships. It’s Fortress Oracle time. Its overall posture has hardened, and the implications for any but the largest MySQL customers are worrisome.

Many actions support this interpretation. The “fork you” message to Red Hat at OOW was an obvious indicator, tightening the OS play that accompanies the hardware ownership now rounding out Oracle’s full-stack story. Now, a few weeks later, Oracle’s move to drop low-end MySQL support, abandoning/conceding low-end customers to others, seems indicative of Oracle’s willingness both to move away from “open” and to minimize investment in low-end customers. Mark Hurd is the new owner of support, and his reputation for cost-cutting should not be ignored in considering this; moreover, Windows is the majority platform for MySQL, and Oracle doesn’t want to invest there either.

IBM Acquires Netezza – ADBMS Consolidation Heats Up

IBM’s bid to acquire Netezza makes it official; the insurgents are at the gates. A pioneering and leading ADBMS player, Netezza is in play for approximately $1.7 billion or 6 times revenues [edited 9/30; previously said "earnings," which is incorrect.] When it entered the market in 2001, it catalyzed an economic and architectural shift with an appliance form factor at a dramatically different price point. Titans like Teradata and Oracle (and yes, IBM) found themselves outmaneuvered as Netezza mounted a steadily improving business, adding dozens of new names every quarter, continuing to validate its market positioning as a dedicated analytic appliance. It’s no longer alone there; some analytic appliance play is now in the portfolio of most sizable vendors serious about the market.

Attunity Scores a Win With RMS CDC Support

Today’s email brought a reminder of an old, valued data format: RMS. When I posted about Attunity earlier this year, I noted the value of its replication and change data capture (CDC) technology as the major software infrastructure vendors continue to look at ways to consolidate the management of their customers’ data assets. Attunity is in the rare position of having its software OEM’d by many of them somewhere in their portfolios; IBM, Oracle, and Microsoft [edit - removed Sybase, listed due to my error] all use and sometimes resell Attunity’s technology. RMS is a more recent addition to Attunity’s CDC portfolio, and its win at Southeastern Freight Lines bodes well for a new addition to its revenue stream.

More TDWI Notes – ParAccel Rolling On, HP Stalled, Vertica Leading Insurgents

On my second day at TDWI, I was in meetings all day – events like this are a great opportunity for analysts to catch up with many of the companies they follow at one time, and this particular one was packed with sponsors. Congrats to the folks who sell sponsorships – they had a packed exhibit hall, and a lot of very interested attendees. I got a chance to chat at a few booths (all buzzing), ask a few attendees some real-world questions (and was asked some surprising ones myself), and get a sense of the workload in the trenches (heavy and growing).



