Hadoop Distributions And Kids’ Soccer

The big players are moving in for a piece of the big data action.  IBM, EMC, and NetApp have stepped up their messaging, in part to prevent startup upstarts like Cloudera from cornering the Apache Hadoop distribution market. They are all elbowing one another to get closest to “pure Apache” while still “adding value.” Numerous other startups have emerged, with greater or lesser reliance on, and extensions or substitutions for, the core Apache distribution. Yahoo! has found a funding partner and spun its team out, forming a new firm called Hortonworks, whose claim to fame begins with an impressive roster responsible for most of the code in the core Hadoop projects. Think of the Doctor Seuss children’s book featuring that famous elephant, and you’ll understand the name.

While we’re talking about kids – ever watch young kids play soccer? Everyone surrounds the ball. It takes years to learn their position on the field and play accordingly. There are emerging alphas, a few stragglers on the sidelines hoping for a chance to play, community participants – and a clear need for governance. Tech markets can be like that, and with 1600 attendees packing late June’s Hadoop Summit event, all of those scenarios were playing out. Leaders, new entrants, and the big silents, like the absent Oracle and Microsoft.

more

Will AEP Replace RDBMS? A Dialogue With Charles Brett

Analytic Event Processing (AEP) is hot. But does it mean RDBMS begins to decline in importance? Charles Brett of C3B Consulting and I recently had a quick dialogue about it and came up with different conclusions. That conversation is reproduced here. It’s only the beginning – l hope you will weigh in with your thoughts. Read more of this post

What’s An Eigenbase?

The open source community is remarkable in many ways. For me, one of the most significant aspects of it is exactly that: it IS a community. It’s composed of people who communicate and share in deep and productive ways. One of the most interesting manifestations of that spirit I’ve run across is the Eigenbase project, an extensible platform being used by some very creative folks for the creation and continuing development of databases for data warehousing (the LucidDB DBMS) and stream processing (the SQLstream continuous query engine). I haven’t posted about either of those yet but will, and I’m watching their continuing evolution with great interest.

Read more of this post

InfoSphere Streams Is A Game Changer

IBM has made it clear that InfoSphere Streams, the commercialized part of the System S research project that has been underway for some years, is a priority, and they are committing substantial investments to it.  In fact, the release was hurried a bit, as I noted in my (hopefully) humorous post about naming for complex event processing (CEP) and related technologies. At a major financial analyst meeting in May 2009, CEO Sam Palmisano called it out as an IBM opportunity, and Software Group honcho Steve Mills listed it as one of four themes within his topline Information Agenda message. That kind of push makes things happen. Read more of this post

CEP: The Tech That Dare Not Speak Its Name

IBM has decided to go after the complex event processing (CEP) market with InfoSphere Streams a bit sooner than expected (by about a year, it seems.) The Product Formerly Known As System S (apologies to Prince – but at least it doesn’t have a weird symbol instead of a name) has been released into the wild because it’s ready enough. Or because Sam Palmisano wanted to talk about it at a recent analyst event. Or (Curt Monash theorizes here) because Microsoft announced that their CEP offering will be in SQL Server Real Soon Now.  Read more of this post