Hadoop Summit Recap Part Two – SELECT FROM hdfs WHERE bigdatavendor USING SQL

Probably the most widespread, and commercially imminent, theme at the Summit was “SQL on Hadoop.” Since last year, many offerings have been touted, debated, and some have even shipped. In this post, I offer a brief look at where things stood at the Summit and how we got there. To net it out: offerings today range from the not-even-submitted to GA – if you’re interested, a bit of familiarity will help. Even more useful: patience.

–more–

IBM STG Trip Report: Hardware-Software Synergy Yielding Dividends

Every year in the fourth quarter, IBM assembles its Systems & Technology Group (STG – the hardware guys) executives for discussions with the analyst community to review results and discuss the year ahead. STG’s Senior VP Rod Adkins teed up this year’s meeting with a reminder that STG and Software Group (SWG) both now report to Steve Mills, SVP and Group Executive – Software & Systems. This change naturally suggests the possibilities for increased synergies between the two parts of IBM, and although much collaboration has been in place over the years, IBM’s attention to leveraging the opportunity has clearly come into sharper focus. The interaction was a recurrent theme. Read more of this post

IBM’s IOD Showcases DB2, Informix, InfoSphere. Now, About Marketing….

It was hard to decide where to look first in Las Vegas this year at IBM’s flagship information management event. Coming as it did on the heels of a massive, sprawling Oracle Open World, it was also overwhelming, but distinguished itself immediately by its focus. Whereas Oracle has smashed together hardware systems, apps, middleware, java and development, systems management and database into a bewildering multi-site show, IBM continues to run separate events for Websphere, Rational, Tivoli, and Lotus. No single IBM event trumpets “we’re the biggest,” and they don’t take over the towns they’re in; the content seems a bit more manageable. And as an attendee who hopes to get a broad view, I’m happy with that. However, as I’ll discuss below, Oracle is winning the messaging war nonetheless.

There was indeed talk of systems at IoD this year, as Smart Analytics Systems got a refresh and some added units on x-based platforms. Flash memory additions to the x-based 5600, bundling InfoSphere and Cognos along with an updated Linux release, provide the basis for a good story along with more cores, memory and storage. A similar story is possible for the POWER-based 7700, which also added the new Blue Darter solid state disk (SSD.) And the z audience gets the 9600, with its sidecar, the transparent offload to the Smart Analytics Optimizer. Yes, IBM has a column-based database, with innovative storage tweaks and an optimizer that knows when to use it and when not to. Great promise there.

So what’s wrong with this picture? Try this: ask 10 IT people what Exadata is, and what Smart Analytics Systems are. Ask them who makes the offerings, and what they do. Go ahead…I’ll wait….

Back? OK. Here’s what I learned, after doing that experiment at 3 events attended by IT people (data people, in fact.) 8 of 10 I asked knew Oracle makes Exadata and it’s a wicked fast platform for data. 4 of 10 knew who makes the other one, and fewer knew why. On visibility and buzz, game Oracle.

There is much more to talk about, and visibility and buzz are not everything. IBM’s numbers continue to be good, and nobody in Armonk is complaining. But the IBM Software brand needs to get more attention, more investment, and a tighter, more focused story. The good news? Conversations I’ve been having suggest that it will in 2011, and it’s about time. Read more of this post

Database Benchmarks – The Gift That Keeps on Giving

Yes, I know – not everyone believes database benchmarks are useful. My position is that there is value in benchmarks’ role in helping engineers wring out bottlenecks, bugs and performance impediments in their products. Berni Schiefer, Technical Executive , Information Management Performance and Benchmarks for DB2, MDM and SolidDB, recently told me that “every time we run [TPC-C] we are astonished at how effectively it hammers every element of the system. We always find bugs, room for tuning. It’s the nastiest, most punishing combination there is.” Read more of this post

More TDWI Notes – ParAccel Rolling On, HP Stalled, Vertica Leading Insurgents

On my second day at TDWI, I was in meetings all day – events like this are a great opportunity for analysts to catch up with many of the companies they follow at one time, and this particular one was packed with sponsors. Congrats to the folks who sell sponsorships – they had a packed exhibit hall, and a lot of very interested attendees. I got a chance to chat at a few booths (all buzzing), ask a few attendees some real-world questions (and was asked some surprising ones myself), and get a sense of the workload in the trenches (heavy and growing.)

Read more of this post

EMC Buys Greenplum – Big Data Realignment Continues

EMC’s acquisition of Greenplum, announced today as a cash transaction, reaffirms the obvious: the Big Data tsunami upends conventional wisdom. It has already reshaped the market, spawning the most ferment in the RDBMS (and non-R DBMS via the noSQL players) space in years. When I first posted on Greenplum over a year ago, I said that

Open source + capital has created an intriguing new model of rapid innovation in “mature” markets, and the database space - like BI – is not a done deal. It is indeed possible to escape the gravity well, if you execute. Greenplum is getting it done, and is among the new stars to watch.”

Why the open source reference? Greenplum uses a parallelization layer atop PostgreSQL (like Aster, another of the new breed of ADBMS.)

Now EMC has written the next chapter in that story. In the process, it adds a new piece (after literally dozens of others in the past few years) to its own portfolio, which already includes unstructured data (via Documentum) and virtualization (via VMWare), layered in among the industry-leading storage and information management pieces. Disruptive? You bet. Is EMC finished? I doubt it. Candidates? BI tools, ETL, MDM, data integration come to mind. Losers? At least one big one. Read on. Read more of this post

The Mainframe Skills Shortage Urban Myth

Contributed by Joe Clabby (www.clabbyanalytics.com)

In this CounterOpinion, we challenge the advice and opinions of the Gartner Group, a well-known and highly respected IT research and analysis firm, on the subject of IBM System z (mainframe) migration. In March, 2007, Gartner put forward research suggesting that, due to the aging of the current generation of System z managers, mainframe customers might someday find themselves short of the skilled labor they need to manage their systems.  As a result, Gartner recommended that organizations might want to consider moving from mainframes to other, “more modern” platforms. (This opinion is still available on the company’s web site — search for the “Impact of Generational IT Skill Shift on Legacy Applications” [document ID number: G00146492]). Since that first report, Gartner has published other reports suggesting that enterprises reevaluate their application portfolios — and move various applications to (again) “more modern” platforms. (Note: Gartner never quite articulates which more modern platforms it is talking about…).

Clabby Analytics has two major issues with Gartner’s perspective and advice:

  1. Where is the proof that there has been/will be a major decline in mainframe skills that should cause IT executives to abandon their mainframe platforms? And,
  2. Which “more modern” platform(s) does Gartner have in mind? Read more of this post

Oracle Exadata: Early Signs Promising

Exadata is looking good. In the past few months, I’ve had the chance to talk to several early adopters of Oracle Exadata V2, some in connection with a sponsored white paper Oracle has just published. It’s still early, but I see this product as a milestone, regardless of its commercial success. That is still to be determined, although I wouldn’t bet against it. How it will be affected by Oracle’s execution of the Sun acquisition is another open question, and the recent surprise layoffs, which showed that either the announced expectations were laughably off base or Ellison’s early announcements about  hiring plans were less than candid, don’t bode too well for the near term. Rob Enderle made some strong and provocative points in his guest post here. Read more of this post

Attunity – An Independent Alternative For Data Replication

Attunity (ATTUF), a small OTC-traded company out of Massachusetts, is quietly building up its base, expanding a 1000-customer foothold in real-time change data capture (CDC) and data replication that has made it one of the few remaining independent players standing. With Oracle’s acquisition of GoldenGate and SAP’s announced plan to acquire Sybase, many firms are thinking about having an alternative supplier. Attunity’s competitors these days include iWay and Progress DataDirect - few firms can offer robust support for data sources like RMS, VSAM, NonStop SQL, Enscribe and Adabas as well as common RDBMSs like DB2, SQL Server and Oracle, and that leaves Attunity a relatively wide-open opportunity. Attunity recently announced a 53% year-over-year growth in license revenues; it’s profitable (although GAAP profitability, while in sight, has yet to be achieved) and beginning to repay its debt. With less than $2M in revenues, it may well find itself an acquisition target, to boot.Attunity logo

Read more of this post

IBM Gets Feisty — Mobilizes Analytics for Oracle Battle

In July 2009, IBM announced the Smart Analytics System 7600, a workload-optimized, pre-integrated bundle of hardware and software targeted at the business analytics market. Included in that package are an IBM POWER 550 running AIX, storage, plus InfoSphere Warehouse Enterprise Edition (which consists of DB2, Warehouse design and management tools + Cubing, Data Mining and Text Analytics services), and Cognos 8 Business Intelligence, configured and tuned, and “health check” features. Accommodations are made if the customer already has licensed some of the software and wants to use it on the platform; in this sense, the software is described as “optional.” This month, IBM broadened the story and upped the ante, making Smart Analytics System a key weapon in its widening battle with Oracle.

This post is a slightly updated version of a piece that appeared in the PUND-IT newsletter. Read more of this post

Follow

Get every new post delivered to your Inbox.

Join 134 other followers