EMC Buys Greenplum – Big Data Realignment Continues

EMC’s acquisition of Greenplum, announced today as a cash transaction, reaffirms the obvious: the Big Data tsunami upends conventional wisdom. It has already reshaped the market, spawning the most ferment in the RDBMS (and non-R DBMS via the NoSQL players) space in years. When I first posted on Greenplum over a year ago, I said that

“Open source + capital has created an intriguing new model of rapid innovation in “mature” markets, and the database space – like BI – is not a done deal. It is indeed possible to escape the gravity well, if you execute. Greenplum is getting it done, and is among the new stars to watch.”

Why the open source reference? Greenplum uses a parallelization layer atop PostgreSQL (like Aster, another of the new breed of ADBMS).

Now EMC has written the next chapter in that story. In the process, it adds a new piece (after literally dozens of others in the past few years) to its own portfolio, which already includes unstructured data (via Documentum) and virtualization (via VMware), layered in among the industry-leading storage and information management pieces. Disruptive? You bet. Is EMC finished? I doubt it. Candidates? BI tools, ETL, MDM, and data integration come to mind. Losers? At least one big one. Read on.

IBM Gets Feisty — Mobilizes Analytics for Oracle Battle

In July 2009, IBM announced the Smart Analytics System 7600, a workload-optimized, pre-integrated bundle of hardware and software targeted at the business analytics market. The package includes an IBM POWER 550 running AIX, storage, InfoSphere Warehouse Enterprise Edition (DB2, warehouse design and management tools, and cubing, data mining, and text analytics services), and Cognos 8 Business Intelligence, all configured and tuned, along with “health check” features. Accommodations are made if the customer has already licensed some of the software and wants to use it on the platform; in this sense, the software is described as “optional.” This month, IBM broadened the story and upped the ante, making Smart Analytics System a key weapon in its widening battle with Oracle.

This post is a slightly updated version of a piece that appeared in the PUND-IT newsletter.

New TPC-H Record – Virtualized by ParAccel, VMware

You can set performance records in a virtualized environment – that’s the message of the new 1 TB TPC-H benchmark record (scroll down to see the 1 TB results) just released by ParAccel and VMware. Running on VMware’s vSphere 4, the ParAccel Analytic Database (PADB) delivered a one-two punch: not only the top performance number for a 1 terabyte (TB) benchmark, but the top price-performance number as well. The results in a nutshell: 1,316,882 Composite Queries per Hour (QphH), a price/performance of 70 cents/QphH, and a data load rate of over 3.5 TB per hour. ParAccel moved quickly to promote the result; oddly, VMware seems to have been asleep at the switch, with no promotion on its site as the release hit the wires, and a bland quote from a partner exec in the release itself.


And Then There Were Three: POWER, x86 and z

by Joe Clabby, President, Clabby Analytics. Updated from a November 2009 publication

There is a major shakeout underway in the midrange/high-end server marketplace as sales of Sun SPARC/CMT (chip multithreading) and Hewlett-Packard (HP) Itanium-based servers decline significantly, and as new, more powerful versions of Intel’s Xeon and IBM’s POWER micro-architectures come to market.