Hadoop Distributions And Kids’ Soccer

The big players are moving in for a piece of the big data action.  IBM, EMC, and NetApp have stepped up their messaging, in part to prevent startup upstarts like Cloudera from cornering the Apache Hadoop distribution market. They are all elbowing one another to get closest to “pure Apache” while still “adding value.” Numerous other startups have emerged, with greater or lesser reliance on, and extensions or substitutions for, the core Apache distribution. Yahoo! has found a funding partner and spun its team out, forming a new firm called Hortonworks, whose claim to fame begins with an impressive roster responsible for most of the code in the core Hadoop projects. Think of the Doctor Seuss children’s book featuring that famous elephant, and you’ll understand the name.

While we’re talking about kids – ever watch young kids play soccer? Everyone surrounds the ball. It takes years to learn their position on the field and play accordingly. There are emerging alphas, a few stragglers on the sidelines hoping for a chance to play, community participants – and a clear need for governance. Tech markets can be like that, and with 1600 attendees packing late June’s Hadoop Summit event, all of those scenarios were playing out. Leaders, new entrants, and the big silents, like the absent Oracle and Microsoft.

more

Want Broader BI Usage? Crystal Reports Founders Offer Indicee

Mark Cunningham has reunited some of the team that built Crystal Reports (now part of SAP Business Objects) and launched Indicee, a SaaS-based BI reporting play that is pointed squarely at the continuing difficulty of extending BI beyond its seemingly permanent minority usage model.

It’s commonly understood that users continue to fend for themselves manually, moving data to spreadsheets for analytic manipulation because IT is unable to respond quickly enough to their needs. Indicee tackles this by re-using existing report and spreadsheet content (not surprisingly, Crystal reports lead the source list), moving it to the cloud for data mart-based interaction, and innovating a different approach to user interaction. It’s worth a look, and a free download for trial use sweetens the deal. Read more of this post

Pentaho Goes “Open Core” With Lucidera OLAP Viewer

Open-source BI vendor Pentaho has purchased technology rights from failed BI SaaS vendor LucidEra, and plans to combine LucidEra’s Clearview, a reporting and analysis OLAP front end for non-technical users, with the Mondrian open source OLAP engine used by Pentaho Analysis,  in a new offering called Pentaho Analyzer Enterprise Edition, available both on-premise and on-demand. Clearview will not be available in the free community edition of Pentaho. Existing Pentaho Analysis Enterprise Edition and Pentaho BI Suite Enterprise Edition customers will not be charged additional fees. Clearview adds substantial value to the priced portion of Pentaho’s portfolio – another example of the “open core” business model. Open core is not without its detractors, and a brief flurry of chatter erupted about it in the blogosphere. Read more of this post

What’s An Eigenbase?

The open source community is remarkable in many ways. For me, one of the most significant aspects of it is exactly that: it IS a community. It’s composed of people who communicate and share in deep and productive ways. One of the most interesting manifestations of that spirit I’ve run across is the Eigenbase project, an extensible platform being used by some very creative folks for the creation and continuing development of databases for data warehousing (the LucidDB DBMS) and stream processing (the SQLstream continuous query engine). I haven’t posted about either of those yet but will, and I’m watching their continuing evolution with great interest.

Read more of this post