Guest Post: Leading the Logical Data Warehouse Charge Has its Challenges

From my colleague Mark Beyer, who speculates about how leadership in moving toward the logical data warehouse (LDW) will be received: 

The logical data warehouse is already creating a stir in the traditional data warehouse market space. Less than 5% of clients with implemented warehouses that we speak with are pursuing three or more of the six aspects of a logical warehouse: 

  • repositories
  • data virtualization
  • distributed processes
  • active auditing and optimization
  • service level negotiation
  • ontological and taxonomic metadata

That means we are in a very early stage regarding the adoption trend, and vendors who are aggressively moving toward it are ahead of their customers.

..more…

Apache Hadoop 1.0 Doesn’t Clear Up Trunks and Branches Questions. Do Distributions?

In early January 2012, the world of big data was treated to an interesting series of product releases, press announcements, and blog posts about Hadoop versions.  To begin with, we had the announcement of Apache version 1.0 at long last, in a press release. Although there were grumblings here and there in the twittersphere that changes to release numbers are meaningless, my discussions with Gartner’s enterprise customers indicate otherwise. Products with release numbers like 0.20.2 make the hair on Procurement’s neck stand on end, and as Hadoop begins to get mainstream attention (Gartner’s clients, see Hype Cycle for Data Management 2011), IT architects and executives find such optics quite important. Hadoop is moving beyond pioneers like Amazon, Yahoo! and LinkedIn into shops like JP Morgan Chase, and they pay attention to such things.

…more…

Mark Beyer, Father of the Logical Data Warehouse, Guest Post

Another guest post, this time from my colleague and friend Mark Beyer.

My name is Mark Beyer, and I am the “father of the logical data warehouse”. So, what does that mean? First,  if like any father, you are not willing to address your ancestry with full candor you will lose your place in the universe and wither away without making a meaningful contribution. As an implementer in the field, I was a student and practitioner of both Inmon and Kimball. I learned as much or more from my clients and my colleagues during multiple implementations as I did from studying any methodology. My Gartner colleagues challenged my concepts and helped hammer them into a comprehensive and complete concept. Simply put, I was willing to consider DNA contributions from anyone and anywhere, but through a form of unnatural selection, persisted in choosing to include the good genes and actively removing the undesirable elements.

more…

Beyond Moneyball: How Organizations Can Use Collaborative Decision Making to Succeed

 Having just seen the movie myself, I was delighted to receive this guest post from my colleague Rita Sallam, a Research Director here who focuses on Analytics, BI, and Performance Management. It’s a good read.

As demonstrated in movie “Moneyball”, starring Brad Pitt and opening in theaters today (http://www.moneyball-movie.com/), professional sports teams are increasingly using data mining and statistical analysis to find the players that best correlate to success.

 This approach has resulted in the displacement of many long-held, but less relevant, performance statistics and “gut feel” recruiting approaches. Many successful teams are building on – and supplementing – this fact-based approach to winning by using collaborative decision making (CDM) platforms that enable key team decision makers to assess, weight and optimize a combination of quantitative and qualitative measures used to select the best players at any one time to meet their specific team needs.

more…

Hadoop Distributions And Kids’ Soccer

The big players are moving in for a piece of the big data action.  IBM, EMC, and NetApp have stepped up their messaging, in part to prevent startup upstarts like Cloudera from cornering the Apache Hadoop distribution market. They are all elbowing one another to get closest to “pure Apache” while still “adding value.” Numerous other startups have emerged, with greater or lesser reliance on, and extensions or substitutions for, the core Apache distribution. Yahoo! has found a funding partner and spun its team out, forming a new firm called Hortonworks, whose claim to fame begins with an impressive roster responsible for most of the code in the core Hadoop projects. Think of the Doctor Seuss children’s book featuring that famous elephant, and you’ll understand the name.

While we’re talking about kids – ever watch young kids play soccer? Everyone surrounds the ball. It takes years to learn their position on the field and play accordingly. There are emerging alphas, a few stragglers on the sidelines hoping for a chance to play, community participants – and a clear need for governance. Tech markets can be like that, and with 1600 attendees packing late June’s Hadoop Summit event, all of those scenarios were playing out. Leaders, new entrants, and the big silents, like the absent Oracle and Microsoft.

more

IBM Fills Out Netezza Lineup With High Capacity Appliance

In the months since IBM closed its Netezza acquisition, the data warehouse appliance pioneer has been busy, if the announcements at this week’s Enzee are any indication. An enthusiastic crowd – 1000 strong – heard CEO Jim Baum deliver the news: new hardware, software and partnerships.The biggest news was The Appliance Formerly Known As Cruiser, now known as the Netezza High Capacity Appliance (HCA). A wag made up some t-shirts bearing the acronym TAFKAC and did quite well. IBM is aiming to push the size perception for Netezza higher. How high? Half a PB in a rack. You can scale it to 10PB.

more

First Gartner Post Up

Yes, I know it’s been a while. But I’ve had a lot to learn at Gartner. Systems, people, existing positions…I’ve now put up my first Gartner blog post, at http://blogs.gartner.com/merv-adrian/2011/03/23/learning-the-ropes

Feel free to go there and check it out. You don’t have to be a Gartner client. My next task will be figuring out what to do here, and what to do there – if they don’t turn out to be fundamentally the same thing…

Twitter Drafting – Marketing in the Tweetstream

Racing fans - cars or bicycles – are familiar with the concept of drafting – travelling close behind another vehicle to reduce wind resistance. The concept is sometimes applied to marketing by savvy practitioners who use the spend of others to multiply their own impact in public consciousness. In recent months, I’ve noticed a growing use of twitter by companies – including research firms – to exploit the new social channel this way.

It’s not new. Back in the days when Comdex was the biggest, most interesting show in IT, one could sometimes find the most intriguing offerings, especially from small firms with correspondingly small budgets, in hotel rooms nearby. Rather than paying the sizable fees required to participate officially, these nimble players bypassed the show entirely and found ways to reach attendees and draw them to parties, chili cook offs, etc. to get their message across.

The age of twitter, and especially hashtags, has created a new opportunity. If you don’t know what a hashtag is, they are words preceded by the # sign, a convention that allows an event – like #IBMPulse or #OracleOpenWorld – to sustain what is known as a tweetstream of messages the community interested in the event can follow. (Hashtags are used for other topics too, but that’s not relevant here.)

How does the crafty practitioner use hashtags to get an extra kick for his or her message? By putting out their own tweets and using someone else’s hashtag. Examples: tweeting about your competing product during a vendor show, or in a tweetstream that has sprung up around an event like a product intro or news story.

Analysts can jump into a tweetstream to promote their own work inside the stream around a vendor’s show. I’ve done that myself – it’s a great way to find new followers.

Some have gone further still – lately I’ve seen twitter-savvy analysts like Ray Wang promote their firm’s work even inside a competitor firm’s event tweetstream.  It’s a great way to find the right audience at the right time, and another powerful example of how twitter and other social media challenge existing business models.

Update: in the day following this post, I observed two more instances at the Gartner MDM event I was attending. Another independent analyst used the hashtag to add commentary – on the mix of attendees of the event, which he was not present for. And a vendor, not sponsoring, retweeted something I said, adding a link to their own website. What’s your opinion of these tactics? Leave your comments below – I hope to have a lively discussion. 

IBM STG Trip Report: Hardware-Software Synergy Yielding Dividends

Every year in the fourth quarter, IBM assembles its Systems & Technology Group (STG – the hardware guys) executives for discussions with the analyst community to review results and discuss the year ahead. STG’s Senior VP Rod Adkins teed up this year’s meeting with a reminder that STG and Software Group (SWG) both now report to Steve Mills, SVP and Group Executive – Software & Systems. This change naturally suggests the possibilities for increased synergies between the two parts of IBM, and although much collaboration has been in place over the years, IBM’s attention to leveraging the opportunity has clearly come into sharper focus. The interaction was a recurrent theme. Read more of this post

Cloudera-Informatica Deal Opens Broader Horizons for Both

Cloudera‘s continuing focus on the implications of explosive data growth has led it to another key partnership, this time with Informatica. Connecting to the dominant player in data integration and data quality expands the opportunity for Cloudera dramatically; it enables the de facto commercial Hadoop leader to find new ways to empower the “silent majority” of data. The majority of data is outside; not just outside enterprise data warehouses, but outside RDBMS instances entirely. Why? Because it doesn’t need all the management features database management software provides – it doesn’t get updated regularly, for example. In fact, it may not be used very often at all, though it does need to be persisted for a variety of reasons. I recently mentioned Cloudera’s success of late; it’s going to be challenged by some big players in 2011, notably IBM, whose recent focus on Hadoop has been remarkably nimble. So these deals matter. A lot. The Data Management function is being refactored before our eyes; both these vendors will play in its future. Read more of this post

Follow

Get every new post delivered to your Inbox.

Join 86 other followers