A Must Read: The Top 3 questions surrounding Big Data

Please click on the link directly below to bring up my latest blog entry, thanks!

Top 3 Questions around Big Data 9-4

“The way to get started is to quit talking and begin doing.”  Walt Disney


IBM Big Data: Vivisimo Acquisition and BigInsight’s Support Of Cloudera

Armonk, N.Y. – 25 Apr 2012: IBM (NYSE: IBM) today announced a definitive agreement to acquire Vivisimo, a leading provider of federated discovery and navigation software that helps organizations access and analyze big data across the enterprise. Vivisimo is a privately held company based in Pittsburgh, Pennsylvania. Financial terms were not disclosed.

Vivisimo software excels in capturing and delivering quality information across the broadest range of data sources, no matter what format it is, or where it resides. The software automates the discovery of data and helps employees navigate it with a single view across the enterprise, providing valuable insights that drive better decision-making for solving all operational challenges. 

Today’s news accelerates IBM’s big data analytics initiatives with advanced federated capabilities allowing organizations to access, navigate, and analyze the full variety, velocity and volume of structured and unstructured data without having to move it.

The combination of IBM’s big data analytics capabilities with Vivisimo software will further IBM’s efforts to automate the flow of data into business analytics applications, helping clients better understand consumer behavior, manage customer churn and network performance, detect fraud in real-time, and perform data-intensive marketing campaigns. 

“Navigating big data to uncover the right information is a key challenge for all industries,” said Arvind Krishna, general manager, Information Management, IBM Software Group. “The winners in the era of big data will be those who unlock their information assets to drive innovation, make real-time decisions, and gain actionable insights to be more competitive.”

“Businesses need a faster and more accurate way to discover and navigate big data for analysis” said John Kealey, Chief Executive Officer, Vivisimo. “As part of IBM, we can bring clients the quickest and most accurate access to information necessary to drive growth initiatives that increase customer satisfaction, streamline processes, and boost sales.”

IBM estimates 2.5 quintillion bytes of data are created every day from a variety of sources including sensors, social media, and billions of mobile devices around the world, making it difficult for businesses to navigate and analyze it to improve competitiveness, efficiency, and profitability. IDC estimates the market for big data technology and services will grow at an annual rate of nearly 40 percent to reach $16.9 billion by 2015.

Vivisimo brings over a decade of experience and innovation in data navigation and visualization technologies for both structured and unstructured data, making it easier for business users to get value from all of their data and content. Vivisimo’s ability to index and search data across multiple repositories is a distinguishing capability, applicable to all industries and clients.

Vivisimo has more than 140 customers in industries such as government, life sciences, manufacturing, electronics, consumer goods and financial services. Clients include Airbus, U.S. Air Force, Social Security Administration, Defense Intelligence Agency, U.S. Navy, Procter & Gamble, Bupa, and LexisNexis among others. Upon the closing of the acquisition, approximately 120 Vivisimo employees will join IBM’s Software Group. IBM will incorporate Vivisimo technology into its big data platform.

IBM Expands Partner Ecosystem for Big Data Platform

IBM is unique in having developed an enterprise big data platform that allows clients to manage, access, and gain intelligence on the full variety, velocity and volume of structured and unstructured data.

IBM’s big data platform is based on open source Apache Hadoop. The platform makes it easier for data-intensive applications to manage and analyze petabytes of big data by providing clients with an integrated approach to analytics, helping them turn information into insights for improved business outcomes.

The platform provides clients with the industry’s broadest array of advanced business analytics, Hadoop-based analytics, stream computing, data warehousing, integration, visualization, systems management, governance, and consulting services.

IBM’s approach to big data challenges is differentiated as it blends traditional data management technologies that are well suited for structured, repeatable tasks, together with complementary new technologies that address speed and flexibility, and are ideal for data exploration, discovery and unstructured analysis.

IBM is expanding its big data platform to run on other distributions of Hadoop, beginning with Cloudera. Cloudera is a top contributor to the Hadoop development community, and an early provider of Hadoop-based systems to clients across a broad range of industries including financial services, government, telecommunications, media, retail, energy and healthcare. As a result, Cloudera Hadoop clients can now take advantage of IBM’s big data platform to perform complex analytics and build a new generation of software applications.

IBM has the industry’s broadest portfolio of big data capabilities with software, hardware, services, and innovations developed by IBM Research such as the Watson system. Over 100 IBM Business Partners have adopted IBM’s big data platform, bringing a new class of solutions to market and extending the reach of IBM analytics offerings for clients.

And the Award goes to…

The IBM Big Data Ecosystem of Business Partners! 

But wait, before I get ahead of myself and start fielding acceptance speeches from our Business Partners, I wanted to provide some backdrop. 

We started our Big Data journey about a year ago in the sense of defining to the world how we viewed Big Data.  Extracting insight from an immense volume, variety and velocity of data, in context, beyond what was previously possible. In addition, we released our first version of InfoSphere BigInsights, our Hadoop based product. It joined our existing streaming analytics product InfoSphere Streams. Today our Big Data Platform encompasses Streaming Computing, Hadoop, Information Integration, and Data Warehousing.

As the eco-system got built out in 2011, we finished the year as an industry leader in terms of the number of partners in our rich eco-system for Big Data. Although, I would like to cover each and every business partner in detail here, we do have a microsite that provides a more indepth look (see the link at the end of this post).  I will however profile a couple of partners to give you a flavor of the quality of our ecosystem. 

Revolution Analytics

Revolution Analytics delivers advanced analytics software by building on open source R— very powerful statistics software—with innovations in big data analysis, integration and user experience, Revolution Analytics meets the demands and requirements of modern data-driven businesses.  Revolution Analytics not only supports InfoSphere BigInsights, but Netezza as well.


Datameer utilizes and runs on IBM’s Hadoop based platform (InfoSphere BigInsights), which provides a dependable, enterprise ready implementation of Apache Hadoop. Datameer is leveraged in leading enterprises in the financial services, telecommunications, internet security, gaming and retail industries as well as government agencies to analyze structured and unstructured data. 

Datameer enables users to rapidly integrate, analyze and visualize massive amounts of data.  Datameer users take advantage of wizard-based data integration to instantly access and load virtually all data sources into Hadoop without the requirement of building time-consuming data models. Once complete, the results and insights are then visualized in reports, charts, maps and dashboards in all manner of use cases including customer behavior analytics, fraud detection, web analytics, IT infrastructure analysis and text analytics.  


Combined with IBM’s Big Data Platform, ClickFox’s cross-channel analytic engine will analyze structured and unstructured data from warehouses to build a visual mapping of product and customer experiences across the enterprise. The experience analytics solution will yield powerful insight into consumer behavior, channel conflict, marketing and sales effectiveness, retention strategies and the impact of product and service issues on customers.

By revealing hidden bottom-line connections between products, touch points, consumers and business outcomes, organizations can reduce costs, increase satisfaction and loyalty and drive revenue and competitive advantage. Organizations benefit from a multi-dimensional view of their business from several lenses, including: employee, customer, product, service, company and influence, and can focus analysis on the most critical areas.


TerraEchos builds ‘big data-in-motion’ analytic solutions. Employing IBM’s InfoSphere Streams, their systems handle very large Volumes of a wide Variety of data at high Velocity (V3), enabling clients to aggregate and analyze data from a variety of sources as it streams in real time. Applicable areas include:

  •      Protection of sensitive infrastructure (oil & gas, power grids, etc.)
  •      Monitoring and control of complex systems (power grids, manufacturing, medical)
  • Traffic flow control, including disaster evacuation route optimization, and air traffic control
  • Real-time predictive disaster avoidance (tsunamis and extreme weather)
  • Large-scale environmental monitoring and proactive eco-system protection (air and water temperature, flow and contaminant analysis)
  • Vetting and tracking of known criminals
  • Near-zero latency decision and trade execution on capital markets
  • Scalable architecture against volumes of unstructured or structured data
  • Computationally intensive calculations at unprecedented velocity
  • Dynamic correlation and fusion of a variety of data


Atigeo delivers an experience that benefits communication service providers, media content owners, healthcare organizations and others. With IBM Streams, information can be easily aggregated and correlated across large volumes of data flowing from all aspects of life. And by leveraging IBM Languageware to derive semantic understanding and Cognos Now! for real-time Business Intelligence and Analytics, and Atigeo xPatterns are able to gain useful insights from this vast amount of information that can then be used to create personal and portable profiles, while always preserving the highest degree of privacy.

Persistent Systems

Persistent Systems offers solutions based on IBM technologies. Persistent’s proven BigInsights implementation methodology for Big Data solutions leverages –

  • In-depth knowledge of BigInsights product and underlying Hadoop technologies
  • Specialized tools and frameworks that significantly reduce time to value and also provide ease of maintenance
  • A large Analytics practice that has been involved in all aspects of BigInsights product, all the way from installation, implementation, integration, quality assurance

The Persistent team also has deep knowledge of different components of the Hadoop ecosystem such as Hive, JAQL, PIG and other open source platforms where we have contributed IP and actively support these systems. See today’s announcement: 



Concord is an established IBM Premiere Business Partner with a proven track record delivering industry solutions based on IBM’s Information Management, and WebSphere product lines as well as Hadoop.

In addition, Concord has created ComplETE suite that complements and enhances the BigInsights platform by providing end-to-end business process visibility in mainframe & distributed environments as well as environments where establishing precise transaction relationships seems impossible. They offer true end-to-end correlation. The suite includes transaction monitoring, transaction trending, transaction analytics, event management and payload forensics. Concordwill also leverage IBM InfoSphere Streams to handle streaming analytics.

The suite couples the power of Hadoop with in-memory MOLAP cubes embedded in our RETE rules engine to deliver the fastest real-time analytics & simulation platform on the market.

For more details and information on IBM big data business partners please visit:


In the end, it is you, the customer who will decide who gets the Award for being the best IBM big data Business Partner.  In the mean time I want to make sure you have many high quality ones to choose from…  – Bruce Weed

“The secret of success is constancy of purpose.” – Benjamin Disraeli


I thought I would ease into the Leadership discussion.

The first two areas around leadership to layout are around the following two facts:

1)  Leaders need to make difficult decisions in a well thought out and timely manner.

In the picture below, Washington’s decision to cross the Delaware River in a surprise attack changed the course of the war.     

2) Leaders need to lead.  The statement sounds obvious, but in order for a leader to effectively lead they need to be in a position to do that job.  

As you can see in the picture below there is a lot going on.  People paddling, someone holding the flag, someone steering the boat, someone pushing away from ice chunks, etc.  If Washington had to do all of these jobs, not only would he not have time to lead (direct the boat, think through the impending battle on the other side, etc.), the over all mission would not be very effective.

They say a picture speaks a 1000 words, the picture below speaks volumes on leadership. 


“Few men have virtue to withstand the highest bidder.”  –  George Washington

Published in: on September 13, 2011 at 8:20 pm  Leave a Comment