90% of the world's data was created in the last two years.
This torrent of data shows no signs of slowing down. Big data analytics solutions built on the IBM PowerLinux platform are architected to harness the data explosion facing your business. This is why clients and innovators choose to collaborate with IBM, building optimized PowerLinux solutions that are designed for data.
Watch this space for the latest highlights as this story evolves.
Announcing the IBM PowerLinux Big Data Solution for Apache Hadoop
To create Watson, IBM Research combined PowerLinux technology with Apache Hadoop software, creating the perfect platform for their DeepQA algorithms.
With the launch of IBM PowerLinux Big Data solutions, we took a leadership role in offering optimized solutions ready for immediate deployment, built on IBM InfoSphere software like BigInsights and Streams. We’ve also taken our role in the open source community very seriously, with heavy contributions to projects like Apache Hadoop enabling continuous development in the fields of analytics and high performance computing.
Now, clients and solution builders wishing to innovate on top of a high performance data analytics platform can take advantage of the proven flexibility, throughput, and resiliency of POWER, as well as the immediate value provided by PowerLinux solutions. Using the IBM PowerLinux Big Data Solution for Apache Hadoop, clients and developers can take advantage of optimizations designed by IBM engineers to allow Hadoop to excel on Power.
IBM PowerLinux raises the bar for Hadoop cluster performance
IBM Research continues to push the envelope when it comes to boosting PowerLinux performance in processing large quantities of data. Businesses with a big data problem know that results from large data analysis queries are often useless unless they are delivered immediately upon demand. IBM PowerLinux Big Data solutions, built on the POWER platform, are ideal for these challenges because they are designed to move a fire hose of data into and out of the compute nodes extremely quickly.
Our research team has been working with a 10-node Hadoop cluster of PowerLinux 7R2 nodes, running InfoSphere BigInsights 1.3 software, for several weeks. While exploration and tuning continues, the team has been able to sort through a terabyte of data in less than 10 minutes. This is 58% faster* than the next commodity ~10-node result of 24 minutes, measured on an x86 Hadoop cluster.
TerraEchos chooses IBM PowerLinux for its Kairos stream computing solution
TerraEchos, an award winning IBM Business Partner headquartered in Missoula, Montana, specializes in transforming streaming data (for example, from environmental sensors) into knowledge and decisions. Their Kairos software can complete in fractions of a second what used to take hours, and the parallel processing and file system capabilities of PowerLinux Big Data solutions can cut that response time by a factor of two.
The TerraEchos Kairos computing appliance is built on top of IBM InfoSphere Streams, and can be easily sized and deployed to solve any customer’s streaming data problem. PowerLinux’s scalability, multithreading capabilities, and parallel file system enable the appliance to analyze a fire hose of data instantaneously, enabling real-time decisions while minimizing data storage needs.
Big data analytics require a platform designed to perform
Workload optimized system designed for complex analytics, built with POWER7
Server virtualization showdown
PowerVM throughput 78% higher than VMware