With more and more intelligent and interconnected devices and systems, the data that companies collect is growing at unprecedented rates. As much as 90% of that data is unstructured, coming from sources such as social media, electronic documents and machine data, and it is growing at rates as high as 50% per year. This is big data.
Analyzing big data can make your business more agile and more competitive, and provide insights that, in the past, were beyond reach. The emergence of technologies such as the real-time analytics processing capabilities of stream computing and the massive MapReduce scale-out capabilities of Hadoop® has opened the door to a world of possibilities. It has also created the need for robust infrastructures that combine computing power, memory and data bandwidth to process and move large quantities of data -- fast.
To meet this need, IBM Power Systems designed a solution for creating a big data environment built on a heritage of strong resiliency, availability and security -- the IBM Solution for Hadoop - Power Systems Edition.
The IBM Solution for Hadoop - Power Systems Edition
With a data-centric design, this solution offers a tightly integrated and performance-optimized infrastructure for MapReduce-based big data workloads. Unlike traditional x86 infrastructures, the IBM Solution for Hadoop can be tailored specifically to your big data workloads by using a simple building block approach that matches the mix of CPU and storage to application requirements. This approach gives you an infrastructure matched to your big data workload.
Features and benefits
Power Systems are proven to be more reliable with 73% fewer outages and 92% fewer performance problems than x86.1
POWER8 delivers 2.5x the performance on big data/Hadoop workloads compared to the best published x86 result.2
2 IBM Analytics Stack: IBM Power System S822L; 8 nodes, each with 24 cores / 192 threads, POWER8, 3.0 GHz, 512 GB of memory, RHEL 6.5, InfoSphere BigInsights 3.0
Cisco Stack: 16 high-density Cisco UCS C240 M3 Rack Servers, each with 16 cores / 32 threads, Intel Xeon E5-2665, 2.4 GHz, 256 GB of memory, Cisco UCS VIC 1225, and LSI 9266 8i with 24 1-TB SATA 7200-rpm disks, running the Apache Hadoop open source distribution.
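To put the two configurations in footnote 2 side by side, the per-node figures can be added up into cluster totals. The short Python sketch below does only that arithmetic. It assumes that the core, thread and memory figures quoted are per node (the footnote says "each with"); the `Cluster` class and its field names are illustrative and not part of any IBM or Cisco tooling. The totals describe hardware resources only and say nothing about measured performance.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    """Hypothetical container for the per-node figures quoted in footnote 2."""
    name: str
    nodes: int
    cores_per_node: int
    threads_per_node: int
    memory_gb_per_node: int  # assumption: the quoted memory is per node

    def totals(self) -> dict:
        """Multiply each per-node figure by the node count."""
        return {
            "cores": self.nodes * self.cores_per_node,
            "threads": self.nodes * self.threads_per_node,
            "memory_gb": self.nodes * self.memory_gb_per_node,
        }


# Figures copied from footnote 2.
power8 = Cluster("IBM Power System S822L", nodes=8, cores_per_node=24,
                 threads_per_node=192, memory_gb_per_node=512)
cisco = Cluster("Cisco UCS C240 M3", nodes=16, cores_per_node=16,
                threads_per_node=32, memory_gb_per_node=256)

for cluster in (power8, cisco):
    print(cluster.name, cluster.totals())
```

Under these assumptions, the POWER8 stack totals 192 cores, 1,536 threads and 4,096 GB of memory across 8 nodes, while the Cisco stack totals 256 cores, 512 threads and 4,096 GB across 16 servers.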