Table 1 Basic features of 14 Hadoop distributions and related download links

From: Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends

| Vendor | Features | Download URL |
| --- | --- | --- |
| Amazon Web Services Inc | • Amazon Elastic Block Store • Amazon Virtual Private Cloud • GPU Instances • High Performance Computing (HPC) Cluster | http://aws.amazon.com/ |
| IBM Corp | • Social and Machine Data Analytics Accelerator • Provides a workload scheduler • Includes Jaql, a declarative query language • Allows executing R jobs directly from the BigInsights web console | http://www-03.ibm.com/software/products/en/infobigienteedit/ |
| Pivotal Corp | • A Fast, Proven SQL Database Engine for Hadoop • Enterprise Real-Time Data Service on Hadoop • Familiar SQL Interface • Hadoop In the Cloud: Pivotal HD Virtualized by VMware | http://www.gopivotal.com/products/pivotal-hd |
| Cloudera Inc | • HDFS Snapshots • Support for running Hadoop on Microsoft Windows • YARN API stabilization • Binary Compatibility for MapReduce applications built on hadoop-1.x | http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-enterprise.html |
| MapR Technologies Inc | • Finish small jobs quickly with MapR ExpressLane • Enable atomic, consistent point-in-time recovery with MapR Snapshots | http://www.mapr.com/products/only-with-mapr |
| Hortonworks Inc | • Use rich business intelligence (BI) tools such as Microsoft Excel, PowerPivot for Excel and Power View • HDP for Windows is the ONLY Hadoop distribution available for Windows Server | http://hortonworks.com/products/hdp/ |
| Karmasphere Inc | • Ability to Use Existing SAS, SPSS and R Analytic Models | http://www.karmasphere.com/product-overview/key-features/ |
| Hadapt Inc | • Analyze both structured and unstructured data in a single, unified platform | http://hadapt.com/product/ |
| Super Micro Computer Inc | • Fully-validated, pre-configured SKUs optimized for Hadoop solutions | http://www.supermicro.com/products/rack/hadoop.cfm |
| Pentaho Corp | • Visual development for Hadoop data preparation and modeling | http://www.pentahobigdata.com/ecosystem/platforms/hadoop |
| Zettaset Inc | • Enterprise-Grade Hadoop Cluster Management | http://www.zettaset.com/platform.php |
| Datastax Inc | • Powered by Apache Cassandra™, Certified for Production | http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-hadoop |
| Datameer Inc | • Data Integration, Analytics, and Visualization | http://www.datameer.com/ |
| Dell Inc | • Cloudera distribution for Hadoop | http://www.dell.com/learn/us/en/555/solutions/hadoop-big-dataSolution?c=us&l=en&s=biz&cs=555 |