
Table 1 Basic features of 14 Hadoop distributions and related download links

From: Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends

Vendor Features Download URL
Amazon Web Services Inc
• Amazon Elastic Block Store
• Amazon Virtual Private Cloud
• GPU Instances
• High Performance Computing (HPC) Cluster

IBM Corp
• Social and Machine Data Analytics Accelerator
• Provides a workload scheduler
• Includes Jaql, a declarative query language.
• Allows executing R jobs directly from the BigInsights web console.

Pivotal Corp
• A Fast, Proven SQL Database Engine for Hadoop
• Enterprise Real-Time Data Service on Hadoop
• Familiar SQL Interface
• Hadoop In the Cloud: Pivotal HD Virtualized by VMware

Cloudera Inc
• HDFS Snapshots
• Support for running Hadoop on Microsoft Windows
• YARN API stabilization
• Binary Compatibility for MapReduce applications built on hadoop-1.x

MapR Technologies Inc
• Finish small jobs quickly with MapR ExpressLane
• Enable atomic, consistent point-in-time recovery with MapR Snapshots

Hortonworks Inc
• Use rich business intelligence (BI) tools such as Microsoft Excel, PowerPivot for Excel and Power View
• HDP for Windows is the ONLY Hadoop distribution available for Windows Server.

Karmasphere Inc
• Ability to Use Existing SAS, SPSS and R Analytic Models

Hadapt Inc
• Analyze both structured and unstructured data in a single, unified platform

Super Micro Computer Inc
• Fully-validated, pre-configured SKUs optimized for Hadoop solutions

Pentaho Corp
• Visual development for Hadoop data preparation and modeling

Zettaset Inc
• Enterprise-Grade Hadoop Cluster Management

Datastax Inc
• Powered by Apache Cassandra™, Certified for Production

Datameer Inc
• Data Integration, Analytics, and Visualization

Dell Inc
• Cloudera distribution for Hadoop