Gartner Research

How to Choose the Right Apache Hadoop Distribution

Published: 09 February 2012

ID: G00227159

Analyst(s): Merv Adrian


IT architects, business leaders and data scientists involved in "big data" projects can easily go wrong when they construct an Apache Hadoop stack, because the 20 or more potential components ("projects") are not integrated as commercial software packages are.

Table Of Contents
  • Overview

What You Need to Know


  • Project Structure Makes Implementing Apache Hadoop a Challenge
  • Vendor Choices Show Distributions' Focus
  • Projects Found in Distributions
  • Take These Steps to Increase Your Chances of Making the Right Choice

Recommended Reading

©2021 Gartner, Inc. and/or its affiliates. All rights reserved. Gartner is a registered trademark of Gartner, Inc. and its affiliates. This publication may not be reproduced or distributed in any form without Gartner’s prior written permission. It consists of the opinions of Gartner’s research organization, which should not be construed as statements of fact. While the information contained in this publication has been obtained from sources believed to be reliable, Gartner disclaims all warranties as to the accuracy, completeness or adequacy of such information. Although Gartner research may address legal and financial issues, Gartner does not provide legal or investment advice and its research should not be construed or used as such. Your access and use of this publication are governed by Gartner’s Usage Policy. Gartner prides itself on its reputation for independence and objectivity. Its research is produced independently by its research organization without input or influence from any third party. For further information, see Guiding Principles on Independence and Objectivity.

Already have a Gartner Account?

Purchase this Document

To purchase this document, you will need to register or sign in above

Become a client

Learn how to access this content as a Gartner client.