Gartner Research

The Data Lake Fallacy: All Water and Little Substance

Published: 23 July 2014

ID: G00264950

Analyst(s): Andrew White, Nick Heudecker

Summary

The data lake concept promises a centralized pool of disparate data sources in one location, and treats alignment as a technical exercise. Information management leaders should understand the gaps in this concept — such as semantics, governance and security — and take the necessary precautions.


Analysis

Impacts and Recommendations

  • Data lakes focus on storing data from disparate sources and ignore how or why data is used, governed, defined and secured by an organization's information management leaders.
  • End users are misinformed about the skill level required to capitalize on the data lake concept. Vendors are exploiting the hype with no intent to resolve the lack of programming, analytical and data manipulation skills necessary to improve specific business outcomes.
  • Data lakes typically begin as ungoverned data stores addressing a limited data science audience. Meeting the needs of wider audiences requires curated repositories with governance, semantic consistency and security — elements already found in data warehouses.


©2019 Gartner, Inc. and/or its affiliates. All rights reserved. Gartner is a registered trademark of Gartner, Inc. and its affiliates. This publication may not be reproduced or distributed in any form without Gartner’s prior written permission. It consists of the opinions of Gartner’s research organization, which should not be construed as statements of fact. While the information contained in this publication has been obtained from sources believed to be reliable, Gartner disclaims all warranties as to the accuracy, completeness or adequacy of such information. Although Gartner research may address legal and financial issues, Gartner does not provide legal or investment advice and its research should not be construed or used as such. Your access and use of this publication are governed by Gartner’s Usage Policy. Gartner prides itself on its reputation for independence and objectivity. Its research is produced independently by its research organization without input or influence from any third party. For further information, see Guiding Principles on Independence and Objectivity.
