Gartner Research

Create a Data Strategy for Machine Learning in Advanced Analytics Initiatives

Published: 10 May 2019


Organizations struggle to use data effectively and efficiently to support machine learning in advanced analytics initiatives due to growing diversity in data projects. This research guides data and analytics technical professionals on developing a data strategy to support successful deployments.

Included in Full Research

  • Prework: Build a Business Motivation Framework for ML
    • Defining the End Objective
    • Defining the Means Objectives
    • Providing Assessment and Governance to Support the Data Strategy
    • Defining Influencers Critical to the Success of the Data Strategy
  • Step 1: Develop a Targeted Acquisition Strategy
    • 1.1 Determine Where to Get Data
    • 1.2 Select an Approach to Acquiring Internal and External Data
    • 1.3 Enable Data Engineering Pipelines
    • 1.4 Establish Data Science Pipeline
    • 1.5 Enable Data Science Workflows
    • 1.6 Enable Supervised Learning Workflows
    • 1.7 Enable Unsupervised Learning Workflows
    • 1.8 Secure Data Science Pipelines
  • Step 2: Define Data Preprocessing Architecture
    • 2.1 Refine the Architecture With Storage Options
  • Step 3: Connect ML Analytic Engines
    • 3.1 Feed Big Data Analytic Engines to Support ML Initiatives
    • 3.2 Complement With Automated Machine Learning Engines
  • Step 4: Deliver to ML Workloads
    • 4.1 Complement ML Workloads With Pretrained Networks and Packaged Datasets
    • 4.2 Work With Different Technology Approaches to Sourcing ML Workloads
  • Step 5: Perform a Business Process Review of ML Output
    • 5.1 Identify and Prioritize Processes to Review
    • 5.2 Gather and Analyze Current Process Data
    • 5.3 Conceptualize Future State
    • 5.4 Integrate ML Output Into Business Process
    • 5.5 Evaluate Outcomes
  • Follow-Up
    • Manage Data Pipelines and ML Workloads
    • Adopt Flexible Data Quality Strategies for Machine Learning
  • Risk No. 1: Building Data Science Pipelines Can Be Especially Challenging When Dealing With Big Data Without the Right Tools
  • Risk No. 2: Poor Data Quality Will Significantly Impact Performance and Accuracy
  • Risk No. 3: Techniques for Securing Data Science Pipelines Are Still in Their Infancy
  • Pitfall: Bounded Rationality Exists Even Within ML Applications


Carlton Sapp

Access Research

Already a Gartner client?

To view this research and much more, become a client.

Speak with a Gartner specialist to learn how you can access peer and practitioner research backed by proprietary data, insights, advice and tools to help you achieve stronger performance.

By clicking the "Continue" button, you are agreeing to the Gartner Terms of Use and Privacy Policy.

Gartner research: Trusted insight for executives and their teams

What is Gartner research?

Gartner research, which includes in-depth proprietary studies, peer and industry best practices, trend analysis and quantitative modeling, enables us to offer innovative approaches that can help you drive stronger, more sustainable business performance.

Gartner research is unique, thanks to:

Independence and objectivity

Our independence as a research firm enables our experts to provide unbiased advice you can trust.

Actionable insights

Not only is Gartner research unbiased, it also contains key take-aways and recommendations for impactful next steps.

Proprietary methodologies

Our research practices and procedures distill large volumes of data into clear, precise recommendations.

Gartner research is just one of our many offerings.

We provide actionable, objective insight to help organizations make smarter, faster decisions to stay ahead of disruption and accelerate growth.

Tap into our experts

We offer one-on-one guidance tailored to your mission-critical priorities.

Pick the right tools and providers

We work with you to select the best-fit providers and tools, so you avoid the costly repercussions of a poor decision.

Create a network

Connect directly with peers to discuss common issues and initiatives and accelerate, validate and solidify your strategy.

Experience Technical Professionals conferences

Join your peers for the unveiling of the latest insights at Gartner conferences.

©2022 Gartner, Inc. and/or its affiliates. All rights reserved. Gartner is a registered trademark of Gartner, Inc. and its affiliates. This publication may not be reproduced or distributed in any form without Gartner’s prior written permission. It consists of the opinions of Gartner’s research organization, which should not be construed as statements of fact. While the information contained in this publication has been obtained from sources believed to be reliable, Gartner disclaims all warranties as to the accuracy, completeness or adequacy of such information. Although Gartner research may address legal and financial issues, Gartner does not provide legal or investment advice and its research should not be construed or used as such. Your access and use of this publication are governed by Gartner’s Usage Policy. Gartner prides itself on its reputation for independence and objectivity. Its research is produced independently by its research organization without input or influence from any third party. For further information, see Guiding Principles on Independence and Objectivity.