x
Data science and machine learning can have profound impact on a business, and are becoming critical for differentiation and sometimes survival. Being able to quickly identify that impact into one of five categories this research presents will help data and analytics leaders further drive results.
Data and analytics leaders responsible for analytics and BI strategies should:
By 2020, more than 40% of data science tasks will be automated, resulting in increased productivity and broader usage by citizen data scientists.
As data is already everywhere and consistently growing in volume and complexity, so data science problems are becoming increasingly prevalent. Some organizations are facing a large number of use cases to which data science could be applied. To better cope with the sheer mass of projects, some leading organizations are starting data science teams whose general mission is to become a shared resource across the organization.
Analytics themes still outrank in executives priorities other popular technology topics. Organizations are actually funding that curiosity by increasing their investment in analytics from $31 billion in 2013 to an estimated $114 billion in 2018.1 The same study also confirmed that 60% of executives in 2016 believed that analytics would disrupt their industry within the next three years and it has. According to McKinsey Global Institute, such disruption, fueled by analytics, could actually generate much bigger benefits and savings. In the U.S. for example, where healthcare spending is 18% of GDP, savings could amount to $600 annually per person, or 1% to 2% of GDP. In transportation, thanks to data-driven thinking and innovation initiatives (including high-scale real-time analytics), improvements to inefficient supply-demand matching could potentially create between $850 billion to $2.5 trillion in economic impact.2
At their level, leveraging such radical societal changes and disruptions, organizations with data science expertise can expect significant returns. Figure 1 summarizes the impact of disruption levels (and type of business value) brought about by data science teams.

LOB = line of business; ROI = return on investment; SWAT = special weapons and tactics
Source: Gartner (October 2017)
At the macrolevel, data and analytics leaders can use data science projects to deliver the following high-level business impacts, which we discuss throughout the note in more detail:
At the microlevel, of course, data science projects and teams can contribute in many more ways:
Figure 2 summarizes the top recommendation per area of impact, which we discuss in more detail below.
BU = business unit
Source: Gartner (October 2017)
Without data scientists and their knowledge, many issues surrounding the digital business age will remain unresolved possibly even untouched. Data scientists frame complex business problems as machine-learning or operations research problems. Data scientists know which new information sources should be collected or acquired from external sources, to solve old and pivotal business issues in radically new ways. Some of those iconoclastic ideas can find their way to the most unexpected places; think of Moneyball, the 2003 book and 2011 movie, where sabermetrics was popularized to completely question the old method of evaluating the performance of individuals and teams in baseball.3
There are many more examples of disruptive projects and new business moments (see Note 2 for a business moments definition) that are made possible through data science.
Case Examples: Innovation
Companies also use data and the corresponding analytics in novel ways. For example, Progressive was one of the first insurers to create an insurance product that used GPS-based location intelligence to keep it better informed about the actual risks against which it is insuring.
Many online companies have been masters of data-driven innovation. The likes of Amazon, Google, Airbnb, Uber and Facebook constantly introduce new systems to collect better information. This enables them to create better or new services.
Recommendations for data and analytics leaders:
Data scientists must engage with big data expeditions, especially when there is no clear objective other than to explore the data for insights and tidbits. Such expeditions are a form of inductive thinking or inductive reasoning (see Note 3) an example of "letting the data speak." The process can be tactical and ad hoc. Alternatively, it can be part of a more systematic practice in which you give the data science team or lab (see Note 4) a data dump for diving into and exploring. The lab then looks for anomalies, seeking something new. It then drills deeper into the shape of the data using more-advanced techniques, which might include cluster and factor analysis, anomaly detection, regression, decision trees, Monte Carlo simulation and link analysis.
Case Example: Exploration
The objective of data exploration is always to discover which events are drivers or inhibitors of other events, or of good or bad outcomes (such as reducing equipment failure and increasing customer satisfaction). It could also lead to gaining an understanding of events that could be new customer touchpoints or engagement points. Such information could be used to foster data-based innovation.
However, these kinds of projects can be a bit like fishing expeditions. The available data may give hints about what you may gain from the process or give you a better understanding of underlying business mechanics. They could also help you uncover very valuable data assets seen to that point as merely data side effects (like in the Japanese maritime service provider case). Finally, those projects could validate that the data is clean or point to additional data sources to enhance internal sources.
Recommendations for data and analytics leaders:
Data science and especially machine learning excel in solving complex, data-rich business problems where traditional approaches, such as human judgment and exact solutions, are increasingly showing their limits due to the escalation of problem complexity and ever-expanding volume of available data. Data science methods have been proven to often deliver superior results, when the space of critical variables is highly dimensional and very noisy.
Data science teams could tackle hundreds of new business problems. Companies are already using data science teams for tasks such as:
Case Examples: Prototyping
Recommendations for data and analytics leaders:
Most data scientists work in the production part of their business. In such areas, established models are already "in production." For example:
Case Examples: Refinement
As is the way in all these use cases, organizations must constantly improve their data science practices as new data becomes constantly available, as new products are created, and as consumers or ecosystem partners share data on the usage of these products. Other improvements are induced by customer behavior changes not only day by day or through different seasons, but also year after year, through competition, the zeitgeist and an ever-changing marketplace. Data science teams must also adjust to fast and constant changes around customer touchpoints, with new devices and wearables regularly released by equipment manufacturers and quickly adopted by consumers. Finally, new customer contextualization strategies can lead to better results, and require many existing models and data sources inputs to be adjusted.
Recommendations for data and analytics leaders:
It is not always possible (almost by definition) to avoid a crisis; its causes might be unpredictable or led by a priori uncorrelated events. This situation is a variation of the exploration category. Many analytics projects are triggered by crises whose symptoms are usually well-identified, such as:
This means that the data science team has to identify "only" the cause, which narrows the datasets it must scrutinize.
Everything else in this use scenario is very similar to the work the data science lab does in "exploration" mode that is, the lab does not know at the outset whether it can identify the cause of the problem. If the events are totally uncorrelated or rarely occurring issues, the lab may never be able to identify the cause.
Basic data discovery/self-service BI can often help. However, a deeper dive by a data science team can extract more from the data about what is really happening. For example:
Recommendations for data and analytics leaders:
From tactical and immediate impacts to strategic transformations and even disruptive ideas, data science and machine-learning projects can exert a profound influence on an organization. Impressive business impacts have been documented across industries showing that these technologies are becoming critical factors of differentiation and sometimes survival. Being able to quickly identify and categorize that impact can further improve on those already outstanding results and contributions.
| BI | business intelligence |
1 "Cracking the Data Conundrum: How Successful Companies Make Big Data Operational," Capgemini Consulting, 14 January 2015.
2 "The Age of Analytics: Competing in a Data-Driven World," McKinsey Global Institute, December 2016.
3 M. Lewis, "Moneyball: The Art of Winning an Unfair Game," W. W. Norton & Co., 2003.
4 "How Retailers Can Keep Up With Consumers," McKinsey Quarterly, October 2013.
5 "ORION Backgrounder," UPS
6 "IBM Puts Its Faith in Watson," E-Commerce Times, 20 January 2014.
7 "Pratt & Whitney Taps IBM to Capture Value of Big Data to Improve Aircraft Engine Performance," IBM, 17 July 2014.
8 "ClassNK, IHIMU, DU and IBM Develop Ship Maintenance Software,"The Maritime Executive, 12 November 2012; "ClassNK Develops Ship Maintenance Software With IHIMU, DU and IBM," gCaptain, 12 November 2012.
9 The Diabetic Retinopathy Detection competition on Kaggle started on 17 February 2015 and is due to finish on 27 July 2015. The California Healthcare Foundation sponsored it with a reward of $100,000.
10 "Memphis Police Department Reduces Crime Rates With IBM Predictive Analytics Software," IBM, 21 July 2010.
11 "Honda, Watson IoT and Formula 1," IBM, 22 November 2016.
12 "Infinity Property and Casualty Builds a Smarter System for Fraud," InformationWeek, 30 November 2011.
13 "Why You're Not Getting Value From Your Data Science," Harvard Business Review, 7 December 2016.
Source: Gartner Research Note G00343858, Erick Brethenoux, Alexander Linden, 19 October 2017
Data science is the discipline of extracting nontrivial knowledge from all kinds of data, to improve decision making. It involves a variety of steps, ranging from business understanding and data preparation to building and deploying analytic models. It is, to some extent, a replacement term for data mining, but is also much more: data science is the unification of several quantitative disciplines (statistics, machine learning, operations research, computational linguistics and others). For the first time, people trained in these different disciplines are all willing to unite behind the banner of data science a very profound development.
During the past year, this notion of data science has become more widely used, and many more academic institutions now offer data science courses and degrees. In addition, organizations hiring data scientists and building data science teams and labs are on the rise. Gartner expects that within a few years, the term "data science" will gain widespread recognition as an umbrella term for many forms of sophisticated analytics.
Organizations that want to increase the maturity of their analytics and extend their portfolio of analytics capabilities need to improve their data science skills. They need to leverage new data sources and demonstrate business value using predictive and prescriptive (and often diagnostic) capabilities. However, organizations must recognize that data scientists are in very short supply recruiting them internally may be difficult, but not impossible. They must also leverage their "citizen data scientists" in their lines of business to increase the reach and impact of analytics.
Data science drives a vast array of use cases across all industries for example, customer relationship management, supply chain management, optimization and automation of diverse production processes, drug research, quality and risk management, smart cities, smart systems, and many more.
Gartner defines a business moment as a transient opportunity that is exploited dynamically. It is very short in duration perhaps only seconds, depending on the nature of the opportunity. This catalyst sets in motion a series of events involving people, businesses and "things" that span multiple industries and multiple ecosystems.
Inductive reasoning aims at creating broader generalization from observations. Even though the facts that produce the generalization can be true, the generalization itself might not always be accurate. For example, if it has been sunny each time you have visited Dusseldorf in Germany, you might conclude falsely that it is always sunny in Dusseldorf.
A data science lab is a team disconnected from but close to the BI competency center. Its individual members usually have different skills. For example, these might be in:
A data science team becomes a "lab" when you provide it with resources, such as server and storage sandboxes or relief from daily workload. It often has a ratio of solutions to "dead-end" efforts in the region of 1:10.