New research why service catalogs are the blueprint for the future of value management

October 8, 2020 — Peter Nichol published new research highlighting how services catalogs provide the blueprint for the future of value management.

Executives are conflicted about what to do with failing PMOs. They focused on project volume and forget about project value.

As organizations adopt value-based frameworks for delivery new skills are required for success. It’s no longer acceptable to be leading a portfolio that is 90% focused around data and not be able to name a single data cleansing tool or technology. CIOs are looking for executive and business information officers to champion and drive change. It’s hard to do that when you’re not in the know.

By being transparent in the services the new value management office can provide to the organization business partners can begin to understand how to consume there organizationally provided services to maximize value.

Peter’s research envisions the future of a services catalog that is centered around value management offices. Companies are shifting from project delivery to continuous value delivery. To do this successfully the value management office needs to identify services that are core capabilities and defocused on non-core service. Putting in place a service catalog for your vale management office will take the guesswork away from your business partners and make it clear services the office provides.

The paper explains specific examples of how to stand up a service catalog for a value management office and the benefits that will result.

DOWNLOAD THE FULL RESEARCH PAPER – SERVICE CATALOGS: BLUEPRINT FOR THE FUTURE OF VALUE MANAGEMENT

Abstract

Abstract — This paper aims to present a unique approach to implementing a project-management-office service catalog for organizations focused on continuous value delivery. Business information executives and officers must validate investment decisions and demonstrate value achieved. The process of gaining additional organizational and cross-functional executive buy-in is especially difficult when team members, leaders, and executives don’t understand what services the organization’s project-management office provides. The traditional project-management office—centered around processes and templates—is being transformed into a new-age, value-management office that’s hyper-focused on the value realized. Innovators understand and appreciate that if the services the project-management office provides are vague, business partners won’t consume them. The act of creating a service catalog allows for a tailored or agile approach to delivery. This model accepts that not all projects are created equal. More specifically, leveraging a value-management service catalog refocuses the organization. Core capabilities are provided through the catalog, and investments in these areas are doubled down. Alternatively, capabilities that fall outside the core capabilities of the value-management office are evaluated for outsourcing. This shifts the traditional project-management office from a cost center to a value center and makes services offered transparent to downstream consumers.

What will you learn?

  • Why services catalogs are a key element of a functioning value management office?
  • Practical ways to make the services provided to internal customers more transparent and easier to digest.
  • Why CIOs are looking for leaders that drive strategy with innovation?

New research on value management as an organizational capability

October 4, 2020 — Peter Nichol published new research highlighting how value management can be applied organizational as a strategic capability.

Innovative CIOs are realigning project management offices (PMOs) to drive strategic value maximization.

Traditional project management offices (PMOs) focus on process sets, standard templates, and delivering more projects. This focus is important and necessary. Conventional PMOs are not trained, educated, or equipped to focus on strategic projects diving data science, next-gen technology, and business transformations.

The value management office focuses on the most strategic organizational projects and ensures that value is maximized.

Peter’s research explores how innovative companies are reframing the concept of a PMO. These companies are shifting from a “projects delivered” mindset to a “value delivered” mindset. How do you know if your team is a traditional PMO or a value management office? If you as an executive are hearing about process reengineering efforts, communities of practice, or delivery methodologies you’re in a traditional PMO. The focus and priority of traditional PMOs are not on your most critical strategic initiatives—but it should be.

The paper explains multiple value frameworks to discover, capture, and present value to ensure your strategic initiatives achieve optimal results.

DOWNLOAD THE FULL RESEARCH PAPER – VALUE MANAGEMENT as an ORGANIZATIONAL CAPABILITY

Abstract

Abstract — This paper aims to present a practical approach to institutionalizing business value realization as an organizational capability. Business and technology executives are under continuous pressure to justify investments and validate business outcomes. When executives are asked to explain the value generated by their team, department, or company, only then do they realize they lack the in-house expertise to produce the level of quantifiable outcomes expected. Progressive leaders know that the process of discovering, realizing, and optimizing value is an essential organizational capability. This paper presents a practitioner approach offering multiple value management frameworks to assist executives in making informed decisions when building a value-management office. Traditional project-management offices maintain project-management standards, establish best practices, define common languages, develop a resource-management view, and create and maintain project artifacts and tools. Unfortunately, the heavy burden of these core project-management office activities results in no time for strategic planning. The value-management office oversees the execution of all the company’s strategic programs. The company’s hyper-focus on connecting strategy to execution ensures the value maximization of its strategic initiatives.

What will you learn?

  • Why the most innovative leaders are reassessing their PMO functions? What value frameworks are working for agile companies? Which are the most impactful options to hyper-focus on your most strategic programs to maximize value.

New research on applying Six Sigma to quantify outcomes in portfolio delivery

September 14, 2020 — Peter Nichol published new research highlighting how Six Sigma can be applied to quantify portfolio delivery and execution.

As a leader, you’re either joining a company to lead a new team or working internally to improve your existing team. In both scenarios, results matter.

In the quest for improvements, the fundamentals are often overlooked. Once these fundamental elements are in place the objective to quantify improvements becomes more difficult. We need tools that are fit-for-purpose to make portfolio transformations sticky.

Peter’s research connects the basics of delivery for advanced portfolio management practitioner approaches to quantify organizational transformations. The paper explains how to capture and quantify the organizational value to communicate clearly to executive leadership.

DOWNLOAD THE FULL RESEARCH PAPER – APPLYING SIX SIGMA TO QUANTIFY OUTCOMES IN PORTFOLIO DELIVERY

Abstract

This paper aims to present a practical approach to applying principles of Six Sigma to statistically control portfolio delivery within a program-management office. Portfolio executives are continuously charged with transforming a low-performing team into a high-performance team. Often, the methods of transformation are based on experience, and these imprecise methods frequently produce inconsistent results. The transformation from a low-performing team to a high-performing team can be accelerated by using a quantified methodology to define, measure, analyze, improve, and control desired outcomes. This paper presents a practitioner approach to maximizing the organizational outputs from an agile program-management office by leveraging statistics and mathematical principles to tighten process variance. By applying a fit-for-purpose approach to corporate and operational excellence, portfolio executives can dial in and remediate the root causes of portfolio inefficiencies for maximum agility and value realization.

What will you learn?

  • How to conduct a quick-hit portfolio assessment? The foundational for quantification of results.
  • Explanation and examples of four key tools that can be applied to measure portfolio outcomes.
  • Highlights of 21 measures and brief examples of how to apply them to performance.

Building a world-class data-science team

Data science isn’t about special people in special places. It’s about teams.

We’ve all witnessed the wave of innovations that has washed over business models of late. These innovations didn’t surface as the ideas of individuals. The architecture of businesses, business interactions, data collections, and the use of information is so complex that a single individual in a mid- or large-size company wouldn’t have the knowledge to understand all elements required to make the idea a practical reality.

Also, it’s long been proven that heterogeneity enhances group brainstorming. More diverse groups produce better ideas. This concept is especially important when we’re designing data-science teams.

A part of the whole

You’ve probably been told you need to hire one of two individuals. The first is an astute data developer with a grounded understanding of Python, SQL and data storage, PostgreSQL, Unix and Linux command-line knowledge (mainly to run and schedule cron jobs); Python data libraries (Pandas, Scrapy, Keras, Matplotlib, TensorFlow, Bokeh, Scikit-learn, etc.); Flask, Bottle, and Django to host the analysis of the database as a RESTful API, AWS, or Azure-hosting framework; and, of course, AngularJS for presentation results and DS.js to create data visualizations.

If, for some reason, you botch the hiring of the astute data developer, you only have one other alternative—to hire a data academic. This is a theorist who pontificates about changing the world with data but whose experience rarely ventures outside the educational setting and has few practical applications. The data academic understands core statistics, categorical data analysis, applying statistics with R (multiple linear regressions, qualitative predictors, linear discriminant analysis, resampling methods like k-fold cross-validation, hyperplanes, hierarchical clustering), sequential data models (Markov models, hidden Markov models, linear dynamical systems), Bayesian model averaging, and machine-learning probabilistic theory. You hope some of this learning is connected to causality.

Are these two roles important for a data-science team? Of course. If you, by chance, hire both these roles, do you have a data-science team? No, you do not.

Let’s begin with the origins of data science and, from there, we’ll lead into the critical capabilities required to build a world-class data-science team.

From there to here

The foundation of data science originated with five key areas:

  1. Computer science: the study of computation and information
  2. Data technology: data generated by humans and machines
  3. Visualizations: graphical representation of information and data
  4. Statistics: methodologies to gather, review, analyze, and draw conclusions from data
  5. Mathematics: the science of the logic of shape, quantity, and arrangement

Computer science evolved from Turing machines to cybernetics and information theory by the 1900s. Tree-based methods and graph algorithms surfaced in the 1960s. By the 1970s, computer programming and text or string searches popped up. Data mining, data classification, and similar methods pushed us into the early 2000s.

Data technology began before the 1800s with binary logic and Boolean algebra with punch cards. IBM introduced the first computers in the 1940s as DBMS matured. Removable disks with relational DBMS followed into the 1960s. By the mid-1970s desktops, SQL, and objective-oriented programming was the norm. In early 2001, statistical modeling started to emerge, balancing the stochastic data model by using algorithmic models and treating data mechanisms as unknowns.

Visualizations arose prior to the 1800s with cartography and astronomical mapping of charts. Line and bar charts came out in the 1800s, and statistical graphics were depicted by the mid-1800s. The box plot was created in the 1970s, and word or tag clouds started to form in 1992.

Statistics entered the 1800s with theories of correlation, probability, and Bayes Theorem. In the 1900s the concept of regression, times series and least-squares made the rounds. The 1900s introduced the foundation of modern statistics with the hypothesis and design of experiments. By the mid-1960s, we had Bayesian methods, stochastic methods, and more complex time-series methods such as survival analysis and grouping time-series data. Through the 1980s, more developments occurred in Markov simulation and computational statistics, allowing us to better understand the interface between statistics and computer science. By the late 1990s, decision science, pattern recognition, and machine learning were starting to take shape.

Mathematics entered the 1800s with calculus and logarithms. Next, Newton-Raphson introduced optimization methods. By the 1930s, the military had started to adopt theories for manufacturing and communications. The 1960s were booming with networks, automation, scheduling, and assignment problems, which have only matured in recent years.

Understanding the origins of data science helps demystify it and allows you to develop a concrete capability in your company.

Data-science capabilities

Finding success with data science comes down to four factors: people, data, tools, and security.

The most important elements of your data-science team are the people and the capabilities they enable. Next, to get insights—even with the best people—we ultimately need data and access to data. Usually, data is siloed across teams, departments, and systems, making gaining access difficult. Assuming we have the people and access to the data, next, we need tools. Performing analytics necessitates computational and data-storage resources. Fortunately, today we have many open-source options that are more than adequate. Lastly, data security and privacy protection are crucial as data becomes more centralized. With this convenience comes access—which, in the wrong hands, creates risk.

With this understanding of the origins of data science, it’s fascinating to see the mix of conventional capabilities aligned with the less traditional data-science skills that are required for success. Let’s cover examples of data-science capabilities and complementary data-science team skills that are found within world-class data-science teams.

Data-science capabilities

Data-science team skills

  • Stakeholder management: business-relationships management, project management
  • Storytelling ability: executive presence, presentation skills
  • Business communications: clear and timely communication, governance
  • Consulting: need analysis, solutions aligned to goals
  • Problem-solving: lean six-sigma, agile
  • Topical analytics techniques: statistics, root-cause analysis, statistical-process control, value-stream mapping, flows
  • Domain expertise: knowledge of the data, who’s using it and for what purpose
  • Business analysis: experience evaluating and modeling business cases

The ultimate success of a data-science team depends on how well expectations are managed. When expectations are met, the data-science team will be viewed as impactful. Inversely, a weak perception of delivery is a significant reason why data-science teams eventually get disbanded—they focus on what’s cool, not what’s most impactful for the business.

The hidden art of storytelling

It’s idealistic to believe data-science teams can find value in data from day 1, but, eventually, they’ll connect data to new insights. However, often that data is layered across hundreds or thousands of sources, and the team might be months or years away from collecting it all. Most data-science teams begin with a simple set of questions. These questions are challenging but tangible to answer. This approach also limits the data set required to be integrated into an initial proof-of-concept. Sample questions might include some of the following:

  • Which applications in our portfolio have the most significant security risk?
  • Why is the Durham, NC location the most profitable?
  • What type of patient visit will be the costliest next quarter?
  • Is antibody A or antibody B more likely to achieve FDA approval?
  • Which drone should we bring in first for preventive maintenance?

Building a world-class data-science capability isn’t about individuals; it’s about assembling your team. It’s crucial to ensure that essential data-science capabilities and data-science skills are part of your team design. To tap into the power of data science, we require teams to not only extract insights from data but also tell a compelling story. Quite often, we’re left with a lot of data, confusing insights, and no story. Make sure that the team you build can tell a story.

BRM Institute Recognizes Peter Nichol as a 2020 Top Business Relationship Manager (BRM)

Peter B. Nichol was recognized as a 2020 Top BRM on February 9, 2020 by the BRM Institute.

Each year’s Top BRM list is revealed during #BRMWeek in February. BRM Institute’s global BRM community recognizes the top BRMs that have achieved success through their BRM efforts, strengthened the global BRM community and BRM discipline, enriched lives through excellence in BRM within their organizations, and contributed to the community on a local, national, and global level.

The BRM Institute’s global BRM community recognizes the top BRMs that have achieved success through their BRM efforts, strengthened the global BRM community and BRM discipline, enriched lives through excellence in BRM within their organizations, and/or contributed to the community on a local, national, and global level.

The 2020 Top BRM awards were evaluated based on the following major criteria:

Overall Impact:

  • Explain the impact the BRM has contributed to others around the globe!
  • Share the outstanding accomplishments the BRM has delivered
  • Highlight the amazing organizational accomplishments the BRM has delivered including notable contributions, improvements, discoveries, how they have demonstrated the BRM Code of Conduct, etc.

Leading with Purpose:

  • Bring more personal purpose in the workplace.
  • Identify the convergence of the personal and organization purpose to lead towards happier individuals, stronger relationships and durable communities.
  • Demonstrate how the BRM satisfied their personal or organizational purpose through their work.

Delivering Value:

  • Articulate the value delivered by the BRM engagement
  • Quantify the value realized through BRM organizational empowerment
  • Explain the impact of the value delivered through the BRM’s efforts.

Peter Nichol is a highly respected BRM as the Director, Research and Development, IT Portfolio Management, at Regeneron Pharmaceuticals, a biotechnology company with a market capitalization of $40 billion.

As a BRMP®, CBRM®, MBRM®, and organizational change leader, he has fully embraced the BRM Institute’s approach for business partner value realization.

Peter led a departmental wide initiative to enable the Resource and Demand Management capability to align demand and capacity. Previously, the Research and Development IT department of 404 employees and contractors had no way to match incoming work from business partners (demand) to the provider’s ability to deliver (capacity). This inability to anticipate, predict, and model demand created enormous resourcing and staffing problems throughout the year as we enabled science to medicine across twenty-three scientific areas.

With strong support from a visionary executive team, Peter partnered and educated six departmental BRMs, twenty-nine program and project managers, and eight functional managers who together enabled the science of getting critical medicine to patients.

___

“Your efforts are solidifying the success and adding to the credibility of BRM as a discipline on a global scale. Being recognized as a top BRM recognizes your impact on people, BRMs, and organizations everywhere. Thank you for making a difference!” — Aaron Barnes, CEO BRM Institute

___

“As a Master BRM, I take enormous pride in shaping the future of the BRM discipline. It’s a rare experience when you’re arm-in-arm with fellow BRM practitioners collaborating to design the foundation of relationship management to be used for generations to come. I would be remiss if I didn’t send out a hearty thank you to my amazing team of leaders that day-over-day are enabling science through technology. Success is always shared. My organization’s executive leadership team and fellow BRMs have always been supportive and open to ideas that position us better for tomorrow but require change today. It’s amazing being part of #OneTeam. A big shout out to all the global leaders who have contributed to advancing the BRM profession this year! I look forward to seeing you all at the BRMConnect in Boston, Massachusetts and the BRMConnect in Amsterdam, Netherlands in 2020!” — Peter B. Nichol

References

BRM Institute. (2020). 2020 Top BRMs. https://brm.institute/2020-top-brms/

Measuring the value of data quality

Data intelligence, integrity, and integration are aspects of every CIO strategy in some fashion. It’s not possible to achieve data insights without first cleaning up your bad data. To do that, you must define the value of that data.

The effect that bad data has on our organization is personal. It’s time we put a value on data and, more specifically, a value on bad data. Let’s fix this together.

Wikibon, a community of open-source, advisory-sharing practitioners, estimates that the worldwide data market will grow at an amazing 11.4% CAGR between 2020 and 2027, reaching $103 billion by 2027. That’s the upside. The downside is that IBM estimates that bad data currently costs the US $3 trillion.

One trillion, three trillion, 10 trillion—that sounds like a lot. But do you understand the impact of $3 trillion? Likely no. However, what you do understand is the personal impact that bad data issues have on your family. Yes, your family.

It’s 7 pm, and you’d told yourself you’d wrap up early for a change, but then that email comes in. It’s from your business partner, who just got around to opening the executive status report or those financials she finally had time to read through. Either way, a quick look at the data, and you already know it’s bad and can’t be real. So much for getting out early.

The real cost of bad data isn’t the cost to reacquire a data set that’s been canned or the cost to implement a new system to replace the old. Every executive knows the real cost is personal: time away from friends and family. This is true both for that person, who’s buffering a wave of irate executives, and for their team, which is triaging the root cause and adding weight to an already lopsided work-life balance.

The business impact of poor data

On the topic of data, the first thing that comes to mind is the dimensions of data. These are concepts that include format, authority, credibility, relevance, readability, usefulness, reputation, involvement, accessibility, completeness, consistency, timeliness, uniqueness, accuracy, validity, lineage, currency, precision, accessibility, or representation. These are valid and, at times, very important. They are not, however, the most important consideration. We need to frame the problem of bad data from the lens of our business partners.

Data should be listed as an asset or liability on the company balance sheet. We don’t consider it a hard asset, and we don’t depreciate purchased data. Until that happens, we need to look at the utility of the data and the value we expect to derive from it.

Let’s start by looking at the categories where poor data would inhibit business success. By viewing data errors within a classifying scheme, we begin to quantify the value of bad data. A simple approach is to use four general categories:

  1. Financial
  2. Confidence
  3. Productivity
  4. Risk

Financial primarily focuses on missed opportunities. This could be in the form of decreased revenues, reductions in cash flow, penalties, fines, or increased operating costs.

Confidence and satisfaction have an internal and external impact. Internal impacts on confidence would include employee engagement, decreased organizational trust, low forecasting confidence, or inconsistent operational or management reporting. External confidence addresses how customers or suppliers feel about delays or impacts to their business. Ultimately, bad data leads to bad or incorrect decisions.

Productivity impacts throughput. Therefore, increased cycle time causes drops in product quality, limited throughput, or increased workloads. These are all aspects that involve outcomes.

Risk and compliance center on leakage. This leakage could be value leakage, investment leakage, competitive leakage, or, more seriously, data that directly affects a regulation, resulting in downstream financial and business impacts.

This framework helps to delineate between big-data issues that affect either “running the business” or “improving the business” and those issues that are inconvenient but largely insignificant.

Categorizing the impact

Before we tackle the business expectations for bad data and how poor data impacts our business, we need to create some subcategories to continue our assessment.

Using the four core categories above, let’s deconstruct these into more specific areas of focus:

Financial

  • Direct operating expense
  • Resource overhead
  • Fees
  • Revenue
  • Systems of production
  • Delivery and transport
  • Supplier services
  • Cost of goods sold
  • Demand management
  • Depreciation
  • Leakage
  • Capitalization

Confidence

  • Forecasting
  • Reporting
  • Decision-making
  • Satisfaction (internal and external)
  • Customer interaction
  • Supplier or collaborator trust

Productivity

  • Workloads
  • Throughput
  • Output quality
  • Staffing optimization
  • Asset optimization
  • Service-level alignment
  • Efficiency
  • Defects
  • Downstream

Risk

  • Financial
  • Legal
  • Market
  • Systems
  • Operational downtime
  • Reputation loss
  • Testing
  • Vulnerability remediation
  • Regulatory, compliance and audit
  • Security

Direct operating expenses involve direct labor and materials. Resource overhead accounts for additional staff to run the business such as recruiting or training personnel. Fees are penalties for mergers and acquisitions or other service charges. Revenue pulls in any missed customer opportunities like customer retention. The cost of goods sold is the standard product design and cost of inventory, etc. Depreciation covers inventory markdowns or decreases in property value. Leakage is mainly financial and involves fraud and collections; however, this can be extended to include value leakage such as the impact of not gaining the efficiency from a system that targets saving 250 team members 20% of their day through automation. Capitalization quantifies the value of equity.

Forecasting is the impact of bad decisions based on financial data such as actual vs. budget or the resource management cost from not proactively staffing. Decision-making is usually event-driven and links the time to make a decision and the quality of the decision. Satisfaction is largely the internal service relationship between providers and consumers; e.g., shadow IT, outsourcing, etc. Supplier or collaborator trust measure the optimized procurement process and the vendor confidence in the provider.

Workloads target an increase in work over a baseline. Throughput measures the volume of outputs, typically in cycles; for example, the time required to analyze a process or time taken to prepare data for input into a process. Output quality is about trust in published information; for example, is the data in the status report trusted by the leaders that receive it, or is that report dismissed due to mistrust? Efficiency looks at avoiding waste in people, processes, or technology. Defects highlights imperfections from the norm; this could be a process, system, or product. Downstream evaluates the next process in the chain and the delays experienced because of upstream data-quality issues.

Financial risk targets the bottom line in either hard or soft losses. Legal risk removes protections or increases exposure. The market could involve competitiveness or the loss of customer goodwill. System risk covers delays in deployments. Testing risk is the loss of functionality due to non-working components being released into production or released with defects. Regulatory and compliance risks often deal with reporting or, more importantly, the data and data quality that’s being officially reported. Security risk—a growing concern—addresses data that impacts internal customers (employees) or external customers (suppliers).

These aspects all align poor data quality with negative impacts on the business. Typically, organizations will log quality issues in a tracking system. This is useful to help quantify the impact or play back the value of the data management organization or the office of the chief data officer.

Ask questions

What makes poor-quality data a critical business problem? If the problems are just inconvenient, action doesn’t need to be  taken. Use these questions to elicit ideas to quantify the impact of bad data in your organization:

Financial

  • Which transformation efforts are on hold; e.g., data lake, analytics?
  • How much time is consumed by cleaning portfolio or financial data?
  • What decisions aren’t being made due to lack of vendor management benchmarking?

Confidence

  • How efficient is the vendor-onboarding cycle?
  • Is there organizational confidence in company-specific data; e.g., patients, genes, products, etc.
  • How easily can suppliers’ or collaborators’ data be validated?

Productivity

  • Where can RPA be used to streamline data processing?
  • How many processes are repetitive and require minimal human intervention?
  • How do we remove waste from our process? What was good yesterday might not be good today.

Risk

  • Do events have lineage? If not, what’s the cost to compile that view on request per event?
  • What’s the cost of a single regulatory misfiling or violation?
  • How can an incorrect risk assessment put the company at risk?

Limitless categories and subcategories can quickly become all-encompassing. Start simple. Use the idea of an N-of-1 or one event. We might be talking about the history of a cell line in biotechnology or the history of changes to portfolio financials. For example, instead of estimating the cost of researching data lineage generally, consider one specific example. Collapse the steps into five core phases and put effort into each. Then identify the people involved at each step. From there, add in a blended rate and multiply the effort of each step by the blended rate. The product is the net cost for an N-of-1 event—the cost of a single event. This methodology is very powerful when addressing bad-data problems.

The flip side

We understand the risk of poor data quality. It’s important to quantify the organizational financial impact of not correcting bad data. Putting in place standard data definitions or organizational data dictionaries can build consistent terminology. Implementing guidance on how data should be reported can help explain permitted values and data that’s inaccurate.

Similar to any other discipline, this process requires training and education. Take time to invest in your department and organization. Your team will thank you for it.

Quantifying the value of your enterprise data science initiative

Unlock the competitive potential for your data. Build a business plan for managing your data-as-as-asset.  Put a value on your data.

Have you built the business case for data-as-an-asset in your organization? Most leaders still have this on their to-do list. It’s not one of those activities you can just crank out in an early-morning brainstorming session on Monday before your leadership team arrives.

Building the business case for data requires self-reflection and collaboration with fellow leaders that have already been there and done that—leaders that have found a way to articulate the value of data to their board. It requires communicating a simple, clear, and rational approach.

Data creates value for your company—or, at least, it should establish a foundation to create and capture value. By starting with the end in mind, you’ll have a better framework for communicating and sharing aspects of your data.

To properly analyze the value of enterprise data, information, knowledge, and wisdom, we need to build the business case for data. This business case has three dimensions:

  1. Cost of data
  2. Value of data
  3. Risk of data

Cost of data

Data is an asset, and it has a cost. Your house, car, boat, and your bigger dreamboat all are assets. To better understand data science and organization-data enablement, we need to reframe how we envision data and how we relate to it.

Inside corporate America, we’ve all heard the phrase, “Spend company money like it’s your own.” The less common parallel of that is, “Maintain company data assets like they’re your own.” As we develop the business case for data science, we quickly hit upon what I call the foundation case for data, i.e., the cost of data. Five principles make up the total cost of data:

  1. Cost to acquire
  2. Cost to use and leverage
  3. Cost to replace
  4. Cost to maintain
  5. Cost of decisions

Similar to your new boat, data-as-an-asset isn’t cheap to acquire. With data, you generally have four acquisition options. First is collecting new data, and often this is the costliest. Second is converting or transforming legacy data. This isn’t a speedy process, but, if done correctly, it can yield useful results. Third is sharing or exchanging data. The sharing of data doesn’t only have to be with new collaborators or business partners. This very well could be accomplished by breaking down internal silos and opening up data sets to new internal partners. Purchasing data is the fourth option. If the desired data set is available, this can be the most economical option in many situations.

Before data is useful, it often needs to be manipulated. Data transformation helps to covert the data from one format to another. Typically, this other format is more useful for enterprise consumption. Reconfiguring the data to account for processes is important, as these workflows can transform or manipulate the data in ways that render the data more valuable or useful. Quality control, validation, and the management of data can make the data more extensible across the enterprise and further aid in decision making.

Often, critical data isn’t replaceable. However, some data the enterprise has acquired can be refreshed from the source. Data can be refreshed by reloading the data or purchasing an updated data set. Loss of data rights (patients revoke consent, for example), corruption of media (supplier impact), or data destruction (flood or another natural event) may serve as the driver to explore the cost of data replacement.

A boat needs its propeller and skeg checked for damage, grease points need to be lubricated, bolts must be retorqued, and the water-pump impeller replaced. Likewise, your data needs routine maintenance. The cost of data can include loading, storing, protecting, formatting, indexing, refreshing, and supporting the data over time. To retain the value of your data asset, it needs maintenance to prevent deterioration.

There’s also an impact or cost associated with decisions based on your enterprise data. What’s the cost to a hospital provider of using the wrong blood type for transfusion during a surgery? What’s the cost of exploring the wrong molecule for a biotechnology company? What’s the cost if a patient’s claim should have been approved, but it was denied by the health-insurance payer? Begin to quantify the types of decisions that are being made with data within your organization and the downstream cost of bad decisions.

Quantifying the cost to acquire, use and leverage, replace, maintain, and make decisions based on data establishes our foundational business case.

Value of data

We can place a value on your boat, and we can also put a value on your data. Establishing a value for your enterprise data, of course, is more complex. However, the same four principles apply:

  1. Time value
  2. Performance value
  3. Integration value
  4. Decision value

Data is most valuable when it’s created, after which it decreases in value over time. Unlike your boat that gradually depreciates over time, the value of data can reach a cliff. Let’s consider the value of the piece of data of knowing the winner of a soccer match. Hours before the match, the value of that data is huge. Yet, one second after the game is over, the value of that piece of data drops to zero.

People drive productivity. A primary business case for data is improved productivity—which means, essentially, making existing processes more efficient through the optimization of those processes using data. It could be as simple as saving people time in the day, or it could be as complex as shifting from low-quality, low-value work to high-quality and high-value work. One example is a biotechnology company saving scientists time. By freeing up scientists from doing mundane data entry, they have more opportunities to perform additional experiments. Without the added benefit of the data-entry time savings, the scientists would need to delay performing additional experiments.

By integrating our data, we can improve data relevance and applicability. Data integration allows us to pull from all relevant data sources and, from there, we can see overall trends among them. Helpdesk data can be rolled up from sites to regions to show spreading regional-usage patterns. A decrease in usage by the field staff of critical business systems can be identified and cross segmented by years on the job to better understand if more experienced field staff have greater internal product adoption and usage. Also, data lineage is enhanced by combining data. Data lineage is the lifecycle that includes the data’s origins and where it moves over time.

The greatest value of data is its ability to be used to make insightful decisions. Everyone today wants to foster a data-driven culture, make data-driven decisions, and be perceived as a data-driven company. What’s your data worth?

Tableau, a data-visualization software company, was purchased by Salesforce for $15 billion in August 2019. CANVAS Technology, a robotics company focused on autonomous delivery of goods through AI, was purchased by Amazon for an undisclosed amount in April 2019. Data Artisans, a large-scale streaming company, was purchased by Alibaba for $103 million in January 2019. Data included in these acquisitions might not have been as significant as that obtained when Microsoft acquired Linked In for $26.2 billion in 2016, but it’s highly relevant. Each of these companies builds core technologies that have value. Don’t kid yourself. Their data was a big part of that decision to acquire and contributed to the value of the company.

Risk of data

If companies are valued on their information portfolios, what’s the financial impact if your enterprise loses its data? It’s almost incalculable.

Assessing the risk isn’t as straightforward as determining the cost of data or extrapolating on its value. Data risk is more nebulous. Generally, there are three principles for assessing risk:

  1. Improper-use risk
  2. Regulatory risk
  3. Bad-decision risk

Data misuse is the inappropriate use of data. GDPR does a good job of using corporate controls to protect personal data. GDPR has identified four concepts of improper use of data: the intrusion of solitude, public disclosure of private facts, false light, and appropriation. These categories extend the modern tort law concept of invasion of privacy. This invasion could be personal, public, or corporate.

Regulatory risk varies by industry. Pharmaceutical companies that produce drugs must comply with Title 21 for reporting adverse events. Banks issuing loans are governed by the Housing and Economic Recovery Act of 2008. Utilities are regulated by the Public Utility Regulatory Policies Act of 1978. Each industry can assess risk based on non-compliance, fines, and brand harm.

A local imaging center mixes up a series of MRI scans for two patients. Patient A now has a clean bill of health. Patient B has a metastatic cervical tumor. What’s the improper-use risk of mixing up the patients’ CDs? We have the cancer mix-up case on one end of the spectrum and not approving a corporate expense report because of a receipt mix-up on the other. Regardless, making bad decisions with data is possible, and we need controls to limit these occurrences.

A good case for data management

Your Uber didn’t show up. You need a plan b. While waiting for the bus, you sit down on a bus-stop bench and find a lotto ticket with a Post-It note attached that says, “I’m a winner!” Are you rich? Maybe, but likely you’ll be at work tomorrow just like you were last week. How much value did you apply to that ticket? How valuable is that piece of data? Probably not too valuable.

The data in your enterprise is an asset. Manage it as an asset. Soon your organization will be making data-driven decisions, improving operating efficiencies, and expanding economic impact. This might even lead to you getting that dream boat.

Applying cognitive science to champion data science adoption

Business relationship managers today have new techniques to make data science stickier. Mix it up for greater data-enablement adoption.

The organization knows that data is the future. Data is required for making the best decisions. Data-driven organizations are more profitable. As a result, they can give back more socially by leveraging data to develop better insights. Then why is it that in our last meeting, data wasn’t used to make decisions? Because change is tough.

Great CIOs serve as evangelists for technology and innovation by identifying new, untapped opportunities to enable business objectives and leapfrog the competition.  We can’t, however, do that alone.

The role of the business relationship manager (BRM) has exploded over the last twenty years. The BRM has always been critical for successful convergence between IT visionaries and business partners, but it was only recently, in 2013, when Aaron Barnes and Vaughan Merlyn started the Business Relationship Institute (BRM Institute), that the concept of the BRM as a champion of our business partners started to take off.

BRMs are positioned to be the champion for data science and enablement initiatives as well. We, as CIOs, need to empower BRMS, and we also need them to think differently.

Tilting the classic lens for change

What if we’re leading change all wrong? The book “Make it Stick: The Science of Successful Learning,” by Peter C. Brown, Henry L. Roediger III and Mark A. McDaniel highlights stories and techniques based on a decade of collaboration among eleven cognitive psychologists. The authors claim that we’re doing it all wrong. For example, we attempt to solve the problem before learning the techniques to do so successfully. Using the right techniques is one of the concepts that the authors suggest makes learning stickier.

Rolling out data-management initiatives is complex and usually involves a cross-functional maze of communications, processes, technologies, and players. Our usual approach is to push information onto our business partners. Why? Well, of course, we know best. What if we changed that approach? This would be uncomfortable, but we are talking about getting other people to change, so maybe we should start with ourselves.

Business relationship managers stimulate, surface, and shape demand. They’re evangelists for IT and building organizational convergence to deliver greater value. There’s one primary method to accomplish this: collaboration.

The BRM should start with a series of data workshops with specific data-management problems to solve. Frame the data-management problems for the leadership teams into four categories:

  1. Data requirements
  2. Data-use cases
  3. Data modeling
  4. Data implementation

These categories will offer a good bench from which to develop questions that business partners can validate from a scientific perspective. They’re building knowledge so they can ideate around existing problems to discover new opportunities.

Interleaving concepts to create texture and knowledge depth

The BRM is tasked with increasing awareness of data-management practices such as acquisition, cleansing, and modeling or with data principles like data independence, integrity, and consistency. In either case, the information is often presented in chunks or concepts that build. As it turns out, this isn’t a great way to communicate a new concept.

Interleaving is a learning concept that describes the process of students mixing, or interleaving, multiple topics while they study to improve their learning. However, blocked practice is what’s classically taught—study one concept, master that, and then—and only then—move on to the next. It’s been proven that learning retention using the interleaving method lasts months, not days. Studying related skills in parallel improves retention.

The classical building approach is AAABBBCCC. First, we teach about AAA. Second, we teach about BBB. Third we teach about CCC. The problem is that, by the time we get to BBB, the concept is so boring we’ve already lost people. Interestingly enough, it’s not that the data-management concepts are too complex but rather the opposite—they’re straightforward and make sense.

Interleaving involves using the ABCABCABC approach. First, we cover each of the three ABC concepts. Second, we cover the ABC concepts again using different examples. Third, we cover the same concepts again, only this time use other data and examples.

Applying this methodology to data science , the BRM exposes business partners to multiple versions of a problem, which changes the problem and complexity. Wait, wouldn’t that confuse folks? Yes, you’d think it would. However, as it turns out, we’re holding their interest for longer and, as a result, they internalize the concepts better. We’re no longer pushing concepts. Our business partners are pulling them from us.

Fluency isn’t the same as understanding

Speaking of data science, transformational change isn’t the same as executing on it. Be mindful of those players in your organization that have a lot to say about data science. They might be fluent in the language of data, yet, somehow, they still don’t get it. They have no history of executing and delivering data initiatives.

To be creative, we need a better understanding of the problem space in which we’re trying to find a solution. Being creative and being knowledgeable are both necessary. It’s difficult to be creative and present solutions to problems without the knowledge or a foundational understanding of the concepts.

Lean on the business relationship managers within your organization to champion change. Challenge them to teach the concepts of data science differently. By shifting from pushing information onto your business partners to having information pulled, you’ll change the conversation from, “Here’s some data you’ll find useful,” to “Where can I learn more about this data concept?”

CIOs are the evangelists for innovation. BRMs are the champions of change. To make your data science initiative sticky, you need both roles to think differently to enable continuous value delivery. How about starting from the concept that learning about data science can be fun? It’s not as crazy as it sounds. All you need is a little creativity.

Design success into the office of the CDO

Every obstacle, hurdle and misstep raises awareness and decreases the likelihood of a recurring event. Use experience and wisdom to avoid the mistakes of others and find success when designing and implementing an office of the CDO.

Your office of the CDO needs a vision. Success won’t sprout from a rock. Leveraging data as a strategic asset can’t be done without defining a strategic approach to that data. Building a data strategy doesn’t prevent you from being agile in your approach. At the beginning, your vision of the organization’s data strategy might be fuzzy. That’s okay.

As you develop your data vision, provide guidance on how to unify business and IT perspectives, and promote value metrics from a data-driven culture, remember that change is welcome. If you’re headed down a path and not garnering the necessary organizational buy-in, change your course. There’s a better path forward. You just need to discover it—without going through the difficult journey that Milton Hershey did.

4 failures to success

The sweetest city in the world is Hershey, Pennsylvania. However, it didn’t start out sweet for Milton Hershey, an American chocolatier, businessman, and philanthropist. Milton didn’t see much use for school and only had a fourth-grade education. At the age of 14, he started an apprenticeship for a local printer. That was short-lived, and he was later fired for dropping his straw hat into a machine. He quickly was paired in another apprenticeship with a confectioner named Joseph Royer, based in Lancaster, Pennsylvania.

During his four years with Royer, Hershey learned everything he could about the candy business. Then, at the age of nineteen, he moved to Philadelphia to start his confectionery company. Unfortunately, he couldn’t make it in Philadelphia because of heavy supplier debts, so he moved to Denver, New York, Chicago, and eventually New Orleans, but never made a success of his business in any of them.

Throughout his journey from location to location and with each failure, Hershey was learning. He discovered that fresh milk is vital to good candy. In 1886, at the age of twenty-nine, he was penniless. Eight years later, he sold Lancaster Caramel Company for $1 million and turned to chocolate, where he soon founded the Hershey Chocolate Company in what would become Hershey, Pennsylvania.

Hershey became a successful businessman. When he died, he signed over all his company shares of the Hershey Chocolate Company, via a trust, to the Hershey Industrial School, which was an orphanage he founded. The shares were valued at $60 million. For reference, that same year, Coca-Cola sold for $25 million.

It’s fascinating what an individual can do with a vision and passion. Milton was famous for saying, “The caramel business is a fad.” At the time, he sold Lancaster Caramel Company, profits were at all-time highs. Yet, he sold his business and went into chocolate. Understand what makes your business successful.

4 pillars of success

To discover your company’s data-strategy vision, build around the four core aspects of the office of the CDO. These require careful set-up to enable your successful data office initiative:

  1. Governance
  2. Standards
  3. Architecture
  4. Talent and culture

Governance defines the process, establishes forums, and promotes strategic communication. Architecture creates the guardrails for reference architecture, identifies common lexicons, and develops an edge for the data platform. Standards elaborates operating procedures, specifies technical standards, and identifies design precepts. Talent and culture span education and training, skills, roles, and responsibilities of agile teams—which are the human aspect of change.

These four areas are the pillars of a successful office of the CDO.

Adjusting how we look at opportunities—including data—doesn’t happen overnight. We’re transforming an organization. To do this, we must lean on these four core pillars of a successful office of the CDO, which we’ll now discuss in detail.

Data governance: discovering better insights

Governance offers accountability for data, business agility, better compliance, IT agility, and stronger insights. Setting up seamless data governance facilitates stakeholder interactions and makes the decision-making process easy. Without this framework, decisions will spin, and participants will become frustrated that engagement isn’t consistent or uniform across business areas or divisions.

There are six areas of interest when we’re talking about data governance and the role of the office of the CDO:

  1. Enterprise data governance
  2. Data-quality management
  3. Master-data management
  4. Metadata management
  5. Data-protection management
  6. Data strategy and diagnostics

Enterprise data governance is the management of business data. This includes the overall management of the availability, usability, integrity, and security of enterprise data. Data quality management is a set of practices that aims at generating and maintaining high-quality data used for decision-making. The quality measures ensure that, throughout its lifecycle—from acquisition through distribution—the data is fit for use. Master-data management is a method for an enterprise to link critical data to a common point of reference. This discipline converges IT and business partners to ensure the uniformity, accuracy, stewardship, semantic consistency, and accountability of master-data assets. Metadata management is the administration of data that describes other data. Metadata involves any information that can be integrated, accessed, shared, linked, analyzed, and maintained. This organizational agreement describes enterprise information assets. Data-protection management enables data security across the enterprise—including automation, orchestration, and document management—to control the many data-protection activities required to run an enterprise. Data strategy and diagnostics is a guide for optimizing data, removing redundant data, and simplifying the lifecycle management of data.

There are many more areas we could add to this mix including data modeling and design, data integration and interoperability, documents and control, data storage and operations… the list goes on. However, we want to start with the basics.

Together, these six elements build the foundation of a robust, data-governance program.

Data standards: sharing data we trust

Data standards communicates enterprise data-sharing frameworks. The objective is to improve trust. We can measure the trust of the data by measuring the credibility, reliability, intimacy, and self-orientation of data.

Addressing trust gets us to the primary value of establishing a standard for data sharing. Who should participate? How transparent is the data? Who can share data? How do we remove misaligned interests?

Producers and consumers of enterprise data must meet baseline standards. Additionally, trust frameworks must be tailored to producer and consumer needs. Combined, this approach raises the level of trust and decreases the risk associated with data production and consumption.

Standards for data sharing describes the ways data can be shared. It also highlights how data can be restricted or access to the data can be increased by removing restrictions in the following sequence:

  1. No awareness of the data set
  2. Awareness of the data set
  3. Awareness of data scope and data dictionary
  4. Query highly aggregated, obfuscated, or perturbed data
  5. Query lightly aggregated, obfuscated, or perturbed data
  6. Access aggregated, obfuscated, or perturbed data
  7. Access to data
  8. Ability to share data

No awareness of the data set means the existence of the data set isn’t known. Awareness of the data set makes knowledge of the data set known. Awareness of data scope and data dictionary increases knowledge to include the scope and parameters of a data set—for example, knowledge or access to the data dictionary. Query highly aggregated, obfuscated, or perturbed data enables queries on data sets, but these are highly restricted—in this case, access to a division’s or a department’s data might be removed. Query lightly aggregated, obfuscated, or perturbed data slightly widens the endpoint of data access. For example, in this case, previous access may have been to a state-based population, and this opens access to multi-state searching. Access aggregated, obfuscated, or perturbed data enables the ability to run defined logical operations and pull de-identified data. This level of access provides access to an aggregated data set; however, access to the raw data set isn’t permitted. Access to data unlocks the technical restrictions of operations that may be performed with the data, although specific access rights are usually restricted to certain individuals. The ability to share data may allow sharing to one consumer but may restrict that consumer from sharing the data with another consumer.

Data trust is validated by the enterprise standards in place to share data.

Data architecture: integrating data investments with business strategy

Data architecture guides the integration of enterprise data assets. Architecture is focused on the abstraction of the system, not the system itself. As complexity within your enterprise increases, the need and value provided by data architecture become ever more valuable.

Data architecture has a broad reach and includes many components to build a single version of the corporate “truth.” These ten pieces of enterprise data architecture will ensure the following enterprise data assets are integrated:

  1. Business entities
  2. Business relationships
  3. Data attributes
  4. Business definitions
  5. Taxonomies
  6. Conceptual and logical views
  7. Business glossary
  8. Entity lifecycle and states
  9. Reference-data values
  10. Data-quality rules

Business entities refer to the various components of the business. Business relationships identify how those entities interact and ultimately share data. Data attributes tag and identify data elements using known classifications. Business definitions clarify the intent behind the data sets. Taxonomies establish schemes and classification system for groups or attributes with similarities. Conceptual and logical views identify a high-level relationship between entities and how the data is physically represented in the database. The business glossary defines terms across domains and serves as the authoritative source for the data dictionary. The entity lifecycle and states specify where the entity is in its lifecycle from acquisition to destruction. Reference-data values are standards that can be used by other data fields. Data-quality rules are the requirements that the business sets on its data.

It’s also useful to conduct an information value-chain analysis. This process identifies matrix relationships among data, processes, organizations, roles, locations, objectives, applications, projects, and data platforms.

Talent and culture: moving people not data

When we think of data, we often think of the bits and bytes. We should be thinking about people. Talent and culture are the most difficult aspect to get right when developing a successful office of the CDO.

There are seven main areas in driving culture and getting the right folks in the right roles:

  1. Talent acquisition
  2. Performance management
  3. Competency management
  4. Learning and development
  5. Leadership development
  6. Career management
  7. Succession management

Talent acquisition is the process of finding and onboarding skilled data talent. Performance management ensures that individual activities and outputs meet organizational goals. Competency management is the process of developing the skill sets of individuals. Learning and development attempt to enhance individual performance by tuning and honing skills and knowledge. Leadership development helps to expand an individual’s capability to grow within the organization. Career management is the deliberate planning and coaching of an individual’s activities, engagements, and jobs over a lifetime. Succession management is the systematic process of developing, identifying, and grooming high-potential individuals for more aspirational roles.

By developing a talent management plan, we improve our odds of attracting, developing, motivating, and retaining high-performing employees.

Designing a world-class office of the CDO

Designing a world-class office of the CDO begins with a vision and a data strategy. We establish a strong organizational base for scalability and growth by using the pillars of governance, standards, architecture, and talent and culture.

Take time to understand the intrinsic value of data. Is your data accurate and complete? Determine the cost value of data. If you lost your data, what would it cost to replace it fully? Evaluate the business value of the data. How fit-for-purpose is this data to make data-driven decisions? Measure the performance value of your data. What parts of the data fuel key business drivers?

Designing the office of the CDO is an exciting process. Let’s hope your venture into the data business is less bumpy than Milton Hershey’s entrance into the chocolate business.

Assembling the right resources for the office of the chief data officer

Creating an office of the chief data officer is the first step in developing a data-driven culture and maximum business value.

We’ve come a long way from the first website, which was published on August 6, 1991. The Internet has over 1.94 billion websites. Over seven billion search queries a day are conducted worldwide, and over 15% of those are entered into a search box for the first time. Data is transforming how we do business and, more importantly, how we make business decisions. However, 51.8% of the traffic is solely from machine bots; the remaining 48.2% is from human traffic.

From this ongoing surge of data has emerged the chief data officer role—and, more recently, to support that role, the office of the chief data officer.

Establishing the right structure can have a positive impact on organizational transformations to drive a data-driven culture. Let’s address four questions that clarify the value of the office of the CDO:

  • What’s its purpose?
  • What are the primary office functions?
  • What resources and skills are required?
  • What are the major duties of the office?

The purpose

The CDO is an executive responsible for enabling and championing value creation for the organization through the use of data assets internally and externally. This includes governance, planning, definition, capture, usage of, and access to data and information. Generally, the CDO has accountability in three areas: data management, analytics and technology.

  • Data management captures the care protection and governance of data from establishing a strategy for designing the implementation policies for governance.
  • Analytics includes any capabilities required to analyze data to transform it into useful insights.
  • Technology covers the data architecture, infrastructure, and services for the ingestion, movement, monitoring, and storage of data.

The CDO is accountable for capturing high-quality and timely data and leveraging data assets to stakeholders. To fulfill this mission, we need to understand the purpose of this role. The role of the office of the CDO is simple: create value from data. To frame the context of the role, we’ll dive into its functions.

The functions

There are 100 ways to build a good data office, but there are only a handful of ways to build a great team. The office of the CDO needs to envision, prototype, evangelize, implement, and support existing and new data platforms. There are two broad paths that organizations can take here.

The first path is to have the office of the CDO run IT data operations. This means the CDO assumes responsibility for all database administrators and any resources that support the creation or maintenance of data assets. This could be in the form of custom systems, SaaS solutions or off-the-shelf solutions. The benefit of this approach is that the data increases in value while redundancy and cost decrease. The flip side is that day-to-day operational activities limit the focus to approaches geared to developing strategic data assets.
 
The second path is to have the office of the CDO run the IT “asset” operation. Here, we’re specifically talking about managing existing data assets and leveraging new ones. The benefit of this approach is it facilitates greater collaboration and the ability to share data assets. The disadvantage is the lack of raw-data ownership, budget limitations, and the requirement of additional, cross-functional buy-in before significant transformation can occur. Sometimes this buy-in doesn’t occur, which stifles progressive ideas that push the boundary of normal.

The resources

The resource makeup of the office of the CDO varies greatly based on employees and annual revenue so that this approach can take a number of forms. However, some common themes are observed. The variability is that one company might need one of a particular resource and another might need 100. Use your judgment to scale the primary functions based on your business demand.

Next, we’ll cover the following primary roles and the skills required:

  • Chief data officer
  • Data scientist
  • Data modeler
  • Data architect
  • Data analyst
  • Front-end designer/developer
  • Database administrator
  • Portfolio manager
  • Project manager
  • Business relationship manager

Chief data officers provide leadership on maximizing the value of data assets enterprise-wide. This role is responsible for leading the transformational change to position the organization so it’s data-driven. Driving the use of the right data at the right time, creating a data-driven culture, and leading analytics are vital. However, the most important aspect of the role is establishing and fostering organizational buy-in for the office of the CDO function as well as the future role data will have in the organization. Few leaders will argue that data is transforming business decisions and that business models are changing; the challenge is that those same leaders might not believe that your office of the CDO is the right team to do that. This is why establishing collaborations and building trust outside of IT is essential.

Data scientists help to identify opportunities to improve organizational outcomes by utilizing data, developing predictive models, and sharing stories that present new insights. There are seven major areas of significance to data scientists: data collection (web scraping, HTML, CSS), data ingestion (SQL APIs, JSON, XML), data cleansing (multiple data types), data visualizations (D3, Tableau, Spotfire), basic analysis (R, Python), data mining (variance analysis, measuring bias, feature normalization, feature selection, feature extraction, clustering analysis, association analysis) and predictive modeling (data modeler+, graph analysis, bootstrap or bagging modeling, ensemble models, Bayesian analysis, neural networks, deep learning). An effective data scientist can apply sample and survey methods, determine statistical significance, conduct outlier analysis and make data-driven decisions to identify new data-science opportunities previously undiscovered.

Data modelers use a variety of data types to build and design predictive models. To understand sampling methods and measure statistical significance, data modelers need to have much of the experience of data scientists. For example, data visualization, basic analysis, data mining and predicting models are key skills for this role.

Data architects develop linkages between systems. They need to have experience with multi-architectures and implementing complex database policies and standards. This background allows them to develop complete solutions to validate, clean-up and map data. Ensuring end-to-end data quality requires integrating data from unrelated sources. Having internal knowledge of the organization’s domains is a crucial element.

Data analysts facilitate data collection and aid in data cleansing with primitive analysis skills. Often this role is the initial drafter of organizational policies, standards, and procedures before more experienced resources assume ownership. These resources likely are familiar with R, Excel, and SQL at a high level but hit limits quickly when applying this to SQL APIs, JSON or XML applications.

Front-end designers and developers mainly focus on client-side development using technologies such as HTML, CSS, JavaScript, jQuery and RESTful service APIs. This code is executed inside the user’s browser and can extend into the UI/UX experience for users.

Database administrators specialize in software to store and organize data. Usually, this role includes capacity planning, installations, configuration, database design, data migration, performance monitoring of data, security, backup and recovery, and basic troubleshooting. This role is hands-on regarding data and, as a result, needs to be carefully managed with segregation of duties.

Portfolio managers focus on value realization from products, services, interactions, assets, and capabilities. This includes making investment decisions to balance objectives, asset allocation, and risk for optimal performance. This role aligns strategy with the bottom line to optimize delivery orchestration across the data portfolio of investments, projects, programs or activities.

Project managers lead data-related project initiatives and provide contract support to align with corporate policies. These resources work with multidisciplinary teams like legal, cloud, finance, operations and various business functions to lead projects and get them over the finish line.

Business relationship managers stimulate, surface, and shape business demand to define the full business value envisioned. This involves building credibility for the office of the CDO, establishing partnerships outside of IT to increase awareness of existing capabilities in house, and introducing new data capabilities that have force-multiplier effects for business partners.

Likely there are dozens of resources that could be pulled into a CDO team to align to organizational needs. The foremost that comes to mind are subject-matter data experts that have specific and deep domain knowledge of how your business operates.

Now that you know the critical roles to establish the office of the CDO, spend your time finding the best resources to staff your office. These resources are in high demand, so you must assume it will take longer than planned to recruit the team.

The duties

The primary responsibilities of the office of the CDO used to be focused on data governance, data quality, and compliance drivers. Today, the focus of this office is to enable a data-driven culture and maximum business value.

To exploit data to achieve a competitive advantage and establish the office as a strategic advisor, the responsibilities need to be communicated across the organization.

Leading change and championing a data-driven culture can be enabled with the following defined responsibilities:

  • Envision, design, and communicate a collaborative, enterprise-wide data strategy.
  • Establish a governance structure for managing data assets using a repeatable process and standardized frameworks.
  • Define, implement, and manage organizational data principles, data policies, data standards, and data guidelines.
  • Decrease the cost of collecting, managing, and sharing data while increasing the value.
  • Enable data-as-a-service for enterprise-wide adoption using a data-service strategy.
  • Develop data-quality measures and practices to improve organizational trust in data.
  • Manage the data portfolio to coordinate the investment prioritization of enterprise-wide data initiatives.
  • Identify opportunities for the organization to more fully leverage data for a strategic advantage.
  • Champion organizational change management for a data-driven culture.
  • Advance how enterprise-wide data assets are managed to provide deeper insights.
  • Establish policies and programs for data stewardship and custodianship for stakeholder engagement.
The office of the CDO gets the business of data onto the minds of your organization’s executives. It’s the first major step toward developing a data-driven culture. Data enablement is a change that requires shifting organizational strategies, processes, procedures, technologies and culture. Use these four tips when introducing organization-wide change, transformational change, personnel change, unplanned change or remedial change: Make it clear. Make it real. Make it happen. Make it stick.