Is Data Quality (DQ) synonymous with Data Governance (DG)? Why does it matter? On the surface the two seem alike, are related and more often than not co-exist and sound synonymous.
Improved Data Quality is a key desired outcome from the implementation of Data Governance policies whereas Data Governance is the broader, strategic, enterprise vision of recognizing and managing data as a valued enterprise asset.
One strong theme that is fairly evident is that the skill-sets required for any successful implementation of Data Quality or Data Governance initiative overlap and are very similar. Yet despite all the similarities and closely coupled existence there are some variations both subtle and distinct to classify them as their own independent and distinct domains.
Let’s start with their definitions:
Data Quality (DQ) is defined as the perception or assessment of fitness of data within a given context, meaning it is a measurement of data - the degree to which information consistently meets the expectations and requirements of the users/workers using it to perform their jobs. It is predominantly executed as an IT specialty and managed by those proficient in specialist data quality and data management tools.
Data Governance (DG) refers to the policies and procedures geared towards the overall management of usability, availability, integrity and security of data. In other words it is about a function owned by the business and executed by the business stewards in order to recognize and unlock the value of data to an enterprise and manage it as an enterprise asset.
If this is insufficient to conclude the case to separate them, then their core concepts, competencies and dimensions should be examined.
Data Quality is measured against six dimensions in order to gauge its fitness:
The key distinction here is that the quality of data exists solely in the eyes of the business based on the value they perceive with respect to meeting their needs. Data that is not required to support any business processes, or required to make decisions, or useful in trend-analysis is irrelevant. Even if that data is accurate and has inherent quality, it has no value to the enterprise. The more data can be shared across the organization the more valuable it becomes. Increasing the value of data within the enterprise, is the primary goal for data governance.
Despite the distinctions, there are more synergies and relationships between the two as both work towards increasing the usability and sharing of data. In order for any governance or quality initiative to succeed it must:
The soft-skills really spell the difference between success and failure for both of these initiatives. These include the wider and clear understanding of the challenges faced by business and what are drivers as well as how governance and quality can improve or enhance it. Also key is having passion for it as well as leadership, diplomacy, tact, negotiation skills, communication skills, ability to simplify the problem.
Data Quality can be approached independently on its own, in some cases all that is required is to add a data quality tool with good data quality processes and monitoring. Or alternatively it can be undertaken as the first precursor towards an MDM initiative. Some organizations simply are not ready to implement Master Data Management or it is undertaken over long time frame of 2 to 5 years. In such situations they will unlock value by undertaking a shorter DQ effort. The other common use case is implementing DQ as part of Master Data Management project. Irrespective of the approach, improving quality in one area does not automatically make that data sharable across the enterprise. MDM is a first step towards making the data usable and sharable across the enterprise.
Organizations are moving towards consolidation of environments, applications and data in order to:
Unlike other assets such as money or equipment which are finite and are consumed as you use them, data assets become more valuable the more you use them. The moment that data is shared across business groups, it becomes an enterprise asset that must be governed in order to protect it and maximize its value to the organization as well as enable the organization to reduce costs in maintaining it. Data Quality initiatives have a greater impact and return when it is mandated through the Data Governance entity. Enterprise Risk Management and Compliance are becoming hot button items for every organization and are very data-centric and can only be managed thru an enterprise Governance program.
Governance is inherently an integration challenge and for it to be successful it must me more than a collection of data quality initiatives. A key component for long-term success and efficiency is the establishment of a governance framework and an optimal governance model. The framework and model should clearly establish the operating model and process for how data should be governed and establish priorities linking scope to pain points.
While there are several variations of the governance structure/office, a common theme is to base it on the segregation of responsibilities and functions it is tasked to perform:
Strategic: Define, Agree and Approve
Tactical:
Execution:
A sample Governance model may look something like this:
Fig 1: Sample model of Data Governance
Governance is a journey and will take time, patience and lot of fortitude. Adoption of data governance progresses in varying stages. First, start out small with an initial individual implementation and slowly build and extend to a program level implementation until cross-functional implementations can be coordinated across the enterprise.
Adoption of Governance is linked with the maturity of Governance within an organization. Fig 2 below is an illustration of the maturity levels:
Figure 2: Increasing Value of Data as the Information Management Capability Matures
Adoption will typically occur in three distinct phases:
Figure 3: Data Governance Adoption Phases
As noted in the above figures, governance may start out initially as data quality initiatives. As it matures it will extend and enhance business functions and capabilities across the enterprise. An organization that reaches it’s target state maturity of level 5 “Embedded” will have addressed the following objectives and capabilities:
Once an organization has reached its optimal maturity level, what’s next? Business is continuously evolving and changing. With every evolution and following best practices, organizations will need to review and revise the policies and procedures. The revised policies may call for enhancements or introduction of new capabilities. A fully mature “Embedded” governance organization will find greater success in introducing and implementing new functions while minimizing the risks and costs that typically derails such projects.
So does a focus on data quality confuse or complicate the message related to data governance. While data quality and data governance are closely interlinked and very tightly coupled, they are not the same thing. Data governance has a far reaching enterprise objective. Data quality programs are part of the overall governance goals and objectives. Data quality is implemented through technology supporting data governance objectives for the enterprise.