Following a rigorous methodology is key to delivering customer satisfaction and expanding analytics use cases across the business.
Before beginning a journey to address and remediate data quality issues it is important to have a grasp of the size and scope of the problem. Without a good understanding of the level and depth of data quality issues the project could be drastically underfunded and poorly planned. Performing a data quality audit can provide an initial assessment that can be a key input to the planning and subsequent work to address the issues uncovered.
Data Quality is a key factor in many Data Management projects. The quality of the proposed project source data, in terms of both its structure and content, is a key determinant of the specifics of the business scope and of the success of the project in general.
Problems with the data content must be communicated to senior project personnel as soon as they are discovered. Poor data quality can impede the proper execution of later steps in the project, such as data transformation and load operations, and can also compromise the business ability to generate a return on the project investment. This is compounded by the fact that most businesses underestimate the extent of their data quality problems. There is little point in performing a data warehouse, migration, integration, master data management, artificial intelligence, or business intelligence project if the underlying data is in bad shape.
The Data Quality Audit is designed to analyze representative samples of the source data and discover their data quality characteristics so that these can be articulated to all relevant project personnel. The project leaders can then decide what actions, if any, are necessary to correct data quality issues and ensure that the successful completion of the project is not in jeopardy.
The Data Quality Audit can typically be conducted very quickly, but the actual time required is determined by the starting condition of the data and the success criteria defined at the beginning of the audit. The main steps are as follow:
Two important aspects of the audit are (1) the data quality criteria used, and (2) the type of report generated.
Any number and type of criteria can be defined for data quality. However, there are six standard criteria:
This list is not absolute; the characteristics above are sometimes described with other terminology, such as redundancy or timeliness. Every organization’s data needs are different, and the prevalence and relative priority of data quality issues differ from one organization and one project to the next. Note that the accuracy factor differs from the other five factors in the following respect: whereas, for example, a pair of duplicate records may be visible to the naked eye, it can be difficult to tell simply by eye-balling if a given data record is inaccurate. Accuracy can be determined by applying fuzzy logic to the data or by validating the records against a verified reference data set.
Success
Link Copied to Clipboard