Skip to main content
HomeCheat sheetsData Analysis

Data Quality Dimensions Cheat Sheet

In this cheat sheet, you'll learn about data quality dimensions, allowing you to ensure that your data is fit for purpose.
Mar 2023  · 3 min read

Data Quality Dimensions.png

Have this cheat sheet at your fingertips

Download PDF

What are Data Quality Dimensions?

Data Quality is a measurement of the degree to which data is fit for purpose. Good data quality generates trust in data. Data Quality Dimensions are a measurement of a specific attribute of a data's quality.

Completeness

Completeness measures the degree to which all expected records in a dataset are present. At a data element level, completeness is the degree to which all records have data populated when expected.

Group 427.png

Completeness Example

All records must have a value populated in the CustomerName field.

Group 409.png

Validity

Validity measures the degree to which the values in a data element are valid.

Group 428.png

Validity Example

  • CustomerBirthDate value must be a date in the past.
  • CustomerAccountType value must be either Loan or Deposit.
  • LatestAccountOpenDate value must be a date in the past.

Group 409 (1).png

Uniqueness

Uniqueness measures the degree to which the records in a dataset are not duplicated.

Group 2127.png

Uniqueness Example

All records must have a unique CustomerID and CustomerName.

Group 409 (2).png

Timeliness

Timeliness is the degree to which a dataset is available when expected and depends on service level agreements being set up between technical and business resources.

Group 2128.png

Timeliness Example

All records in the customer dataset must be loaded by the 9:00 am.

Group 2129.png

Consistency

Consistency is a data quality dimension that measures the degree to which data is the same across all instances of the data. Consistency can be measured by setting a threshold for how much difference there can be between two datasets.

Group 416 (1).png

Consistency Example

The count of records loaded today must be within +/- 5% of the count of records loaded yesterday.

Group 418.png

The count of records loaded today must be within +/- 5% of the count of records loaded yesterday.

Group 419.png

Accuracy

All records in the Customer Table must have accurate Customer Name, Customer Birthdate, and Customer Address fields when compared to the Tax Form.

Group 2127.jpg

Accuracy Example

All records in the Customer Table must have accurate Customer Name, Customer Birthdate, and Customer Address fields when compared to the Tax Form.

Screenshot 2023-02-17 at 11.39 1.png

Group 422.png

Topics
Related

blog

The 4 Best Data Analytics Bootcamps in 2024

Discover the best data analytics bootcamps in 2024, discussing what they are, how to choose the best bootcamp, and you can learn.

Kevin Babitz

5 min

blog

A Guide to Corporate Data Analytics Training

Understand the importance of corporate data analytics training in driving business success. Learn about key building blocks and steps to launch an effective training initiative tailored to your organization's needs.

Kevin Babitz

6 min

podcast

[Radar Recap] From Data Governance to Data Discoverability: Building Trust in Data Within Your Organization with Esther Munyi, Amy Grace, Stefaan Verhulst and Malarvizhi Veerappan

Esther Munyi, Amy Grace, Stefaan Verhulst and Malarvizhi Veerappan focus on strategies for improving data quality, fostering a culture of trust around data, and balancing robust governance with the need for accessible, high-quality data.
Richie Cotton's photo

Richie Cotton

39 min

podcast

[Radar Recap] Scaling Data ROI: Driving Analytics Adoption Within Your Organization with Laura Gent Felker, Omar Khawaja and Tiffany Perkins-Munn

Laura, Omar and Tiffany explore best practices when it comes to scaling analytics adoption within the wider organization
Richie Cotton's photo

Richie Cotton

40 min

code-along

A Beginner's Guide to Data Analysis with SQL

In this session, DataCamp's VP of Media Adel Nehme & co-host of the DataFramed podcast, shows you how to get started with SQL.
Adel Nehme's photo

Adel Nehme

code-along

Getting Started With Data Analysis in Alteryx Cloud

In this session, you'll learn how to get started with the Alteryx AI Platform by performing data analysis using Alteryx Designer Cloud.
Joshua Burkhow's photo

Joshua Burkhow

See MoreSee More