In this example, you are working at the University. You noticed some
datasets have been mixed up and you want to check that the correct scholarship
programs have been granted to the correct students.
The data quality rule validates that if the students have the US
citizenship and their status code is 2632, then the students have been granted a
scholarship program taking effect on September 1st, 2021
and whose code ends with 10AB or 10AC.
Here is a sample of the dataset:
Procedure
Log in as a rule manager.
In the left panel, click Data quality rules > Add rule.
Enter the name: ScholarshipProgram.
Enter the description: Checking the status code
2632.
The description is optional. It helps you find a
rule when the rule names are similar.
In the If part, click
Add a row:
Select Variable and enter the name
StatusCode.
The supported characters are [a-z], [A-Z], [0-9] and special
characters: _.@$#.
Information noteNote: Data quality rules are templates. You will associate the variables to
attributes when applying the rule to a data model.
Select the operator is equal
to.
For more information on the operators, see the The operators.
Select Value and
enter 2632.
Add a row to add the subcondition: CitizenshipisUS.
Select the logical operator And.
In the Then part, add three rows:
To group the first two rows, hover over the rows and
select the check boxes on the right.
Define all fields to validate that the students have
been granted a scholarship program taking effect on September 1st, 2021 and whose code ends with 10AB or
10AC.
The Else part allows you to
define an alternative when the condition (If) cannot be fulfilled.
For this example,
leave the Else part empty. The
values that do not fulfill the condition will be categorized as
non-applicable values.