| copyright |
|
||
|---|---|---|---|
| lastupdated | 2026-05-14 | ||
| keywords | |||
| subcollection | watsonx-bi |
{{site.data.keyword.attribute-definition-list}}
{: #reference_data}
Reference datasets define standard values for specific types of data to classify data and measure consistency. {: #shortdesc}
Reference datasets act as lookup tables that map codes and values. Some reference datasets are standardized by organizations, such as the International Organization for Standardization (ISO). Reference data can be hierarchical or mapped across related sets.
Reference data helps you, for example, define a standard set of values for certain fields. It can be useful to create a standard definition of country codes and use this reference data to ensure that country code fields comply. Different designations such as “US”, “USA”, “United States”, and "America" can all be resolved to the same reference data value. As a result you can get much more consistent data.
The predefined reference datasets in {{site.data.keyword.wxbia_short}} provide physical and sovereign locations and can be found in the Locations category:
-
Physical locations: Physical location is the geographical location of the data asset.
-
Sovereign locations: Sovereign location is the governing body that has jurisdiction over the data asset.
An example of a physical location is Tokyo and its sovereign location is Japan.
{: #create_reference}
A reference dataset consists of a number of reference data values, where each reference data value must at least have a code and its value defined.
When you design a reference dataset, you need to decide what format of values to use, which code-value pairs constitute the set, and if the set is related to any other existing sets. You can import the existing reference datasets and modify them to suit your needs, or create a new reference dataset manually.
To create reference data, you must have user permission for Access governance artifacts. Additionally, you must have one of these category collaborator roles in the primary category for reference data:
- Admin
- Owner
- Editor
- A custom role with the permission to create reference datasets
To create reference data:
-
From the Navigation Menu, open Governance > Reference data.
-
Click Add reference data set > New reference data set.
-
Enter the details for the reference data set. Note, reference data sets names must be unique within a category.
-
Under Add Columns, create the columns for the reference data set. If you didn't upload a CSV file, you need to provide details for each column.
-
Review the information under Review and click Create.
You can also import reference data from a CSV file.
-
From the Navigation Menu, open Governance > Reference data.
-
Click Add reference data set > Import from file.
-
Upload the file and choose a method for merging imported and existing artifacts.
For more information, see the related IBM watsonx.data intelligence topic, Reference data{: external}.