Skip to main content

Introduction

Data validation lets you define rules that detect anomalies or unexpected results after a workflow run. Use it to detect missing values, outliers, and schema issues before downstream use.

Prerequisites

  • A completed workflow run with preview data (Kadoa will automatically suggest initial validation rules)

Configure validation

  1. Open the workflow in your dashboard.
  2. In the sidebar, select Issues. You will see 2 tabs there:
    • Rules: Create, view, suggest, and delete validation rules.
    • Results: Review issues found by rules for the selected run.
  3. Select the Rules tab.

Issues Rules sidebar navigation

  1. Choose one of the following:
    • Add new Rule: Manually choose a target column, define a condition, and (optionally) add domain hints for better precision.
    • Suggest Rules: Kadoa auto-suggests rules from your schema and sample data.

Rule creation dialog

Manual rules often provide the most precise results when you know the domain.

Suggested rules

Kadoa suggests validation rules in two ways:
  • Automatically: After preview run, Kadoa analyzes your data and suggests relevant rules
  • On demand: Click “AI suggest rules” to generate additional suggestions

Suggested rules with bulk actions

Suggested rules require approval before activation. Select rules using checkboxes, then approve or delete in bulk.
On-demand rule generation doesn’t yet consider existing rules and may suggest overlapping validations. Review suggestions carefully before approval.

Approving rules

Suggested rules start in PREVIEW status and must be approved before they detect validation issues. To approve rules:
  1. Navigate to the Rules tab in the Issues sidebar.
  2. Review the suggested rules in the “Preview” section - each shows the rule name, target columns, and SQL condition.
  3. Select rules to approve using the checkboxes next to each rule.
  4. Click the Approve button to activate the selected rules.
  5. Approved rules move to the “Enabled” section and will run on the next workflow execution.
Review each rule’s SQL condition before approving. You can click on a rule to see the full condition and verify it matches your data quality requirements.
Once approved, rules transition from PREVIEW to ENABLED status and begin detecting validation issues on subsequent runs.

Deleting rules

To remove rules you no longer need:
  1. Navigate to the Rules tab in the Issues sidebar.
  2. Select rules to delete using the checkboxes next to each rule.
  3. Click the Delete button to remove the selected rules.
  4. Confirm the deletion when prompted.
You can delete rules in any status (preview, enabled, or disabled).
Deleting rules is permanent. Consider disabling rules instead if you may need them later.

Disabling rules

To temporarily stop a rule from running without deleting it:
  1. Click the menu icon on the rule you want to disable.
  2. Select “Disable” from the menu.
  3. The rule moves to the “Disabled” section.
Disabled rules can be re-enabled at any time by clicking “Enable” from the rule menu.

Rule execution

  • Validation is executed at the end of each subsequent pipeline run
  • Preview rules require approval before detecting validation issues
  • Changes take effect on the next run
  • Invalid rules auto-disable when schema changes break them

Working with rules

For detailed information about rule states, SQL structure, and examples, see Data Validation Concepts.

Rule operations

  • Create manually: Select target columns and describe the rule in natural language
  • Generate suggestions: Auto-generate common rules based on data types and sample data
  • View SQL: All rules expose raw SQL for transparency
  • Bulk actions: Select multiple rules and approve or delete them at once
  • Auto-disable: Rules are automatically disabled by Kadoa when schema changes break them

Preview and active rules sections

  • Historical runs: When viewing a past run, you’ll see the rules that were in effect at that time (read-only).

Validation report

After a run finishes, go to IssuesReport.
  • See issues grouped by rule.
  • Click an issue to open the row detail and view all issues associated with that row.

Validation report overview

Results view details

  • Filter by rule: Use the filter to focus on specific rules when multiple are present.
  • Row details: Click an item to open the row and see the offending value and related context.

Row detail view with issues

  • Rule states:
    • PREVIEW: suggested rule awaiting review and approval
    • ENABLED: active rule generating validation issues
    • DISABLED: inactive rule
  • Issue status indicators:
    • NEW: first time the issue appears
    • RESOLVED: issue no longer present
  • Summary chips show change since previous run:
    • +n new issues, –n resolved

Key fields (optional)

Defining key fields lets Kadoa track rows across runs for richer insights.
  • Configure in Schema.
  • Pick one or more fields used to match the same row across runs.
  • Requirements: values should be present for most rows and unique per row (no duplicates).

How to pick key fields

  • Prefer stable identifiers (e.g., product ID, URL, SKU).
  • If a row cannot be matched via the key, it is treated as a new row.

Key‑based insights

When key fields are set, the report shows change indicators between runs:
  • +n: new issues discovered since the previous run
  • -n: issues resolved since the previous run
Individual issues are labeled as “new” or “resolved” when applicable.

Validate now

Use the Validate now button to schedule validation for the current workflow’s latest data. This is available when no specific past run is selected.