Running Your Pipelines in Dagster

Quick Start: Run Your First Pipeline

Method 1: Run All Assets (Recommended for First Run)

Navigate to the Assets page
Click Materialize all in the top right
Confirm the run
Monitor progress in the Runs page

Method 2: Run Specific Assets

Go to Assets page
Select assets by checking boxes
Click Materialize selected
Useful for testing individual components

Method 3: Launch a Job

Navigate to Jobs page
Find your job (e.g., meltano_dbt_pipeline)
Click Launch Run
Configure if needed (usually not required)

Understanding Your Pipeline Structure

Typical Pipeline Flow

Meltano Extractors → Raw Data → dbt Models → Analytics Tables
    (Assets)         (Assets)      (Assets)        (Assets)

Asset Dependencies

Meltano assets must complete before dbt assets
Dependencies are automatically managed
Failed upstream assets prevent downstream execution

Monitoring Your Runs

Run Status Indicators

🟢 Success: Asset materialized successfully
🔵 In Progress: Currently executing
🔴 Failed: Error occurred (check logs)
⚪ Queued: Waiting to start

Viewing Run Details

Click on any run in the Runs page
See:
- Timeline of execution
- Logs for each step
- Asset materializations
- Error messages (if any)

Understanding Logs

Meltano Logs

2024-01-15 10:30:45 [info] Running extract/load for tap-salesforce
2024-01-15 10:30:46 [info] Updated state: 2024-01-15T10:30:46
2024-01-15 10:31:15 [info] Extracted 1,543 records

dbt Logs

31:20  Running dbt...
31:25  Found 23 models, 15 tests
31:30  Completed successfully

Validating Your Migration

Checklist for First Run

All Meltano extractors complete successfully
Data lands in expected raw tables
dbt models build without errors
Row counts match expectations
Schedules show correct next run time

Comparing with Arch

Check Data Freshness

Go to Assets page
Look at "Last Materialized" timestamps
Compare with last Arch run times

Verify Row Counts

-- Run in your data warehouse
SELECT COUNT(*) FROM raw.salesforce_accounts;
SELECT COUNT(*) FROM analytics.customer_summary;

Handling Common Scenarios

Rerunning Failed Assets

Click on the failed asset
Review error in logs
Click Re-materialize after fixing issue

Running Historical Backfills

Select assets needing backfill
Click Materialize selected
Choose "Custom Run Config"
Specify date range for Meltano

Partial Pipeline Runs

Run only dbt models (skip extraction):

Select only dbt assets
Materialize selected
Useful for model development

State Management

Meltano State

Automatically managed by platform
Tracks last successful extraction
Prevents duplicate data
No manual intervention needed

Viewing State

Click on Meltano asset
Check "Metadata" tab
See last bookmark value

Scheduling Runs

View Existing Schedules

Go to Schedules page
See all migrated schedules
Check next tick time

Schedule Status

▶️ Running: Schedule is active
⏸️ Stopped: Manually paused
🔄 Refreshing: Updating schedule

Manual Schedule Control

Start/Stop: Toggle schedule on/off
Test: Run immediately regardless of schedule
Edit: Modify cron expression (requires code change)

Troubleshooting

Common Issues

"Asset not found" Error

Ensure all dependencies materialized
Check if asset definition exists
Reload definitions in UI

Meltano State Issues

DO NOT manually modify state
Contact support for state resets
Check logs for specific state errors

dbt Compilation Errors

Review dbt logs for model name
Check for missing dependencies
Validate warehouse permissions

Getting Run URLs

Click on any run
Copy URL from browser
Share with support for help

Best Practices

Daily Operations

Check Assets page for freshness
Review any failed runs
Monitor schedule execution
Validate critical metrics

Development Workflow

Test changes with single asset runs
Validate in development branch first
Monitor first production run closely
Document any configuration changes

Running Your Pipelines in Dagster

Quick Start: Run Your First Pipeline​

Method 1: Run All Assets (Recommended for First Run)​

Method 2: Run Specific Assets​

Method 3: Launch a Job​

Understanding Your Pipeline Structure​

Typical Pipeline Flow​

Asset Dependencies​

Monitoring Your Runs​

Run Status Indicators​

Viewing Run Details​

Understanding Logs​

Meltano Logs​

dbt Logs​

Validating Your Migration​

Checklist for First Run​

Comparing with Arch​

Check Data Freshness​

Verify Row Counts​

Handling Common Scenarios​

Rerunning Failed Assets​

Running Historical Backfills​

Partial Pipeline Runs​

State Management​

Meltano State​

Viewing State​

Scheduling Runs​

View Existing Schedules​

Schedule Status​

Manual Schedule Control​

Troubleshooting​

Common Issues​

"Asset not found" Error​

Meltano State Issues​

dbt Compilation Errors​

Getting Run URLs​

Best Practices​

Daily Operations​

Development Workflow​

Next Steps​

Quick Start: Run Your First Pipeline

Method 1: Run All Assets (Recommended for First Run)

Method 2: Run Specific Assets

Method 3: Launch a Job

Understanding Your Pipeline Structure

Typical Pipeline Flow

Asset Dependencies

Monitoring Your Runs

Run Status Indicators

Viewing Run Details

Understanding Logs

Meltano Logs

dbt Logs

Validating Your Migration

Checklist for First Run

Comparing with Arch

Check Data Freshness

Verify Row Counts

Handling Common Scenarios

Rerunning Failed Assets

Running Historical Backfills

Partial Pipeline Runs

State Management

Meltano State

Viewing State

Scheduling Runs

View Existing Schedules

Schedule Status

Manual Schedule Control

Troubleshooting

Common Issues

"Asset not found" Error

Meltano State Issues

dbt Compilation Errors

Getting Run URLs

Best Practices

Daily Operations

Development Workflow

Next Steps