Running Your Pipelines in Dagster
Quick Start: Run Your First Pipeline
Method 1: Run All Assets (Recommended for First Run)
- Navigate to the Assets page
- Click Materialize all in the top right
- Confirm the run
- Monitor progress in the Runs page
Method 2: Run Specific Assets
- Go to Assets page
- Select assets by checking boxes
- Click Materialize selected
- Useful for testing individual components
Method 3: Launch a Job
- Navigate to Jobs page
- Find your job (e.g.,
meltano_dbt_pipeline
) - Click Launch Run
- Configure if needed (usually not required)
Understanding Your Pipeline Structure
Typical Pipeline Flow
Meltano Extractors → Raw Data → dbt Models → Analytics Tables
(Assets) (Assets) (Assets) (Assets)
Asset Dependencies
- Meltano assets must complete before dbt assets
- Dependencies are automatically managed
- Failed upstream assets prevent downstream execution
Monitoring Your Runs
Run Status Indicators
- 🟢 Success: Asset materialized successfully
- 🔵 In Progress: Currently executing
- 🔴 Failed: Error occurred (check logs)
- ⚪ Queued: Waiting to start
Viewing Run Details
- Click on any run in the Runs page
- See:
- Timeline of execution
- Logs for each step
- Asset materializations
- Error messages (if any)
Understanding Logs
Meltano Logs
2024-01-15 10:30:45 [info] Running extract/load for tap-salesforce
2024-01-15 10:30:46 [info] Updated state: 2024-01-15T10:30:46
2024-01-15 10:31:15 [info] Extracted 1,543 records
dbt Logs
10:31:20 Running dbt...
10:31:25 Found 23 models, 15 tests
10:31:30 Completed successfully
Validating Your Migration
Checklist for First Run
- All Meltano extractors complete successfully
- Data lands in expected raw tables
- dbt models build without errors
- Row counts match expectations
- Schedules show correct next run time
Comparing with Arch
Check Data Freshness
- Go to Assets page
- Look at "Last Materialized" timestamps
- Compare with last Arch run times
Verify Row Counts
-- Run in your data warehouse
SELECT COUNT(*) FROM raw.salesforce_accounts;
SELECT COUNT(*) FROM analytics.customer_summary;
Handling Common Scenarios
Rerunning Failed Assets
- Click on the failed asset
- Review error in logs
- Click Re-materialize after fixing issue
Running Historical Backfills
- Select assets needing backfill
- Click Materialize selected
- Choose "Custom Run Config"
- Specify date range for Meltano
Partial Pipeline Runs
Run only dbt models (skip extraction):
- Select only dbt assets
- Materialize selected
- Useful for model development
State Management
Meltano State
- Automatically managed by platform
- Tracks last successful extraction
- Prevents duplicate data
- No manual intervention needed
Viewing State
- Click on Meltano asset
- Check "Metadata" tab
- See last bookmark value
Scheduling Runs
View Existing Schedules
- Go to Schedules page
- See all migrated schedules
- Check next tick time
Schedule Status
- ▶️ Running: Schedule is active
- ⏸️ Stopped: Manually paused
- 🔄 Refreshing: Updating schedule
Manual Schedule Control
- Start/Stop: Toggle schedule on/off
- Test: Run immediately regardless of schedule
- Edit: Modify cron expression (requires code change)
Troubleshooting
Common Issues
"Asset not found" Error
- Ensure all dependencies materialized
- Check if asset definition exists
- Reload definitions in UI
Meltano State Issues
- DO NOT manually modify state
- Contact support for state resets
- Check logs for specific state errors
dbt Compilation Errors
- Review dbt logs for model name
- Check for missing dependencies
- Validate warehouse permissions
Getting Run URLs
- Click on any run
- Copy URL from browser
- Share with support for help
Best Practices
Daily Operations
- Check Assets page for freshness
- Review any failed runs
- Monitor schedule execution
- Validate critical metrics
Development Workflow
- Test changes with single asset runs
- Validate in development branch first
- Monitor first production run closely
- Document any configuration changes