Datablast Quickstart Guide
Welcome to the Datablast Quickstart! In ≈10 minutes you will:
- Prepare a repository for Datablast Scheduler
- Define your first pipeline
- Add SQL & Python tasks
- Configure notifications
- Deploy and monitor the run
1. Prepare the repo structure
Section titled “1. Prepare the repo structure”Create a Git repository with the following layout. Datablast automatically scans it to discover assets.
- pipeline.yml
Directorytasks/
- sample_task.sql
pipeline.yml– high-level schedule & configtasks/– contains all SQL, Python or YAML tasks
More details: Project Structure
2. Define a pipeline
Section titled “2. Define a pipeline”pipeline.yml
schedule: "0 3 * * *" # UTC cronnotifications: slack: - name: demo-notifications connection: "demo-slack" failure: ":red_circle: Pipeline has failed!"This pipeline will run daily at 03:00 UTC and post a Slack message on failures.
Learn every field in Pipeline Config.
3. Add tasks
Section titled “3. Add tasks”SQL task
Section titled “SQL task”Create tasks/daily_orders.sql:
-- @blast.name: marts.daily_orders-- @blast.type: bq.sql
SELECT *FROM `raw.orders`WHERE status = 'paid';Python task
Section titled “Python task”tasks/python/cleanup.py:
# @blast.type: python
def run(**kwargs): print(Cleaning temporary files …)Explore task options: SQL Tasks · Python Tasks
4. Configure notifications
Section titled “4. Configure notifications”Datablast ships with Slack & Discord integrations.
Add a Slack web-hook ID to your pipeline.
Full guide: Notifications
5. Deploy & monitor
Section titled “5. Deploy & monitor”- Push your repository to Git (GitHub/GitLab/Bitbucket).
- Datablast detects changes & schedules the pipeline.
- Watch runs in the Datablast UI.
Next steps
Section titled “Next steps”- Browse more guides in the Guides section.
- Need help? Contact us → datablast.io