I setup the dag from the https://airflow.apache.org/tutorial.html as is, the only change being that I have set the dag to run at an interval of 5 minutes with a start date of 20
They are not separate instances. You can see this:
In Tree View, the start/end dates and duration of both circles will be exactly the same.
In Gantt view, you will see the duration for only a single instance of print_date
.
In general, you can't map a DAG to a tree view without duplicating nodes like they've done.
1.Yes, they are separate tasks. To make sure that there is only one print_date
, you can do:
t1 >> t2 >> t3
instead of
t2.set_upstream(t1)
t3.set_upstream(t1)
You can change the order as per your workflow.
2.On my machine, those green dots display the time of the scheduled run in UTC. Are you sure that's not in your database timezone?