问题
I have a Git repository which (among other things) holds Airflow DAGs in airflow directory. I have a clone of the repository besides an install directory of Airflow. airflow directory in Git is pointed to by AIRFLOW_HOME configuration variable.
I would like to allow imports from modules in the repository that are listed outside airflow folder (please see the structure below).
<repo root>
|_airflow
|_dags
|_dag.py
|_module1
|_module2
|_...
So that in dag.py I can do:
from module1 import Module1
Currently, it does not seem possible without tricks like editing sys.path explicitly which is not very elegant and has to be done in each of the dag source files...
Making an installable package out of the module1 is also out of the question.
回答1:
Re-writing conclusion from discussions here
Broadly, there are 2 possible ways
- Package your code into an Airflow plugin
Make your code discoverable to dag-definition-file(s) parsing processes by updating
PYTHONPATH. Here again we have following options(a) Update
PYTHONPATHon system level using bashrc / equivalent (once-and-for-all) or just export the updated PYTHONPATH for current bash session(b) Programmatically update sys.path in the beginning of DAG-definition file
来源:https://stackoverflow.com/questions/57676017/how-to-use-non-installable-modules-from-dag-code