Running docker operator from Google Cloud Composer

后端 未结 1 805
礼貌的吻别
礼貌的吻别 2021-01-02 23:06

As for the documentation, Google Cloud Composer airflow worker nodes are served from a dedicated kubernetes cluster:

I have a Docker contained ETL step that

相关标签:
1条回答
  • 2021-01-02 23:26

    Google Cloud Composer has just recently released into General Availability, and with that you are now able to use a KubernetesPodOperator to launch pods into the same GKE cluster that the managed airflow uses.

    Make sure your Composer environment is at least 1.0.0

    An example operator:

    import datetime
    
    from airflow import models
    from airflow.contrib.operators import kubernetes_pod_operator
    
    with models.DAG(
        dag_id='composer_sample_kubernetes_pod',
        schedule_interval=datetime.timedelta(days=1),
        start_date=YESTERDAY) as dag:
    # Only name, namespace, image, and task_id are required to create a
    # KubernetesPodOperator. In Cloud Composer, currently the operator defaults
    # to using the config file found at `/home/airflow/composer_kube_config if
    # no `config_file` parameter is specified. By default it will contain the
    # credentials for Cloud Composer's Google Kubernetes Engine cluster that is
    # created upon environment creation.
    kubernetes_min_pod = kubernetes_pod_operator.KubernetesPodOperator(
        # The ID specified for the task.
        task_id='pod-ex-minimum',
        # Name of task you want to run, used to generate Pod ID.
        name='pod-ex-minimum',
        # The namespace to run within Kubernetes, default namespace is
        # `default`. There is the potential for the resource starvation of
        # Airflow workers and scheduler within the Cloud Composer environment,
        # the recommended solution is to increase the amount of nodes in order
        # to satisfy the computing requirements. Alternatively, launching pods
        # into a custom namespace will stop fighting over resources.
        namespace='default',
        # Docker image specified. Defaults to hub.docker.com, but any fully
        # qualified URLs will point to a custom repository. Supports private
        # gcr.io images if the Composer Environment is under the same
        # project-id as the gcr.io images.
        image='gcr.io/gcp-runtimes/ubuntu_16_0_4')
    

    Additional resources:

    • Kubernetes' `KubernetesPodOperator docs
    • MoreKubernetesPodOperator examples.
    • KubernetesPodOperator Airflow code
    0 讨论(0)
提交回复
热议问题