Why flink container vcore size is always 1

后端 未结 3 1735
借酒劲吻你
借酒劲吻你 2021-01-16 00:14

I am running flink on yarn(more precisely in AWS EMR yarn cluster).

I read flink document and source code that by default for each task manager container, flink will

相关标签:
3条回答
  • 2021-01-16 00:36

    I get the answer finally. It's because yarn is use "DefaultResourceCalculator" allocation strategy, so only memory is counted for yarn RM, even if flink requested 3 vcores, but yarn simply ignore the cpu core number.

    0 讨论(0)
  • 2021-01-16 00:37

    An answer from Kien Truong

    Hi,

    You have to enable CPU scheduling in YARN, otherwise, it always shows that only 1 CPU is allocated for each container, regardless of how many Flink try to allocate. So you should add (edit) the following property in capacity-scheduler.xml:

    <property>
     <name>yarn.scheduler.capacity.resource-calculator</name>
     <!-- <value>org.apache.hadoop.yarn.util.resource.DefaultResourceCalculator</value> -->
     <value>org.apache.hadoop.yarn.util.resource.DominantResourceCalculator</value>
    </property>
    

    TaskManager memory is, for example, 1400MB, but Flink reserves some amount for off-heap memory, so the actual heap size is smaller.

    This is controlled by 2 settings:

    containerized.heap-cutoff-min: default 600MB
    
    containerized.heap-cutoff-ratio: default 15% of TM's memory
    

    That's why your TM's heap size is limitted to ~800MB (1400 - 600)

    Regards,

    Kien

    0 讨论(0)
  • 2021-01-16 00:50

    @yinhua.

    Use the command to start a session:./bin/yarn-session.sh, you need add -s arg.

    -s,--slots Number of slots per TaskManager

    details:

    1. https://ci.apache.org/projects/flink/flink-docs-release-1.4/ops/deployment/yarn_setup.html
    2. https://ci.apache.org/projects/flink/flink-docs-release-1.4/ops/cli.html#usage
    0 讨论(0)
提交回复
热议问题