发表新帖

发表新帖

When you create an external table in Hive with an S3 location is the data transfered?

后端未结

关注

 2  1567

不思量自难忘°

When you create an external table in Hive (on Hadoop) with an Amazon S3 source location is the data transfered to the local Hadoop HDFS on:

external table creation

相关标签:

2条回答

暖寄归人

2021-02-09 11:17

Map tasks will read the data directly from S3. Between the Map and Reduce steps, data will be written to the local filesystem, and between mapreduce jobs (in queries that require multiple jobs) the temporary data will be written to HDFS.

If you are concerned about S3 read costs, it might make sense to create another table that is stored on HDFS, and do a one-time copy from the S3 table to the HDFS table.

0 讨论(0)
发布评论:

提交评论
- 加载中...
旧时难觅i

2021-02-09 11:19

The data is transferred to your hadoop nodes when queries (MR Jobs) access the data.
Create external table only change Hive metadata and never move actual data.

0 讨论(0)
发布评论:

提交评论
- 加载中...

热议问题