snappy

error with snappy while importing fastparquet in python

夙愿已清 提交于 2019-11-28 07:52:12
问题 I have installed installed the following modules in my EC2 server which already has python (3.6) & anaconda installed : snappy pyarrow s3fs fastparquet except fastparquet everything else works on importing. When I try to import fastparquet it throws the following error : [username@ip8 ~]$ conda -V conda 4.2.13 [username@ip-~]$ python Python 3.6.0 |Anaconda custom (64-bit)| (default, Dec 23 2016, 12:22:00) [GCC 4.4.7 20120313 (Red Hat 4.4.7-1)] on linux Type "help", "copyright", "credits" or

hadoop mapreduce: java.lang.UnsatisfiedLinkError: org.apache.hadoop.util.NativeCodeLoader.buildSupportsSnappy()Z

若如初见. 提交于 2019-11-27 16:36:39
问题 I am trying to write a snappy block compressed sequence file from a map-reduce job. I am using hadoop 2.0.0-cdh4.5.0, and snappy-java 1.0.4.1 Here is my code: package jinvestor.jhouse.mr; import java.io.ByteArrayOutputStream; import java.io.IOException; import java.io.OutputStream; import java.util.Arrays; import java.util.List; import jinvestor.jhouse.core.House; import jinvestor.jhouse.core.util.HouseAvroUtil; import jinvestor.jhouse.download.HBaseHouseDAO; import org.apache.commons.io

UnsatisfiedLinkError: /tmp/snappy-1.1.4-libsnappyjava.so Error loading shared library ld-linux-x86-64.so.2: No such file or directory

倾然丶 夕夏残阳落幕 提交于 2019-11-27 16:03:53
问题 I am trying to run a Kafka Streams application in kubernetes. When I launch the pod I get the following exception: Exception in thread "streams-pipe-e19c2d9a-d403-4944-8d26-0ef27ed5c057-StreamThread-1" java.lang.UnsatisfiedLinkError: /tmp/snappy-1.1.4-5cec5405-2ce7-4046-a8bd-922ce96534a0-libsnappyjava.so: Error loading shared library ld-linux-x86-64.so.2: No such file or directory (needed by /tmp/snappy-1.1.4-5cec5405-2ce7-4046-a8bd-922ce96534a0-libsnappyjava.so) at java.lang.ClassLoader

Hive压缩

本小妞迷上赌 提交于 2019-11-27 12:48:16
主要是对CDH6.0.1平台,Hive的压缩进行设置。 采用ORC+Snappy压缩是比较常用的格式,CDH6已经自动部署了Snappy压缩。 Hive表启用压缩 set hive.exec.compress.output=true; CREATE TABLE `virtual_payment_cp` ( `ID` bigint, `DEVICE_CODE` string COMMENT 'xx', `LOGIN_ACCOUNT` string COMMENT 'xx', `AMOUNT` decimal(11,2) COMMENT 'xx', `PAY_RESULT` int COMMENT 'xx', `CP_GAME_ID` bigint COMMENT 'xx' ) PARTITIONED BY(`DATE` STRING) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS orc tblproperties ("orc.compress"="SNAPPY"); Map阶段启用压缩 CDH -> YARN ->配置 -> mapred-site.xml -> mapred-site.xml 的 MapReduce 客户端高级配置代码段(安全阀),添加 <property><name>mapreduce.map