Plotting huge data files in R?

前端 未结 5 748
独厮守ぢ
独厮守ぢ 2021-02-09 00:54

I have a input file that has about 20 million lines. The size of the file is about 1.2 G. Is there anyway I can plot the data in R. Some of the columns have categories, most

5条回答
  •  -上瘾入骨i
    2021-02-09 01:36

    The package hexbin to plot hexbins instead of scatterplots for pairs of variables as suggested by Ben Bolker in Speed up plot() function for large dataset worked for me for 2 million records fairly with 4GB RAM. But it failed for 200 million records/rows for same set of variables. I tried reducing the bin size to adjust computation time vs. RAM usage but it did not help.

    For 20 million records, you can try out hexbins with xbins = 20,30,40 to start with.

提交回复
热议问题