问题
- Can any one explain how to rectify this error
- Where do i get a detailed info of dask
- Can it replace pandas.
- How is it different from other dataframes, is it fast in processing
Code:
import dask.dataframe as dd
df = dd.demo.make_timeseries('2000-01-01', '2000-12-31', freq='10s', partition_freq='1M',dtypes={'name': str, 'id': int, 'x': float, 'y': float})
print df
o/p:
Traceback (most recent call last): File "C:/Users/divya.nagandla/PycharmProjects/python/supressions1/dask.py", line 1, in import dask.dataframe as dd File "C:\Users\divya.nagandla\PycharmProjects\python\supressions1\dask.py", line 1, in
import dask.dataframe as dd ImportError: No module named dataframe
回答1:
pip install dask[dataframe]
you can also install complete dask package
pip install dask[complete]
installation guide
for more light on dask follow below links
https://dask.pydata.org/en/latest/_downloads/daskcheatsheet.pdf
Details info on dask https://github.com/dask/dask-tutorial
来源:https://stackoverflow.com/questions/50461126/what-is-dask-and-how-is-it-different-from-pandas