I do a lot of data analysis in Perl, and I am trying to replicate this work in Python using pandas, numpy, matplotlib, etc.
The general workflow goes as follows:
To glob your files, use the built-in glob module in Python.
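As a rough sketch of the globbing step, something like the following should work. The helper name find_csv_files and the file names are made up for illustration, and a temporary directory stands in for your real data directory.

```python
import glob
import os
import tempfile


def find_csv_files(directory):
    """Return a sorted list of paths to the .csv files directly inside directory."""
    pattern = os.path.join(directory, "*.csv")
    # glob.glob returns matches in arbitrary order, so sort for reproducibility.
    return sorted(glob.glob(pattern))


# Demo: create a few empty files (hypothetical names) and glob them.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("run_b.csv", "run_a.csv", "notes.txt"):
        open(os.path.join(tmp, name), "w").close()
    found = [os.path.basename(path) for path in find_csv_files(tmp)]

print(found)
```

Only the two .csv files are matched; notes.txt is skipped because it does not fit the pattern.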
To read your CSV files after globbing them, use the read_csv function. You can import it with from pandas.io.parsers import read_csv, and it is also exposed at the top level as pandas.read_csv.
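A minimal sketch of the reading step, assuming every file shares the same header. The column names (sample, time, value) and the helper load_frames are invented for the example, and in-memory buffers stand in for the paths you would get from glob.

```python
import io

import pandas as pd


def load_frames(sources):
    """Read each CSV source (path or file-like object) and stack the rows."""
    frames = [pd.read_csv(source) for source in sources]
    # ignore_index=True renumbers the rows 0..n-1 across all inputs.
    return pd.concat(frames, ignore_index=True)


# Two in-memory "files" with hypothetical columns.
file_one = io.StringIO("sample,time,value\nA,0,1.5\nA,1,2.5\n")
file_two = io.StringIO("sample,time,value\nB,0,3.0\n")

df = load_frames([file_one, file_two])
print(df)
```

With real data you would pass the list of globbed paths to load_frames instead of the StringIO objects.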
As for the MultiIndex feature: once read_csv has given you a pandas DataFrame, you can build a MultiIndex on it to organize your data and slice it any way you want.
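To illustrate the MultiIndex idea, here is a small sketch with made-up data. The column names sample and time are assumptions, so substitute whichever columns identify your rows.

```python
import pandas as pd

# Hypothetical data, as it might look after reading the CSV files.
df = pd.DataFrame(
    {
        "sample": ["A", "A", "B", "B"],
        "time": [0, 1, 0, 1],
        "value": [1.5, 2.5, 3.0, 4.0],
    }
)

# Promote two columns to a two-level index; sorting keeps lookups well-behaved.
indexed = df.set_index(["sample", "time"]).sort_index()

# All rows for sample "A" (the result is indexed by time only).
sample_a = indexed.loc["A"]

# A single cell, addressed by the full (sample, time) key.
one_value = indexed.loc[("B", 1), "value"]

# A cross-section on the inner level: every sample at time 0.
time_zero = indexed.xs(0, level="time")

print(sample_a)
print(one_value)
print(time_zero)
```

Here loc handles selection from the outer level inward, while xs is a convenient way to slice on an inner level without reordering the index.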
Three pertinent links for your reference:
MultiIndex dataframes in pandas: "understanding MultiIndex" and "Benefits of panda's multiindex?"
glob in a directory to grab and manipulate your files: "extract values/renaming filename in python"