I am trying to populate a PySpark DataFrame from S3 in a single read by passing a path_list of Parquet files.
path_list = ['s3a://bucket/folder/yy
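
For reference, here is a minimal sketch of the kind of call I mean. The helper name `read_parquet_paths` is hypothetical, and it assumes an existing SparkSession (`spark`) already configured for `s3a://` access. It relies on `DataFrameReader.parquet` accepting several paths as positional arguments, so the list is unpacked with `*`.

```python
def read_parquet_paths(spark, path_list):
    """Read every Parquet path in path_list into a single DataFrame.

    `spark` is assumed to be an existing SparkSession; this helper is an
    illustrative sketch, not part of the PySpark API.
    """
    if not path_list:
        # Fail early with a clear message instead of calling Spark with no paths.
        raise ValueError("path_list is empty")
    # DataFrameReader.parquet takes multiple paths as positional arguments,
    # so unpack the list rather than passing it as one argument.
    return spark.read.parquet(*path_list)
```

Usage would be something like `df = read_parquet_paths(spark, path_list)`, which hands all the paths to Spark in one read rather than loading each file separately and unioning the results.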