I try to solves following action in "Apach Spark Built-in SQL API" first or pyspark-sql if less ressource intensive.
Contexte: I have a set of files