I have the following PySpark DataFrame df:
df
df.printSchema()
 |-- yearday: integer (nullable = true)
 |-- month: integer (nullable = true)
 |--