Pyspark : select specific column with its position

前端 未结 2 734
时光取名叫无心
时光取名叫无心 2021-01-18 08:42

I would like to know how to select a specific column with its number but not with its name in a dataframe ?

Like this in Pandas:

df = df.iloc[:,2]
         


        
2条回答
  •  一向
    一向 (楼主)
    2021-01-18 09:27

    Same solution as mirkhosro:

    For a dataframe df, you can select the column n using df[n], where n is the index of the column.

    Example:

    df = df.filter(df[3]!=0)
    

    will remove the rows of df, where the value in the fourth column is 0.

    Note that you can check the columns using df.printSchema()

提交回复
热议问题