Convert PySpark dataframe column from list to string

后端 未结 2 1966
悲哀的现实
悲哀的现实 2021-01-01 17:44

I have this PySpark dataframe

+-----------+--------------------+
|uuid       |   test_123         |    
+-----------+--------------------+
|      1    |[test         


        
2条回答
  •  囚心锁ツ
    2021-01-01 18:24

    While you can use a UserDefinedFunction it is very inefficient. Instead it is better to use concat_ws function:

    from pyspark.sql.functions import concat_ws
    
    df.withColumn("test_123", concat_ws(",", "test_123")).show()
    
    +----+----------------+
    |uuid|        test_123|
    +----+----------------+
    |   1|test,test2,test3|
    |   2|test4,test,test6|
    |   3|test6,test9,t55o|
    +----+----------------+
    

提交回复
热议问题