How to find common elements among two array columns?

前端 未结 3 2022
执笔经年
执笔经年 2021-01-25 08:49

I have two comma-separated string columns (sourceAuthors and targetAuthors).

val df = Seq(
  (\"Author1,Author2,Author3\",\"Author2,Aut         


        
3条回答
  •  夕颜
    夕颜 (楼主)
    2021-01-25 09:27

    That error means that your udf is returning unit ( no return at all, as void un Java )

    Try this. You are applying the intersect over the original s1 and S2 rather than over the splitted ones.

    def myUDF = udf((s1: String, s2: String) =>{

      val splitted1 = s1.split(",")
    
    
      val splitted2= s2.split(",")
    
    
    splitted1.intersect(splitted2).length
    

    } )

提交回复
热议问题