pyspark - What additional preprocessing steps should I take to create a Spark DataFrame?

南旧 2021-01-15 07:54

I started with a few dozen PDF files and I've extracted the text from each one by looping through the following:

import pdfplumber

def get_text(file):
  with          
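
For illustration only, here is a minimal sketch of one possible preprocessing step before building the DataFrame. It assumes the extracted text has already been collected into a dict mapping file path to text; the body of get_text is not shown above, so it is not reproduced here. The helper names (clean_text, build_rows, to_spark_df), the column names, and the sample data are all hypothetical. It normalizes whitespace, drops files with no extractable text, and hands a list of (filename, text) tuples to SparkSession.createDataFrame.

```python
from pathlib import Path


def clean_text(raw):
    """Collapse runs of whitespace (including newlines) into single spaces."""
    return " ".join((raw or "").split())


def build_rows(texts_by_file):
    """Turn {path: extracted_text} into (filename, text) tuples, skipping empty documents."""
    rows = []
    for path, raw in sorted(texts_by_file.items()):
        text = clean_text(raw)
        if text:
            rows.append((Path(path).name, text))
    return rows


def to_spark_df(rows):
    """Build a two-column Spark DataFrame from the rows.

    pyspark is imported lazily so the helpers above can be used without it.
    """
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("pdf_text").getOrCreate()
    return spark.createDataFrame(rows, schema=["filename", "text"])


# Hypothetical sample standing in for the text extracted from each PDF.
sample = {"docs/a.pdf": "First  page\nsecond line", "docs/b.pdf": None}
rows = build_rows(sample)
```

With pyspark installed, to_spark_df(rows) would then give one row per non-empty PDF; whether further steps (tokenizing, splitting by page) are needed depends on what the DataFrame is for.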


        