Convert tab-delimited txt file into a csv file using Python

臣服心动 2020-11-30 02:51

So I want to convert a simple tab-delimited text file into a csv file. If I convert the txt file into a string using string.split('\n') I get a list with each list item a

3 answers
  • 2020-11-30 03:05

    The csv module supports tab-delimited files. Supply the delimiter argument to csv.reader:

    import csv
    
    txt_file = r"mytxt.txt"
    csv_file = r"mycsv.csv"
    
    # On Python 3, open both files in text mode with newline='' so the csv
    # module handles line terminators itself.
    # On Python 2, use "rb" and "wb" instead: the 'b' is necessary on Windows,
    # where it prevents \x1a (Ctrl-Z) from ending the stream prematurely
    # and stops Python converting to / from different line terminators.
    # The 'with' block closes both files, so none are left open
    # if the program isn't going to terminate immediately.
    with open(txt_file, newline='') as fin, open(csv_file, 'w', newline='') as fout:
        in_txt = csv.reader(fin, delimiter='\t')
        out_csv = csv.writer(fout)
        out_csv.writerows(in_txt)
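
    As a sketch only, the same reader/writer pairing can be wrapped in a small Python 3 function. The name tsv_to_csv, the encoding parameter, and the sample data are illustrative assumptions, not part of the answer. The demo writes a temporary tab-delimited file and converts it.

```python
import csv
import os
import tempfile


def tsv_to_csv(src_path, dst_path, encoding="utf-8"):
    """Copy rows from a tab-delimited file to a comma-delimited one.

    Hypothetical helper: returns the number of rows written.
    """
    # newline="" lets the csv module manage line terminators itself.
    with open(src_path, newline="", encoding=encoding) as src, \
         open(dst_path, "w", newline="", encoding=encoding) as dst:
        reader = csv.reader(src, delimiter="\t")
        writer = csv.writer(dst)
        count = 0
        for row in reader:
            writer.writerow(row)
            count += 1
    return count


# Demo with made-up data: one field contains a comma.
with tempfile.TemporaryDirectory() as tmp:
    src = os.path.join(tmp, "mytxt.txt")
    dst = os.path.join(tmp, "mycsv.csv")
    with open(src, "w", newline="", encoding="utf-8") as f:
        f.write("name\tcity\nAda\tLondon, UK\n")
    n_rows = tsv_to_csv(src, dst)
    with open(dst, newline="", encoding="utf-8") as f:
        csv_text = f.read()
```

    The field that contains a comma is quoted in the output, which is the main reason to let csv.writer produce the file instead of joining fields by hand.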
    
  • 2020-11-30 03:09

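    This is how I do it:

    import csv

    txtfile = "mytxt.txt"  # placeholder input path
    csvfile = "mycsv.csv"  # placeholder output path

    with open(txtfile, 'r') as infile, open(csvfile, 'w', newline='') as outfile:
        # remove only the line ending, so empty first or last fields survive
        stripped = (line.rstrip('\r\n') for line in infile)
        # split on tabs (the input delimiter) and skip blank lines
        lines = (line.split('\t') for line in stripped if line)
        writer = csv.writer(outfile)
        writer.writerows(lines)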
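
    Splitting each line by hand is only safe when no field is quoted. The short comparison below, using made-up sample data, shows how a manual tab split and csv.reader differ when a quoted field contains a tab.

```python
import csv
import io

# Hypothetical sample: the second field of the data row is quoted
# and contains a tab character.
sample = 'id\tnote\n1\t"tab\there"\n'

# Manual approach: split every non-empty line on tabs.
naive_rows = [line.split('\t') for line in sample.splitlines() if line]

# csv.reader honours the quotes and keeps the embedded tab in one field.
parsed_rows = list(csv.reader(io.StringIO(sample, newline=''), delimiter='\t'))

print(naive_rows)
print(parsed_rows)
```

    The manual split breaks the quoted field into two pieces and leaves the quote characters in place, while csv.reader returns it as a single field.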
    
  • 2020-11-30 03:10

    Why you should always use 'rb' mode when reading files with the csv module in Python 2 (the Python 3 counterpart is text mode with newline=''):

    Python 2.7.2 (default, Jun 12 2011, 15:08:59) [MSC v.1500 32 bit (Intel)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    

    What's in the sample file: any old rubbish, including control characters obtained by extracting blobs or whatever from a database, or injudicious use of the CHAR function in Excel formulas, or ...

    >>> open('demo.txt', 'rb').read()
    'h1\t"h2a\nh2b"\th3\r\nx1\t"x2a\r\nx2b"\tx3\r\ny1\ty2a\x1ay2b\ty3\r\n'
    

    Python 2 on Windows follows the CP/M and MS-DOS convention when it reads files in text mode: \r\n is recognised as the line separator and is served up as \n, and \x1a (Ctrl-Z) is recognised as an END-OF-FILE marker.

    >>> open('demo.txt', 'r').read()
    'h1\t"h2a\nh2b"\th3\nx1\t"x2a\nx2b"\tx3\ny1\ty2a' # WHOOPS
    

    csv with a file opened with 'rb' works as expected:

    >>> import csv
    >>> list(csv.reader(open('demo.txt', 'rb'), delimiter='\t'))
    [['h1', 'h2a\nh2b', 'h3'], ['x1', 'x2a\r\nx2b', 'x3'], ['y1', 'y2a\x1ay2b', 'y3']]
    

    but text mode doesn't:

    >>> list(csv.reader(open('demo.txt', 'r'), delimiter='\t'))
    [['h1', 'h2a\nh2b', 'h3'], ['x1', 'x2a\nx2b', 'x3'], ['y1', 'y2a']]
    >>>
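
    The transcript above is from Python 2. A rough Python 3 counterpart, offered as a sketch, reuses the same demo.txt bytes and compares text mode with newline='' against the default newline handling. The helper read_rows and the temporary file location are assumptions made for the demo.

```python
import csv
import os
import tempfile

# Same bytes as the demo.txt contents shown in the transcript.
raw = b'h1\t"h2a\nh2b"\th3\r\nx1\t"x2a\r\nx2b"\tx3\r\ny1\ty2a\x1ay2b\ty3\r\n'


def read_rows(path, **open_kwargs):
    """Hypothetical helper: parse a tab-delimited file with csv.reader."""
    with open(path, encoding="ascii", **open_kwargs) as f:
        return list(csv.reader(f, delimiter="\t"))


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "demo.txt")
    with open(path, "wb") as f:
        f.write(raw)
    # newline="" passes line endings through untouched.
    rows_preserved = read_rows(path, newline="")
    # Default text mode translates \r\n to \n before csv sees the data.
    rows_translated = read_rows(path)

print(rows_preserved)
print(rows_translated)
```

    With newline='' the \r\n embedded in the quoted field of the second row survives; with the default setting it arrives as \n.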
    