I need to delete string from position X to position Y on each line in a text file

后端 未结 3 1905
轻奢々
轻奢々 2021-01-24 12:57

I have a huge flat file 100K records each spanning 3000 columns. I need to removed a segment of the data fay starting position 300 to position 500 before archiving. This is sens

相关标签:
3条回答
  • 2021-01-24 13:38

    Assuming that position means column, you can use cut to select the columns you want.

    cut -f 1-299,501-3000 CutMe.txt
    

    If your data is delimited by commas instead of tabs, then use -d.

    cut -d, -f 1-299,501-3000 CutMe.txt
    

    If position means character, you can do the same with cut -c.

    cut -c 1-299,501-3000 CutMe.txt
    
    0 讨论(0)
  • 2021-01-24 13:45

    Using sed:

    sed -r -i.bak 's/(.{299}).{200}/\1/' file
    

    The -r option enables extended regex. If you need to make it portable you can remove that option by escaping braces and curlies. The -i option makes changes in-places. I have put an extension .bak to safeguard from any mess up. You can remove it if you don't need to maintain the backup of original.

    For solution, we just capture the first 299 characters in a capture group and seek the next 200 we need to remove. We substitute this entire line with our captured group.

    0 讨论(0)
  • 2021-01-24 13:47

    Assuming "position" means "character":

    awk '{print substr($0,1,299) substr($0,501)}' file
    

    If it doesn't then edit your question to add some REPRESENTATIVE sample input and expected output (e.g. 5 lines of 6 columns each, not thousands of lines of thousands of columns).

    0 讨论(0)
提交回复
热议问题