MySQL export into outfile : CSV escaping chars

前端 未结 6 1928
暗喜
暗喜 2020-11-27 13:44

I\'ve a database table of timesheets with some common feilds.

id, client_id, project_id, task_id, description, time, date 

There are more b

相关标签:
6条回答
  • 2020-11-27 14:14

    Here is what worked here: Simulates Excel 2003 (Save as CSV format)

    SELECT 
    REPLACE( IFNULL(notes, ''), '\r\n' , '\n' )   AS notes
    FROM sometables
    INTO OUTFILE '/tmp/test.csv' 
    FIELDS TERMINATED BY ',' ENCLOSED BY '"' ESCAPED BY '"'
    LINES TERMINATED BY '\r\n';
    
    1. Excel saves \r\n for line separators.
    2. Excel saves \n for newline characters within column data
    3. Have to replace \r\n inside your data first otherwise Excel will think its a start of the next line.
    0 讨论(0)
  • 2020-11-27 14:18

    I think your statement should look like:

    SELECT id, 
       client,
       project,
       task,
       description, 
       time,
       date  
      INTO OUTFILE '/path/to/file.csv'
      FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '"'
      LINES TERMINATED BY '\n'
      FROM ts
    

    Mainly without the FIELDS ESCAPED BY '""' option, OPTIONALLY ENCLOSED BY '"' will do the trick for description fields etc and your numbers will be treated as numbers in Excel (not strings comprising of numerics)

    Also try calling:

    SET NAMES utf8;
    

    before your outfile select, that might help getting the character encodings inline (all UTF8)

    Let us know how you get on.

    0 讨论(0)
  • 2020-11-27 14:25

    Probably won't help but you could try creating a CSV table with that content:

    DROP TABLE IF EXISTS foo_export;
    CREATE TABLE foo_export LIKE foo;
    ALTER TABLE foo_export ENGINE=CSV;
    INSERT INTO foo_export SELECT id, 
       client,
       project,
       task,
       REPLACE(REPLACE(ifnull(ts.description,''),'\n',' '),'\r',' ') AS description, 
       time,
       date
      FROM ....
    
    0 讨论(0)
  • 2020-11-27 14:29

    Without actually seeing your output file for confirmation, my guess is that you've got to get rid of the FIELDS ESCAPED BY value.

    MySQL's FIELDS ESCAPED BY is probably behaving in two ways that you were not counting on: (1) it is only meant to be one character, so in your case it is probably equal to just one quotation mark; (2) it is used to precede each character that MySQL thinks needs escaping, including the FIELDS TERMINATED BY and LINES TERMINATED BY values. This makes sense to most of the computing world, but it isn't the way Excel does escaping.

    I think your double REPLACE is working, and that you are successfully replacing literal newlines with spaces (two spaces in the case of Windows-style newlines). But if you have any commas in your data (literals, not field separators), these are being preceded by quotation marks, which Excel treats much differently than MySQL. If that's the case, then the erroneous newlines that are tripping up Excel are actually newlines that MySQL had intended as line terminators.

    0 讨论(0)
  • 2020-11-27 14:35

    Below procedure worked for me to resolve all the escaping issues and have the procedure more a generic utility.

    CREATE PROCEDURE `export_table`(
    IN tab_name varchar(50), 
    IN select_columns varchar(1000),
    IN filename varchar(100),
    IN where_clause varchar(1000),
    IN header_row varchar(2000))
    
    BEGIN
    INSERT INTO impl_log_activities(TABLE_NAME, LOG_MESSAGE,CREATED_TS) values(tab_name, where_clause,sysdate());
    COMMIT;
    SELECT CONCAT( "SELECT ", header_row,
        " UNION ALL ",
        "SELECT ", select_columns, 
        " INTO OUTFILE ", "'",filename,"'"
        " FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"' ESCAPED BY '""' ",
        " LINES TERMINATED BY '\n'"
        " FROM ", tab_name, " ",
        (case when where_clause is null then "" else where_clause end)
    ) INTO @SQL_QUERY;
    
    INSERT INTO impl_log_activities(TABLE_NAME, LOG_MESSAGE,CREATED_TS) values(tab_name, @SQL_QUERY, sysdate());
    COMMIT;
    
    PREPARE stmt FROM @SQL_QUERY;
    EXECUTE stmt;
    DEALLOCATE PREPARE stmt;
    

    END

    0 讨论(0)
  • 2020-11-27 14:40

    What happens if you try the following?

    Instead of your double REPLACE statement, try:

    REPLACE(IFNULL(ts.description, ''),'\r\n', '\n')
    

    Also, I think it should be LINES TERMINATED BY '\r\n' instead of just '\n'

    0 讨论(0)
提交回复
热议问题