Google Apps Script Utilities.parseCsv() and replacement character - �

后端 未结 3 1321
余生分开走
余生分开走 2021-02-10 14:28

I\'m working on a project that involves a csv file in Google Drive that is updated with new data every minute or so.

I\'ve built a spreadsheet dashboard to make the dat

相关标签:
3条回答
  • 2021-02-10 14:51

    Have you check the file charset? You can specify it when calling getDataAsString(charset). Try this:

    function importData() {
      var ss = SpreadsheetApp.getActive();
      var file = DriveApp.getFilesByName("Agent Performance.csv")
      var csv = file.next().getBlob().getDataAsString('ISO-8859-1'); //note the charset
      var csvData = Utilities.parseCsv(csv);
      //unless you csv has variable amount of columns per line, you should do this
      if(csvData.length > 0) {
        ss.getSheetByName('CSV Import TEST')
          .getRange(1, 1, csvData.length, csvData[0].length).setValues(csvData);
      } else
        throw 'Blank file';
    }
    
    0 讨论(0)
  • 2021-02-10 14:52

    This may help but you will probably need to investigate whether it causes other difficulties with your data:

    I had the same problem importing a .txt file containing bank transaction data as the bank doesn't offer CSV download files. I found that the odd Characters are FFFD which appear to be inserted by the fileXYZ.getblob() method as substitutes for unrecognized characters for unrecognized codes, in my case these are replaced by spaces.

    my (very basic) solution once you have a file loaded into is as follows..

    function getBankTransactionFile(fileNameToGet) {
    // fileNameToGet is .txt and stored in folder specified in Var list
    
     var  inputFileFolderID = '0B2XXX insert your folder ID',
         fldrID = DriveApp.getFolderById(inputFileFolderID),
         theFileRetrieved = fldrID.getFilesByName('yourFileName'),
         csvFile, cntFiles = 0;
    
    // Even if it's only one file, must iterate a while loop in order to access the file. Google drive will allow multiple files of the same name.
      while (theFileRetrieved.hasNext()) {
        var fileXYZ = theFileRetrieved.next();
        cntFiles = cntFiles + 1;
        csvFile = Utilities.parseCsv(fileXYZ.getBlob()
                  .getDataAsString().replace('\uFFFD'," ",'g'), "\n")
        // Utilities.parseCsv(csv, delimiter) returns 2D array but the fileXYZ 
        // text loaded has FFFD characters inserted so these are substituted for
        // 'space' using the .replace method and 'g' for global flag
      }
      return csvFile;
    }
    

    I am brand new to GAS (converting from VBA) so pretty sure there is a more refined way but it works for my data.. note the \n is the newline character as the specified delimiter for my data. I found out what the odd character were by using Logger to show the raw data string, then extracting the code .charCodeAt(n) counting the characters to find n. As the .txt will import you can see what the unrecognized characters should be.. spaces in my case.

    0 讨论(0)
  • 2021-02-10 15:08

    I had the same requirement and the same problem with a csv file. What I did may be a workaround but worked at least fine for me.

    The "�" may actually be any type of ASCII character that is not recognized so in my case searching for "\uFFFD" did not solve the problem. So what I did is basically convert the payload in binary data. There I managed to notice that between all characters a NULL was being delivered (ASCII Code 0). This was in my case the �. So what I did is rebuild the byte array without the 0s and then copy it in the spreadsheet again.

    var response = UrlFetchApp.fetch(theUrl);
    var payload = response.getContentText();
    //Get byte Array 
    var bytes= response.getContent();
    var myArray = [];
    //Build byte array without the faulty characters
    for ( var i =1 ; i<bytes.length; i++){
      if (bytes[i] != 0){
       myArray.push(bytes[i]);
      }
    }
    //Reconvert to string.
    var newArray = Utilities.newBlob(myArray).getDataAsString();
    

    This script in my case also works fine if I am importing numbers and using them in formulas.

    0 讨论(0)
提交回复
热议问题