Forum Discussion

kumar25
New Contributor II
5 months ago
Solved

How to read .xlsb file

Hi

I need to read data from an .xlsb file. I tried the Excel Parser snap, but it throws an error.

  • kumar25 - My dev team says that the Excel Parser snap does not currently support this format. My main concern with using a Script snap to handle the parsing is how the file reaches the script. Either the file is passed as a single binary document, which consumes as much memory as the file size (and these files can be quite large), or you place the file on the Snaplex node before parsing and ensure that the SnapLogic process has read permission for it.

    Having the Script snap perform the parsing also requires that you install and maintain the libraries needed to parse the file.

    While it is feasible, I would recommend checking whether the source can provide the more standard XLSX format before going that route.
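
To make the memory trade-off above concrete, here is a minimal Python sketch. It is generic Python, not SnapLogic-specific code: it copies a binary stream to a temporary file in fixed-size chunks, so peak memory stays near the chunk size rather than the file size. The helper name `spool_to_temp_file` and the chunk size are hypothetical, and whether a Script snap can expose the incoming binary as a readable stream is an assumption that this thread does not confirm.

```python
import io
import os
import shutil
import tempfile

CHUNK_SIZE = 1024 * 1024  # 1 MiB per read; an arbitrary illustrative value


def spool_to_temp_file(stream, suffix=".xlsb", chunk_size=CHUNK_SIZE):
    """Copy a binary stream to a temp file in chunks and return its path.

    Hypothetical helper: copying proceeds in reads of at most `chunk_size`
    bytes. The caller is responsible for deleting the file afterwards.
    """
    with tempfile.NamedTemporaryFile(suffix=suffix, delete=False) as tmp:
        shutil.copyfileobj(stream, tmp, chunk_size)
        return tmp.name


if __name__ == "__main__":
    # Stand-in for the incoming binary document (3 MiB of dummy bytes).
    source = io.BytesIO(b"\x00" * (3 * CHUNK_SIZE))
    path = spool_to_temp_file(source)
    try:
        print(path, os.path.getsize(path))
    finally:
        os.remove(path)
```

The temporary file still has to live on a disk that the process can read and write, which is the same permission concern raised in the reply.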

9 Replies

  • kumar25 - this is not a commonly used format, and I do not have personal experience with it. I will post to our internal channels to see if I can get an answer.

  • kumar25 - do you happen to know if this file was created on a Linux or Windows machine?  This apparently may make a difference to the parser.

  • kumar25 - it might be faster if you open a support ticket to have this question answered. Apparently we have seen this work with other customers, so there may be some conditions that require further investigation by the support team.

    • kumar25
      New Contributor II

      Hi koryknick, this file was created on a Windows machine.

      I wanted to see whether a script could convert the file to a readable format (.xlsx/.xlsm).

      I need to check whether I can open a support ticket; I am not sure I have access.

  • kumar25 - The dev team has informed me that the Excel Parser only supports XLS and XLSX formats today.  So requesting support for XLSB would be an enhancement request, which would need to be evaluated and prioritized according to the number of customers impacted and weighed against other priorities.

    So I would recommend that you research Java or Python libraries that can parse the files you are trying to source, then look at how to implement that in a Script snap so it can be called within your pipeline.

    Hope this helps! 
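
As one possible starting point for the library research suggested above, here is a minimal sketch that streams one sheet of an .xlsb file into CSV. It assumes the third-party `pyxlsb` package is installed; the function names `iter_xlsb_rows`, `write_csv`, and `convert_xlsb_to_csv` are hypothetical. It writes CSV rather than .xlsx because the Python standard library has no XLSX writer. Whether `pyxlsb` can be loaded inside the Script snap's Python runtime is not confirmed in this thread, so treat this as a standalone script.

```python
import csv
import io


def iter_xlsb_rows(path, sheet_index=1):
    """Yield each row of one sheet as a list of cell values.

    Assumes the third-party `pyxlsb` package is installed.
    pyxlsb numbers sheets starting at 1.
    """
    from pyxlsb import open_workbook  # imported lazily: optional dependency

    with open_workbook(path) as workbook:
        with workbook.get_sheet(sheet_index) as sheet:
            for row in sheet.rows():
                yield [cell.v for cell in row]


def write_csv(rows, out):
    """Write an iterable of rows to a text file object; return the row count."""
    writer = csv.writer(out)
    count = 0
    for row in rows:
        writer.writerow(row)  # csv writes None as an empty field
        count += 1
    return count


def convert_xlsb_to_csv(xlsb_path, csv_path, sheet_index=1):
    """Stream one sheet of an .xlsb file into a CSV file, row by row."""
    with open(csv_path, "w", newline="", encoding="utf-8") as out:
        return write_csv(iter_xlsb_rows(xlsb_path, sheet_index), out)


if __name__ == "__main__":
    # Demonstrates the CSV step only, with in-memory rows instead of a workbook.
    buffer = io.StringIO()
    write_csv([["id", "name"], [1, "example"]], buffer)
    print(buffer.getvalue())
```

Because rows are yielded one at a time, the CSV is written without building a full in-memory table of the sheet. Note that `pyxlsb` returns date cells as serial numbers, so date columns would need a separate conversion step.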

    • kumar25
      New Contributor II

      koryknick, thanks for the solution. Can you provide a sample pipeline that does this transformation?
