10-04-2023 02:38 AM
I am trying to write data from postgres to parquet. The source has numeric data type which has can values as int, decimal or float. I have mapped numeric type to decimal in parquet but the issue is that it converts int values to decimal as well. ex 1 becomes 1.00 and when i map the numeric datatype to int i loose the decimal values. It will be a general pipeline for many objects and i wont have the column schema at the runtime. Is there any workaround for this in parquet writer where we can distinguish between int and decimal for numeric data type.
10-08-2023 09:54 PM
Hi @davidmil
Best option is to read and convert the data types from source, as @SpiroTaleski mentioned you can use expression file and maintain the datatype conversions, in such case, you need not worry for any sources/source metadata. you can maintain all possible source datatypes in the file and convert them to parquet supported datatypes and pass to parquet writer schema input, everything will be handled in this case.