Forum Discussion
Hi manichandana_ch, SpiroTaleski
Is this issue caused by the Salesforce Read Snap (for example, an API-related problem), or does it come from the Parquet Writer's end?
- manichandana_ch, 3 years ago, New Contributor III
Hi Roger667
Please add a Gate Snap followed by a JSON Splitter, so that the JSON Splitter splits the data coming out of the Gate Snap. Remove the Salesforce Read mapper and connect the output of the JSON Splitter to the Parquet Writer's data input. In the JSON Splitter, give this path: jsonPath($, "input0[*]")
Here is a screenshot of the changes. Please connect the 395 port to the Parquet Writer's data input.
The issue is the design of the pipeline. The SF Read Snap starts sending data records to the Parquet Writer's data input immediately, but the schema does not arrive at the same time; it seems the Snap only emits the metadata after all the data has been read and processed. Meanwhile the data cannot move any further, because the Parquet Writer is waiting for the schema details before it starts writing. It's like an interlock.
A Gate Snap accumulates all of its input before it passes anything on. With a Gate Snap on the SF Read data output, the data is held at the Gate Snap until reading is complete, and by then the metadata has been identified at the schema output. The Parquet Writer then has the schema and starts writing.
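To make the Gate + JSON Splitter step a bit more concrete, here is a rough sketch in plain Python. It is not SnapLogic code, only a model of the data shapes: the key "input0" is taken from the path above, while the function names and the sample rows are made up for illustration.

```python
from typing import Any, Dict, Iterable, Iterator, List

Record = Dict[str, Any]


def gate(records: Iterable[Record]) -> Dict[str, List[Record]]:
    """Consume the whole input stream, then emit a single document.

    Assumption: the accumulated records sit under the key "input0",
    which is what jsonPath($, "input0[*]") refers to.
    """
    return {"input0": list(records)}


def json_splitter(document: Dict[str, List[Record]]) -> Iterator[Record]:
    """Emit each element of document["input0"] as its own record."""
    yield from document["input0"]


# Hypothetical rows standing in for the SF Read data output.
sf_rows = [
    {"Id": "001", "IsActive": True},
    {"Id": "002", "IsActive": False},
]

gated = gate(iter(sf_rows))                    # nothing flows on until all rows are read
rows_for_parquet = list(json_splitter(gated))  # back to one record per row
```

The point of the sketch is only the ordering: nothing reaches the writer until the full input has been collected, which gives the schema output time to be produced first.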
Thanks & Regards,
Mani Chandana Chalasani
- Roger667, 3 years ago, New Contributor III
Hi manichandana_ch
Thanks for solving the buffer issue. This worked, but it led to another error: the Parquet Writer is not writing boolean data types, even though I have them in the Excel file. It identifies the boolean columns as string, even though I can see in the metadata that those columns are boolean, not string.
- manichandana_ch, 3 years ago, New Contributor III
Hi Roger667
Could you please check your SF Read Snap and make sure that 'Match datatype' is checked?
Thanks!