Unable to write salesforce data to s3 in parquet format

manichandana_ch
New Contributor III
3 years ago
Hi Roger667
please add a gate snap and JSON splitter to split the incoming data from output of gate snap. please remove the salesforce read mapper and connect the output of JSON splitter to parquet writer data input. In JSON splitter give this - jsonPath($, "input0[*]")
Here is the screenshot of what changes are done. please connect 395 port to parquet writer data input.

Issue is the design of pipeline. sf read snap is starting to process data records to parquet writer data input immediately but the schema is not getting processed from the sf read snap, seems like it will process the metadata after all the data is read and processed. But the data is not being processed further because parquet writer is not getting schema to start writing as it's waiting for schema details, it's like an interlock. when gate snap is added to the sf read data output gate snap, it accumulates all input data before it proceeds further so that before passing to parquet writer, data is accumulated at gate snap and then metadata is identified at schema output. Then it starts writing to parquet writer.
Thanks & Regards,
Mani Chandana Chalasani
- Roger667
  New Contributor III
  3 years ago
  HI manichandana_ch
  Thanks for solving the buffer issue. This Worked but this led to another error. The parquet is not able to write boolean data types even though i have it in the excel file. It is identifying Boolean columns as string even tough i can see in the metadata that those columns are not string but boolean
  - manichandana_ch
    New Contributor III
    3 years ago
    Hi Roger667
    Could you please check your SF read snap and make sure the 'Match datatype' is checked.
    Thanks !

Forum Discussion

Recent Discussions

Way to lock down in Prod org to "Monitor" only access?

trace API and proxy calls

Pagination Logic Fails After Migrating from REST GET to HTTP Client Snap

Pipeline Execute Pool size

Concat values of a field based on value of another field