12-29-2023 03:14 AM
I have a file -
RowId | Date     | Name |
1     |          | a    |
2     | 12/10/23 | b    |
3     |          | c    |
4     |          | d    |
5     |          | e    |
6     |          | f    |
and I want it like this -
I want to copy the Date value from the second row into all of the rows.
RowId | Date     | Name |
1     | 12/10/23 | a    |
2     | 12/10/23 | b    |
3     | 12/10/23 | c    |
4     | 12/10/23 | d    |
5     | 12/10/23 | e    |
6     | 12/10/23 | f    |
12-30-2023 07:16 AM - edited 12-30-2023 07:17 AM
Hello,
There are a few ways to achieve this. Working on each document as an object, you can merge in the Date:
$.merge({Date: '12/10/23'})
Or, grouped into an array, you can map the Date from the second row into all of the documents.
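If it helps to see the grouped approach outside of SnapLogic, here is a rough Python sketch of the same idea (illustrative only, using the field names from the example above):

# Illustrative sketch of the "group, then map" approach -
# this is plain Python, not SnapLogic expression code.
records = [
    {"RowId": 1, "Date": None, "Name": "a"},
    {"RowId": 2, "Date": "12/10/23", "Name": "b"},
    {"RowId": 3, "Date": None, "Name": "c"},
]

# Take the Date from the second record and merge it into every record.
shared_date = records[1]["Date"]
result = [{**rec, "Date": shared_date} for rec in records]
# result: every record now carries Date "12/10/23"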
Hope it helps!
01-04-2024 05:04 AM
@salonimaggo - the solution provided by @nativeBasic works functionally, but I am concerned about memory consumption with large files, since grouping the entire input into an array forces the whole data set into memory. Attached is a pipeline that meets the same requirement while keeping the data streaming (or cached to disk to perform the sort) for larger data sets. This is a safer approach in a shared environment.
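For readers who want the streaming idea in code form, here is a rough Python analogy (not the attached pipeline - the file name and CSV layout are assumptions based on the example above). It reads the input twice so that only one record is ever held in memory at a time:

import csv
import itertools

# First pass: stream just far enough to grab the Date from the second row.
with open("input.csv", newline="") as f:
    reader = csv.DictReader(f)
    second_row = next(itertools.islice(reader, 1, 2))
    shared_date = second_row["Date"]

# Second pass: stream the file again, merging the Date into each record
# as it flows through, so memory use stays flat regardless of file size.
with open("input.csv", newline="") as f_in, \
     open("output.csv", "w", newline="") as f_out:
    reader = csv.DictReader(f_in)
    writer = csv.DictWriter(f_out, fieldnames=reader.fieldnames)
    writer.writeheader()
    for row in reader:
        row["Date"] = shared_date
        writer.writerow(row)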
Just for some edification, you may wonder why I am using a Sort in the top path. It is there to prevent a processing deadlock for input data sets larger than 1024 records, which is the maximum number of documents allowed to buffer between snaps. With how I have it configured, the Aggregate snap waits until it has consumed all of its input before sending anything to its output. Without the Sort snap in the top path, the Copy could not send more than 1024 documents downstream: the Join cannot complete because the bottom path has not emitted anything, and the bottom path cannot emit anything because the Copy stops feeding it once the top buffer fills with those 1024 documents.
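To make the deadlock mechanics concrete, here is a small Python sketch (not SnapLogic code) of the same fan-out/fan-in shape, with a bounded queue standing in for the 1024-document inter-snap buffer. The producer stalls as soon as the undrained top buffer fills, which in turn starves the aggregating bottom path:

import queue
import threading

N_DOCS = 20
BUFFER = 4   # stand-in for the 1024-document buffer between snaps

top = queue.Queue(maxsize=BUFFER)      # Copy -> Join, no Sort to absorb documents
bottom = queue.Queue(maxsize=BUFFER)   # Copy -> Aggregate

def copy_snap():
    # The Copy writes every document to BOTH paths; put() blocks on a full buffer.
    try:
        for doc in range(N_DOCS):
            top.put(doc, timeout=2)
            bottom.put(doc, timeout=2)
    except queue.Full:
        print("Copy stalled: the top buffer is full and the Join is not draining it")

def bottom_path():
    # The Aggregate must consume ALL of its input before emitting anything,
    # but the stalled Copy never delivers the rest of the documents.
    try:
        docs = [bottom.get(timeout=5) for _ in range(N_DOCS)]
        print("Aggregate emitted", len(docs), "documents")  # never reached
    except queue.Empty:
        print("Aggregate starved: it only received part of the input")

threads = [threading.Thread(target=copy_snap), threading.Thread(target=bottom_path)]
for t in threads:
    t.start()
for t in threads:
    t.join()
# The Join never reads from `top` because it is waiting on the Aggregate's
# output - the circular wait that the buffering Sort snap breaks.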
Go ahead and test the deadlock by removing the Sort snap and attaching the Copy directly to the "Cartesian Join" - when executed, the Copy will stop sending data after 1024 documents and the pipeline will never complete until you force it to stop. Note that if you try this using Validate Pipeline, you will want to increase your User Settings / Preview Document Count to at least 1500 and make sure your input source has more than 1024 documents.
Hope this helps!