cancel
Showing results for 
Search instead for 
Did you mean: 

How to avoid Reading Data File Multiple Times

sshilpam
New Contributor

Hi,
How to read the file only once and perform multiple validations on the file.
Example:
My file has 20 columns and I want to find the count of 4 different columns (Column1, Column4, Column9 and Column19).
I have the logic to get the count built in a child pipeline and the Parent pipeline passes the column names for which the count has to be found.

Since the columns can be changing for each file, we have made this dynamic.

Issue here is, we are reading the file in teh child pipeline and based on teh column names passed from the parent pipeline, we are getting the count.

If we need to get the count for 4 columns, the child pipeline will execute 4 times and the input file is being read 4 times. Is there a way we can avoid reading the file multiple times.

Example: I want to find the count in Message, Invoice_id and amount columns.

image

2 REPLIES 2

akidave
Employee
Employee

After reading the file and parsing it, you could use the Copy snap to have multiple different validations run simultaneously on the input documents. A downstream Gate snap can combine all the streams into one.

Thank you Ajay for the response.

We have a file which has the list of conditions. For one component the conditions could be 5 and for other component the conditions could be 10. Since this is dynamic, we will not be able to tell how many data copies are required.

Is there a way we can build this dynamically.