Comparing the schema and discarding the object
Hi Team,
I have a scenario that I would like to solve as soon as possible.
I have two files: Initial Load and Incremental Load.
We have loaded the initial load data into the DB, which contains columns such as id, name, age, class.
We receive incremental files on a daily basis, and in some cases they contain an extra column. We have to discard that column's values and alert the team that a new column has been added. How can we do this in the pipeline?
Incremental file columns:
We have to raise an alert that the column roll_number has been added, and also discard that column. We can't do this manually because it is a daily job, and we can use the Mapper in this case.
Can anyone help me with this?
mohit_jain, the solution is very straightforward.
1. Get the header from your initial file.
2. Merge it with your incremental file.
3. Filter the matched and unmatched columns into two different output streams.
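For anyone who wants to see the logic outside the pipeline designer, the three steps above can be sketched in plain Python. This is only an illustration under assumptions, not the attached pipeline: the baseline column list is taken from the question, while the function names `split_columns` and `process_incremental`, the alert message text, and the sample row are all hypothetical.

```python
import csv
import io

# Assumption: the columns already loaded into the DB, as listed in the question.
BASELINE_COLUMNS = ["id", "name", "age", "class"]


def split_columns(baseline, incoming):
    """Split incoming column names into (matched, unmatched) against the baseline."""
    known = set(baseline)
    matched = [name for name in incoming if name in known]
    unmatched = [name for name in incoming if name not in known]
    return matched, unmatched


def process_incremental(stream, baseline=BASELINE_COLUMNS):
    """Read an incremental CSV, keep only baseline columns, and collect alerts."""
    reader = csv.DictReader(stream)
    matched, unmatched = split_columns(baseline, reader.fieldnames or [])
    # One alert message per newly added column (hypothetical wording).
    alerts = [f"New column added: {name}" for name in unmatched]
    # Discard the unmatched columns from every row before loading.
    rows = [{name: row[name] for name in matched} for row in reader]
    return rows, alerts


# Hypothetical incremental file containing the extra roll_number column.
sample = io.StringIO("id,name,age,class,roll_number\n1,Asha,12,7,42\n")
rows, alerts = process_incremental(sample)
```

Here `rows` would feed the load step and `alerts` would go to whatever notification step the team uses. Splitting on the header alone means the check costs the same regardless of how many rows the daily file has.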
Download the solution pipeline Compare_Schema_Discard_load; the screenshot is below.