cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Sort snap's functionality

darshthakkar
Valued Contributor

Results of a sort snap (ascending) is a little bit hard to digest. I used a file reader which was reading a single column A (i.e. IDs) and I decided to sort it. The results I get are surprising.

Input raw data:
1217
151983
1543
370904
402
402783
901602
373
373916
374
120651
1216
1000022
1000027
394533
906
909
1000019
918
1205

Output after sort:
1000019
1000022
1000027
1205
120651
1216
1217
151983
1543
370904
373
373916
374
394533
402
402783
901602
906
909
918

Result expected:
373
374
402
906
909
918
1205
1216
1217
1543
120651
151983
370904
373916
394533
402783
901602
1000019
1000022
1000027

Can anyone explain whatโ€™s the rationale behind the results I got using the sort snap? Itโ€™s confusing for me to be honest.
Thanking you in advance for your time, help and response.

Regards,
Darsh

1 ACCEPTED SOLUTION

bojanvelevski
Valued Contributor

Hey @darshthakkar,

This happens because the field youโ€™re sorting is string, so it is sorted alphabetically.

As I can see, all of these values support parsing so try and parse them before sorting.

Ex. : parseInt($ID)

View solution in original post

8 REPLIES 8

bojanvelevski
Valued Contributor

Hey @darshthakkar,

This happens because the field youโ€™re sorting is string, so it is sorted alphabetically.

As I can see, all of these values support parsing so try and parse them before sorting.

Ex. : parseInt($ID)

Thanks a TON @bojanvelevski, does using an โ€œExcel Parserโ€ after file reader help? Do we need to explicitly ingest a mapper and then put parseInt($ID) under expression?

Just for your reference, my pipeline looks like File Reader โ†’ Excel Parser โ†’ Mapper (as Iโ€™m renaming the single column, could have renamed it in excel itself before reading it but went this route) โ†’ Sort

It worked! I tried testing with parseInt($ID) and I got the desired results, thanks again @bojanvelevski ๐Ÿ™‚ :innocent:

You can use the same maper where you are renaming a column, and yes, implicitly parse the column you are sorting on.

Check the "Cell formatting" option, if "Formatted" is selected than I suppose the sorting field is coming as a string from the document. If youโ€™re able to edit the file than you can try changing the whole column formatting to number.