JmPearl
3 years agoNew Contributor II
RPE only returns small proportion of documents
I have a RPE running with the snaplogic recommended data science docker image. When i pass documents to the remote python script (hundreds, sometimes thousands) I only get around 1% of the documents back from the snap. Even when i literally pass the documents through the snap without any transforms, I get only 1% back.
whats going on here?
Any help with this is appreciated
Ultimately i figured it out by digging into the log files produced by my RPE. The text being sent to the remote python code snap was not decoded properly, and therefor the RPE would break at the nth document which had faulty text. The problem was fixed by setting up a mapper snap that used HTML.decode on the text prior to the RPE snap.