Forum Discussion
To solve this I created a pipeline with a ZipFile Reader, followed by a Binary Router, to select only the “content” files rather than the metadata files in this file, use the filter:
!$[‘content-location’].startsWith(“__”)
This then gives you only the output files you want.
The output from the Zipfile Reader is Binary, and the files are streamed to a downstream Snap. If you use the Binary Router Snap next, you can choose to route the ‘content’ files to one output, this might be using the file name, the content length or some other field(s) to use in the selection criteria. For those ‘files’ you want to discard, if you leave the second output unterminated, it will just get discarded. If you really want to be clean, then use a file writer on that output and write it to /dev/null.