For saving data in parquet format in s3 , below is the pipeline configuration.
This pipeline creates meta data from the data itself , though it uses parquet data type ‘binary’ which is equivalent to string.
the first mapper converts the doc into str...