Forum Discussion
Thanks endor_force for the detail analysis , I have tried with other sample file which do not have Chinese characters, It does not accept throw the same error, Instead I used ISO-8859-1 which accept the input, Now the problem is what is would be output character set. I can not see any ANSI or MS DOS character set there instead it shows Windows character sets
I am trying to get the output in MS DOS ANSI character set format , any thoughts ?
IBM850 would be the Latin multilingual MS-DOS charset i assume.
I tested on my pc and when typing out a file transcoded from UTF-8 to IBM850 in a windows dos prompt it looks ok, i have not verified on older dos versions or dosbox.
It will fail with error in transcoding if you have any unsupported character of the target charset, even the euro-sign (€) will cause failure. It seems like the transcoding is relying on old non-euro version of IBM850?
Other Multilingual Latin charsets with euro support such as 858 or 912 is not existing to select from.
For verification with the tools used previously, chardet would identify an IBM850 transcoded file as windows1252 with 73% confidence (which is not correct) and "file -i" says it is unknown 8-bit