according to RFC standards, the non-compliant data will be replaced by unicode "unknown character" code points. r