Skip to main content
Solved

XML Conversion Problem: Accented Characters (É) Result in � Symbol

  • January 25, 2024
  • 2 replies
  • 301 views

Forum|alt.badge.img

Greetings everyone.
I have a problem using a flat file connector, which I use to convert a fixed length .txt file to xml. the conversion is successful, but the characters present that have accents are converted with the symbol � in the resulting xml. Are there any settings or workarounds to avoid this?

address in the .txt file → 3 AVENUE DES SPÉLUGUES
address in the xml file → 3 AVENUE DES SP�LUGUES

Thanks in advance to anyone who can help me

Best answer by Piet Potappel

Usually this is the effect of conversion from ASCII to UNICODE.
You could use a more sophisticated editor like NOTEPAD++ to see what the new character has become.

This post might help you in Arc :-)
Replacing non-ASCII characters in a file | Community (cdata.com)

This topic has been closed for replies.

2 replies

  • Collaborator
  • Answer
  • January 25, 2024

Usually this is the effect of conversion from ASCII to UNICODE.
You could use a more sophisticated editor like NOTEPAD++ to see what the new character has become.

This post might help you in Arc :-)
Replacing non-ASCII characters in a file | Community (cdata.com)


Forum|alt.badge.img
  • Author
  • Apprentice
  • January 25, 2024

Thanks for the reply and advice.
I tried to really understand how this symbol is interpreted (and his real value) but this is also displayed to me by opening the xml generated with notepad++

For the moment I found the script you suggested useful, passing the txt file is "corrected" and the char "É" is converted into "E"

For the moment I can proceed like this but I would like to be able to save the exact information data as it is passed to me. I'm trying to understand if I can somehow extend and/or modify the suggested script.