Decode UTF-8 characters from JSON file

mxmehl · September 12, 2017, 2:47pm

I have some JSON data with UTF-8 encoded sequences which look like:

[{
		"name": "Bj\u00c3\u00b6rn XYZe\u00c3\u009fle",
		"comment": "text & text"
	},
	{
		"name": "Simona XYZ",
		"comment": "bla citizens\u00e2\u0080\u0099 autonomy and text."
	}
]

In Hugo, I show this data with just {{ .name }}.

The problem: It doesn’t recode these UTF-8 sequences but shows them like this:

BjÃ¶rn XYZieÃle
bla citizensâ autonomy and text

Obviously, I lack a decoding function. I already tried safeHTML, safeJS or htmlUnescape. What can I do except changing the input data?

Jura · September 16, 2017, 1:49pm

I’d use markdownify:

{{ "bla citizens\u00e2\u0080\u0099 autonomy and text." | markdownify }}

Returns for me:

bla citizensâ autonomy and text.

(Don’t know if that are the right characters, but they are at least not UTF-8 anymore. )

mxmehl · September 17, 2017, 10:20am

No, they’re not. The correct character would have been the backtick-apostrophe

As written above, it was an encoding error on the input side. However, maybe it would have been possible to decode this with Hugo or some additional Go magic.

Topic		Replies	Views
Confusion with json htmlescaped and utf-8 support	1	589	September 25, 2017
How do I preserve linebreaks when using getJSON? support	2	822	June 11, 2016
[SOLVED] Json data generated from PowerShell could not be read from Hugo support	3	3351	July 25, 2018
Prolem outputting special characters in JSON support	8	1172	May 27, 2020
Problems with encoding special characters in json+ld ("invalid escape sequence") support	17	13714	July 17, 2019

Decode UTF-8 characters from JSON file

Related topics