Content parsing based on set of rules

alexeystar · August 20, 2023, 7:35pm

I would like to implement custom content parsing rules for advanced typography and symbols: all content that matches a set of predefined patterns should be replaced automatically, for instance:

turn " - " into " — "
turn "CMD" into "⌘"
turn "..." into "…"
etc.

What is the best approach to implement such functionality without breaking the source code?
I would love to create an advanced typography module and share it with the community. Any guidance is appreciated.

Thanks!

Georg · August 21, 2023, 7:10am

Hugo includes the typographer extension, which replaces "---" with the UTF-8 em dash and three plain dots with the UTF-8 ellipsis. It can also be configured to replace straight quotes with the curly quotes of a specific language (English by default).

It’s also possible to use the function replaceRE on the rendered content.

Please note that all UTF-8 characters can be used in Markdown directly. The substitution of ASCII sequences is only a convenience for the most frequent ones.
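
For rules the typographer does not cover, such as the CMD example, the replacements from the question can be written as a small table of literal pairs. The following is a minimal Go sketch of that idea, not Hugo's implementation; the helper name `applyRules` and the rule list are hypothetical and only serve as an illustration.

```go
package main

import (
	"fmt"
	"strings"
)

// rules holds old/new pairs taken from the question; the list is illustrative.
// strings.NewReplacer scans the input once and, at each position, tries the
// old strings in the order they are listed here.
var rules = strings.NewReplacer(
	" - ", " — ",
	"CMD", "⌘",
	"...", "…",
)

// applyRules is a hypothetical helper that applies every literal rule in one pass.
func applyRules(s string) string {
	return rules.Replace(s)
}

func main() {
	fmt.Println(applyRules("Press CMD - then wait...")) // Press ⌘ — then wait…
}
```

Literal rules like these are safest on plain text. Applied to rendered HTML, they would also change matches inside tags and attribute values.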

alexeystar · August 21, 2023, 12:03pm

Thank you for the quick help, Georg! The typographer extension is exactly what I needed.

I can’t wrap my head around using replaceRE, though. I tried {{ .Content | replaceRE CMD "$1" "⌘" }}, but it breaks the code. What confuses me: if the rendered content is going to be processed, doesn’t that mean the content would be processed a second time?

chrillek · August 21, 2023, 12:22pm

The parameters of replaceRE must be strings, which CMD is not. And you don’t even need REs here, as you’re simply replacing one string with another.
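
Hugo is written in Go and, as far as I know, replaceRE is built on Go's `regexp` package, so the difference can be sketched in plain Go. This is an illustration under that assumption, not Hugo's own code; the function names and sample sentences are invented. It shows that a literal replacement is enough for CMD, and that "$1" only has a meaning when the pattern contains a capture group.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// literal swaps one fixed string for another; no regular expression is involved.
func literal(s string) string {
	return strings.ReplaceAll(s, "CMD", "⌘")
}

// cmdKey matches CMD only as a whole word, so "CMDLINE" stays untouched.
// Note that the pattern is a quoted string.
var cmdKey = regexp.MustCompile(`\bCMD\b`)

func withRegexp(s string) string {
	return cmdKey.ReplaceAllString(s, "⌘")
}

// cmdCombo captures the key name after "CMD+" in group 1.
var cmdCombo = regexp.MustCompile(`\bCMD\+(\w)`)

// withGroup shows what "$1" is for: it re-inserts the text matched by the
// first parenthesised group of the pattern.
func withGroup(s string) string {
	return cmdCombo.ReplaceAllString(s, "⌘$1")
}

func main() {
	fmt.Println(literal("Press CMD to copy"))     // Press ⌘ to copy
	fmt.Println(withRegexp("CMD and CMDLINE"))    // ⌘ and CMDLINE
	fmt.Println(withGroup("Press CMD+C to copy")) // Press ⌘C to copy
}
```

In a template, the quoted form would presumably look like {{ .Content | replaceRE "CMD" "⌘" | safeHTML }}, but please check the argument order against the Hugo documentation before relying on it.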

Georg · August 22, 2023, 9:30am

There is a collection of replacements for missing inline markup and a few typographic features in this module.

I’m using them for my upcoming theme, https://perplex.desider.at .

system · October 5, 2023, 8:44am

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.
