Stripping footnotes from Summary

Gary · August 16, 2016, 10:55pm

I’ve made use of {{ .Summary }} to render out either the auto generated X character summary or my manually split summary however it looks like footnote tags are still included even though the footnotes themselves are not, leaving unclickable [^1] in the summary.

This post mentions the same problem and that solving it may not be simple.

That said, what would be involved in stripping out the footnote tags e.g. [^1] when displaying .Summary?

The other alternative I’m considering is adding a post-summary = “stuff” to the frontmatter but I’d prefer to only use that in cases where I want a pretty different summary of a post due to the need for markup and duplication of content.

Is this something .Summary should handle, or should I suck it up and use replaceRE on Summary to remove the footnote tags?

bep · August 17, 2016, 10:49pm

This should be fixed – and I suspect https://github.com/spf13/hugo/pull/2303 does – I haven’t tested it, though.

nekr0z · December 5, 2022, 8:26pm

Seems to still be the problem in 2022.

jmooring · December 5, 2022, 9:05pm

I think you have to:

Split the summary manually using the  tag, OR
Define the summary in front matter, OR
Use replaceRE to strip the tags

https://gohugo.io/content-management/summaries/#summary-splitting-options

nekr0z · December 6, 2022, 4:56am

Yes, these options are there (although the <--!more--> tag wouldn’t help much in a case of a footnote in the first sentence), but this is a clear enough case that should, IMO, be considered a bug.

I mean, I don’t believe there’s a usecase where that rogue “1” at the end of a word in summary is actually desired. Stripping footnotes is what should be done by Hugo automatically when generating .Summary both by autosplitting and when <--!more--> is used. A frontmatter-defined summary might be an exception, but are there really any users that place a footnote there?

jmooring · December 6, 2022, 5:04am

Actually, thinking more about this, can you provide an example?

With automatic summaries, HTML tags are stripped.
https://github.com/gohugoio/hugo/issues/8910#issuecomment-903158600

nekr0z · December 6, 2022, 6:31am

Oh, the tags are stripped alright, which actually makes the case worse.

You see, a footnote^[1] is rendered as a link containing some text. In the case of Discourse it is replaced with some JS magic, but in Hugo, it would be just 1, wrapped in <a> and .

The Hugo automatic summary process (as well as with the <--!more--> tag) strips those tags. The resulting summary looks like:

You see, a footnote1 is rendered as a link containing…

And since that “1” there is no longer wrapped in any tags, there’s no good way to replaceRE it. Definitely a bug IMO.

like this one, yep. ↩︎

jmooring · December 6, 2022, 6:57am

I understand.

This markdown:

A footnote[^1] here. A superscript<sup>®</sup> after it.

Is rendered by Goldmark as:

<p>A footnote<sup id="fnref:1"><a href="#fn:1" class="footnote-ref" role="doc-noteref">1</a></sup> here. A superscript<sup>®</sup> after it.</p>

So the regex to strip it out is not simple.

There’s been discussion in the past related to using bluemonday to sanitize HTML, and it has a regex option. We might want to look at that before spending more time on what we have now, which is a little bit broken.

nekr0z · December 6, 2022, 7:36am

Yep, I wouldn’t want to try doing this with regex. Of course, in HTML terms the task is pretty simple: completely wipe out any node that has id prefixed with “fnref:” and anything inside it.

Of course, this needs to be done before the tags are stripped in the process of .Summary generation, which is AFAIK not something a user can currently do.

nekr0z · December 6, 2022, 7:37am

Do you want me to open a GitHub issue to document this?

jmooring · December 6, 2022, 7:41am

Sure.

nekr0z · December 6, 2022, 7:55am

github.com/gohugoio/hugo

Footnote markers are not stripped when autogenerating `.Summary`

opened 07:54AM - 06 Dec 22 UTC

nekr0z

Bug NeedsTriage

When generating `.Summary`, either automatically or by taking the content above …the `<--!more-->` tag, Hugo strips the tags from HTML, leaving just the text. This doesn't play well with Hugo-generated footnotes. This markdown: ```md A footnote[^1] here. ``` Produces this HTML when rendered with Goldmark (Hugo's default renderer): ```html A footnote<a href="#fn:1" class="footnote-ref" role="doc-noteref">1</a> here. ``` Which, after stripping the tags, produces this `.Summary`: A footnote1 here. and leaves user with no good way to get rid of that rogue "1" with `replaceRE` or other available instruments. There seems to be no use-case such that leaving the "1" in the `.Summary` would be the desired behavior. Hugo should delete every HTML node that has `id` prefixed with `fnref:`, and all inside it, before stripping tags when generating `.Summary`. (from [this](https://discourse.gohugo.io/t/stripping-footnotes-from-summary/3923/1) forum thread) ### What version of Hugo are you using (`hugo version`)? Current `master` branch. ### Does this issue reproduce with the latest release? Yes.

jmooring · December 6, 2022, 7:57am

In the meantime…

{{ .RawContent | replaceRE `\[\^\d+\]` "" | .Page.RenderString | plainify | truncate 250 }}

nekr0z · December 6, 2022, 8:02am

Yeah, who needs those .Summary crutches anyway!

chimpden · April 1, 2025, 5:48pm

The Github issue 10503 above was closed, so adding this here.

I’m using this instead of {{ .Summary }} to strip the footnote from summaries:

{{ replaceRE `<sup id="fnref:"*.+</sup>` "" .Summary | safeHTML }}

It matches substrings beginning with , and replaces it with nothing.

For the example above, this regex should result in this:

<p>A footnote here. A superscript<sup>®</sup> after it.</p>

irkode · April 1, 2025, 8:28pm

your regex does work the way you wrote. In fact it matches

<sup id="fnref: - text
"* - any number of double quotes (even zero)
.+ - sequence af characters (at least one, longest match)
 - text

which will slurp in all between the start and end and return A footnoteafter it.,
also text between first and second footnote.

try it out: Summary footnote - wrong

irkode · April 1, 2025, 8:40pm

Guess in case:

the footnote identifier is a number
the footnote (incl. end tag) is completely available in the summary

You want that one:  which will match:

<sup id="fnref: - text
\d+ - a number of digits
" - a double quote
.*? - any number of characters (even zero, shortest match)
 - text

try it out: Hugo summary regex (working)

ps. and I remember there where lot’s of changes regarding summaries - so maybe … it’s not relevant anymore.

maybe a candidate to archive

chimpden · April 2, 2025, 2:55am

Thank you for the correction. It “worked” on my site (single footnote instances within summaries) but I clearly didn’t test it on the example string above.

If anybody plans to use that regex, I don’t believe it’ll work if you’re using named footnotes[^like-this], but this should work.[^1]

Topic		Replies	Views
Issue with footnotes in posts in Hugo/Ananke support footnotes	5	102	October 2, 2024
Fixing footnotes in summaries support footnotes	0	342	November 30, 2019
Summary Stripping Tags support	5	1350	September 23, 2019
How to make footnote links absolute? support footnotes	8	2526	August 31, 2018
Summary with <!--more--> not functioning as expected support	1	432	March 11, 2018

Stripping footnotes from Summary

Related topics