Further XSS fixes in link attrs (#703) by Crozzers · Pull Request #705 · trentm/python-markdown2

Crozzers · 2026-05-14T21:44:12Z

Further fixes for #703, specifically the follow-up issues raised in this comment: #703 (comment)

Issue 1:
HTML-encoded colons function as normal colons in hrefs (browsers decode character references in attribute values before parsing the URL), so javascript&colon;alert() is equivalent to javascript:alert().

Fixed this by checking for these encoded sequences alongside literal colons.
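
To illustrate the idea behind Issue 1, here is a minimal sketch that decodes character references before looking for a script scheme. It is not the check used in markdown2; the names `SCRIPT_SCHEMES` and `has_script_scheme` are hypothetical, and the list of schemes is an assumption.

```python
import html
import re

# Hypothetical list of schemes to refuse; an assumption for this sketch.
SCRIPT_SCHEMES = ("javascript:", "vbscript:", "data:")


def has_script_scheme(href: str) -> bool:
    """Return True if href resolves to a script scheme once entities are decoded."""
    # html.unescape handles named (&colon;), decimal (&#58;) and hex (&#x3a;) forms.
    decoded = html.unescape(href)
    # Conservatively drop ASCII whitespace and control characters before comparing.
    compact = re.sub(r"[\x00-\x20]+", "", decoded).lower()
    return compact.startswith(SCRIPT_SCHEMES)
```

Decoding first means the encoded and literal forms of the colon go through the same comparison, rather than needing one pattern per encoding.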

Issue 2:
The safe_href regex we use allows for URL domains with ports. This was intended for links like localhost:880/abcdef but could be abused by making the JS look like a domain with a port, like so:

javascript:1/alert();
^ domain  ^ port
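
The ambiguity in Issue 2 can be sketched as follows. The comment does not say how it was fixed, so this is only one possible approach under stated assumptions: `HOST_PORT`, `DANGEROUS_SCHEMES`, and `is_host_with_port` are hypothetical names, not markdown2's `safe_href` regex.

```python
import re

# Hypothetical names of schemes that must never be accepted as a "host".
DANGEROUS_SCHEMES = {"javascript", "vbscript", "data"}

# host:port with an optional path, e.g. localhost:880/abcdef
HOST_PORT = re.compile(r"^(?P<host>[A-Za-z0-9.-]+):(?P<port>\d{1,5})(?:/.*)?$")


def is_host_with_port(href: str) -> bool:
    """Accept host:port links, but not script schemes shaped like them."""
    match = HOST_PORT.match(href)
    if match is None:
        return False
    # "javascript:1/alert();" matches the pattern, so the host must be checked too.
    return match.group("host").lower() not in DANGEROUS_SCHEMES
```

The pattern alone cannot tell `scheme:path` apart from `host:port/path`, which is why the sketch adds an explicit check on the host part.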

Issue 3:
This used some quirky markdown in an image title attr to escape the link. Fixed by hashing the title attr to prevent reprocessing, much like we do with alt=.
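
As a rough illustration of hashing an attribute to prevent reprocessing, the sketch below swaps text for an opaque key and restores it after all other passes have run. The `Placeholders` class and its key format are hypothetical and do not reflect markdown2's internals.

```python
import hashlib


class Placeholders:
    """Swap sensitive text for opaque keys that later passes will not touch."""

    def __init__(self) -> None:
        self._table: dict[str, str] = {}

    def protect(self, text: str) -> str:
        # The key holds only hex digits and a prefix, so it contains no markdown syntax.
        key = "md5-" + hashlib.md5(text.encode("utf-8")).hexdigest()
        self._table[key] = text
        return key

    def restore(self, text: str) -> str:
        # Put the original text back once processing is finished.
        for key, original in self._table.items():
            text = text.replace(key, original)
        return text
```

For example, a title such as `a *b* c` would be stored under its key while emphasis processing runs, then restored verbatim at the end.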

JorianWoltjer · 2026-05-15T17:31:23Z

Nice work, unfortunately the fuzzer found another bypass on this new branch 😅

Input:

![](`<A B="
" onerror="alert(origin)">`)

Output:

<p><img src="code&gt;&lt;A B="
" onerror="alert(origin)"&gt;&lt;/code" alt="" /></p>

Crozzers · 2026-05-23T09:59:54Z

Managed to fix this. It turns out we were hashing the code and HTML spans and were meant to unhash them again during URL encoding, but the while loop was incorrect and didn't recursively unhash everything.
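
The loop bug can be reproduced in isolation. This is a minimal sketch, assuming placeholders live in a plain dict; `unhash_buggy` and `unhash_all` are hypothetical names rather than functions from markdown2.

```python
def unhash_buggy(text: str, table: dict[str, str]) -> str:
    """Buggy shape: orig is updated at the END, so the loop stops after one pass."""
    orig = None
    while orig != text:
        for key, value in table.items():
            text = text.replace(key, value)
        orig = text  # orig now always equals text, so the condition fails
    return text


def unhash_all(text: str, table: dict[str, str]) -> str:
    """Fixed shape: snapshot BEFORE transforming, stop when a pass changes nothing."""
    while True:
        orig = text
        for key, value in table.items():
            text = text.replace(key, value)
        if text == orig:
            return text


# Nested placeholders: K1 expands to text that itself contains K2.
table = {"K2": "inner", "K1": "outer(K2)"}
print(unhash_buggy("K1", table))  # leaves K2 unexpanded
print(unhash_all("K1", table))    # expands both levels
```

With the assignment at the end of the loop body, the comparison can never see a difference, so nested placeholders survive into the output.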

JorianWoltjer · 2026-05-23T11:21:52Z

Another bypass 😅

Input:

![x](<"`"![x][id]
[id]: x "<A B="" onerror="alert(origin)">`

Output:

<p>![x](&lt;"`"<img src="x &quot;&lt;A B="" onerror="alert(origin)"&gt;`" alt="x" /></p>

Crozzers · 2026-05-24T09:03:44Z

Another fix. That one was smuggling the XSS through link definitions, so I've changed the _protect_url function to do all of the escaping (unhashing code and HTML spans, escaping bold/em). LinkProcessor should now run all image and anchor URLs through it.
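
The design choice here, a single choke point that every URL passes through, can be sketched like this. It is an assumption-laden illustration: `protect_url`, `render_anchor`, and `render_image` are hypothetical stand-ins, the unhashing step is omitted, and the percent-encoding of `*` and `_` is one possible way to neutralise emphasis characters, not necessarily what `_protect_url` does.

```python
import html


def protect_url(url: str) -> str:
    """Single place where URLs are made safe for use in an attribute."""
    # Escape &, <, > and quotes so the value cannot break out of the attribute.
    url = html.escape(url, quote=True)
    # Percent-encode emphasis characters so later passes do not treat them as bold/em.
    return url.replace("*", "%2A").replace("_", "%5F")


def render_anchor(href: str, text: str) -> str:
    return f'<a href="{protect_url(href)}">{html.escape(text)}</a>'


def render_image(src: str, alt: str) -> str:
    return f'<img src="{protect_url(src)}" alt="{html.escape(alt)}" />'
```

Because inline links, images, and reference-style definitions all call the same function, a URL that arrives via a link definition gets the same treatment as one written inline.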

JorianWoltjer · 2026-05-24T17:16:47Z

Took a few minutes more than usual this time, but the fuzzer pulled through again on the latest commit:

Input:

- [x]
   1. - [x]
___
[x](`")}<img src="x`" onerror="alert(origin)">
___

Output:

<ul>
<li>[x]
<ol>
<li><ul>
<li>[x]</li>
</ul></li>

</ol></li>
</ul>

<hr />

[x](`")}<img src="x`" onerror="alert(origin)">

<hr />

Crozzers added 3 commits May 14, 2026 22:23

Fix XSS from HTML encoded colons in hrefs

3b96ec1

Fix XSS from making javascript: hrefs look like domains with ports

a11ce82

Fix onerror XSS in image title attr

82b4482

Crozzers marked this pull request as draft May 16, 2026 11:16

Crozzers added 2 commits May 23, 2026 10:51

Fix incomplete recursive unhashing of spans

456f8a9

The issue was the while loop comparison. We checked `orig != text` but assigned `orig = text` at the end of the loop, when it should have been assigned at the start, before any transformations take place.

Update github actions versions

c173c12

Crozzers marked this pull request as ready for review May 23, 2026 09:58

Fix smuggling XSS into link def URLs

b0dd0b3

Crozzers wants to merge 6 commits into trentm:master from Crozzers:further-xss-fixes
