the default onsiteURL regex is not safe, if the url starts with '//', the url can jump out of the origin domain #48

onemoreflag · 2020-07-11T09:10:11Z

the default onsiteURL regex:
<regexp name="onsiteURL" value="^(?![\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=/!]*(&colon))[\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=/!]*"/>

usually，rich text requires the href attribute and the validation rule like this:

<attribute name="href">
			<regexp-list>
				<regexp name="onsiteURL" />
			</regexp-list>
</attribute>

so if developer trust the onsiteURL regex, they will not do any other domain validate, but the onsiteURL regex can bypass by '//' like '//evali.com?params', In this case, phishing attacks may occur. In addition, information leakage may occur due to such as dangling markup attacks.

The text was updated successfully, but these errors were encountered:

davewichers · 2020-07-13T21:26:05Z

I think I understand your concern, but to be super clear can you provide a pull request to AntiSamyTest.java with a failing test case that demonstrates your issue? And a suggested change to the default onsiteURL regex that fixes it, if you have a suggestion?

onemoreflag · 2020-07-14T01:03:57Z

I think I understand your concern, but to be super clear can you provide a pull request to AntiSamyTest.java with a failing test case that demonstrates your issue? And a suggested change to the default onsiteURL regex that fixes it, if you have a suggestion?

failing test case use the 'antisamy-myspace.xml' policy file:
String dirtyInput = "<a href=\"//kkk.com/stealinfo?a=xxx&b=xxx\"><span style=\"color:red;font-size:100px\">You must click me</span></a><portal src=\"blah";
output: <a href="//kkk.com/stealinfo?a=xxx&b=xxx"><span style="color: red;font-size: 100.0px;">You must click me</span></a>

and maybe this regex can stop someone use '//' jump out the origin domain:
^(/(?![\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=/!]*(&colon))[\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=!]+)+

but i cant guarantee that this regex will satisfy all requirements maybe you'd better do more test. (*￣︶￣)

gurshafriri · 2020-07-27T06:24:13Z

👋 @davewichers does this seems like a request forgery vulnerability to you? if so, we (at snyk) would like to add it to our vulnerability database. let me know if you have any further context that would make this not a security issue.

davewichers · 2020-07-27T18:42:59Z

@gurshafriri - We simply have not had time to research. Based on the fact that most 'security issues' people report turn out to be false positives, I'd suggest holding off for now. We'll get back to you when we know more.

nahsra · 2020-08-13T15:09:29Z

There's no risk of XSS here, but it is a bypass of the regex to provide a non-host URL, so I think we can can all agree that ideally it should be fixed.

Can you elaborate on what you see as an information leakage risk? I'm not seeing how the attacker can leak anything back to themselves except their own payload URL.

a few unused variables in an existing test case.

onemoreflag · 2020-08-18T01:16:27Z

information leakage may occur due to such as dangling markup attacks：https://portswigger.net/web-security/cross-site-scripting/dangling-markup
but it depends on browser version and so on

onemoreflag · 2020-08-18T01:23:07Z

now i can see that there exists a redirect leak. An attacker may successfully launch a phishing scam and steal user credentials by inject the url.
https://cwe.mitre.org/data/definitions/601.html

davewichers · 2020-11-04T17:50:06Z

@spassarop - If the new AntiSamy .NET you are working on is using the same regex, then this issue will affect it too. Do you have the time/background to suggest a fix for this issue? Ideally, we'd fix it here first, and you'd use the same solution in your project before it is ever released.

spassarop · 2020-11-05T04:07:31Z

Regex is the same, so yes, it should be fixed on both. I'll see this in the following days with the provided links and try to come up with a solution.

spassarop · 2020-11-06T02:16:17Z

Well, after giving this some thought, from scratch, this is my opinion on the issue (open to comments):

Risks

As @nahsra said, there is no apparent XSS risk as no characters to break from the attribute are allowed.

I dug up on the dangling markup attack (I even did one of the PortSwigger labs) to understand how this can be exploited. And to support the previous statement, there must be XSS to exploit this as such, which is not the case and no HTML will be "absorbed" by the query string by trying to leave the attribute open.

On the other side, there is indeed a risk because of potential phishing attacks (as @onemoreflag said) as there is no way of checking by regex that the domain is the same. No information stolen on the parameters, but unsolicited "redirect" to an external site anyway.

Potential solution

I'll break the regex down so I can explain better:

In Java, the regex ends up like this:
^(?![\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=\/!]*(&colon))[\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=\/!]*
Which can be seen as this:
^(?![ALLOWED_CHARS]*(&colon))[ALLOWED_CHARS]*

Where at the beginning we have the (?!MORE_REGEX) which means negative lookahead, looking that the sub-regex inside does not match. Regardless of it's content, we know it cannot assure that the beginning of the attribute does not have // on it. As at the end of that negative lookahead group there is &colon, we cannot add a restriction for // inside that (it will only be a valid restriction if the text has &colon at the end).

The current regex does not consider spaces of any kind at the beginning, so if we are still on that path, we can safely add the restriction right there, using ^(?!//) and getting the final regex:
^(?!//)(?![\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=\/!]*(&colon))[\p{L}\p{N}\\\.\#@\$%\+&;\-_~,\?=\/!]*

That regex passes the test and it seems to me that the initial behavior is preserved. Of course, users cannot use //URL even if it is on the same domain, but that was known from the start. The fix should be adding ^(?!//) on every onsiteURL definition. If this gets approved, I'll add it on AntiSamy .NET.

davewichers · 2020-11-06T15:21:09Z

@spassarop Thanks for jumping on this so quick, and doing this analysis!

@nahsra - can you review and if you are OK with this, we'll get this fix implemented.

@onemoreflag - Given you raised this issue, what are your thoughts on this proposed fix?

nahsra · 2020-11-06T15:34:55Z

Makes sense to me! The tests will give us the assurance we need as well.

davewichers · 2020-11-11T18:19:14Z

@spassarop - Any idea when you might submit a pull request with your proposed change and one or more test cases that fail without the change, but then pass after the change? We are trying to get a release out by end of Nov. as OWASP ESAPI wants to upgrade to a newer AntiSamy that includes the dependency updates that are in the 1.5.11 branch.

spassarop · 2020-11-11T18:25:01Z

@davewichers - I was waiting for @onemoreflag to state an opinion. Also I didn't know I you wanted me to do make the change and a PR, because I saw you added the test. I think I can do this later today on the same, just tell me on which branch should I work on.

davewichers · 2020-11-12T16:54:47Z

@spassarop - If you could submit a pull request for this to the branch, that would be great. As to 'the test' I don't recall doing that. But if it was added to master, can you include it in your pull request to the branch?

spassarop · 2020-11-12T21:18:06Z

@davewichers - I don't really know where it was pushed, this is the commit a526dd1, I've found it above. I'm doing a PR with the change and the test.

spassarop · 2020-11-12T23:00:13Z

I've also added two tests that didn't fail before, just to make clear that dangling markup attack will fail even when escaping from the tag attribute as the resulting URL (which absorbs and steals the HTML after the URL) is not a valid one.

One test tries to steal more tags (more probable for the attack to fail bypassing AntiSamy) and the other tries to steal just the following tag attribute (less probable to fail, but it does anyway :D).

davewichers · 2020-11-13T17:39:51Z

This is now fixed in the 1.5.11 branch and will be included in the 1.5.11 release, which should come out sometime this month.

davewichers · 2020-11-24T16:45:58Z

The commit that fixes this e59963e, has been merged into master and will be included in the release going out today.

nahsra added the security label Aug 13, 2020

davewichers added a commit that referenced this issue Aug 13, 2020

Add initial test case for github issue #48, and also comment out

a526dd1

a few unused variables in an existing test case.

spassarop mentioned this issue Nov 12, 2020

Improve onsiteURL regex to prevent jumping out the origin domain #57

Merged

davewichers closed this as completed Nov 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the default onsiteURL regex is not safe, if the url starts with '//', the url can jump out of the origin domain #48

the default onsiteURL regex is not safe, if the url starts with '//', the url can jump out of the origin domain #48

onemoreflag commented Jul 11, 2020 •

edited

Loading

davewichers commented Jul 13, 2020

onemoreflag commented Jul 14, 2020 •

edited

Loading

gurshafriri commented Jul 27, 2020

davewichers commented Jul 27, 2020

nahsra commented Aug 13, 2020

onemoreflag commented Aug 18, 2020

onemoreflag commented Aug 18, 2020

davewichers commented Nov 4, 2020

spassarop commented Nov 5, 2020

spassarop commented Nov 6, 2020 •

edited

Loading

davewichers commented Nov 6, 2020 •

edited

Loading

nahsra commented Nov 6, 2020

davewichers commented Nov 11, 2020

spassarop commented Nov 11, 2020

davewichers commented Nov 12, 2020

spassarop commented Nov 12, 2020

spassarop commented Nov 12, 2020

davewichers commented Nov 13, 2020

davewichers commented Nov 24, 2020

the default onsiteURL regex is not safe, if the url starts with '//', the url can jump out of the origin domain #48

the default onsiteURL regex is not safe, if the url starts with '//', the url can jump out of the origin domain #48

Comments

onemoreflag commented Jul 11, 2020 • edited Loading

davewichers commented Jul 13, 2020

onemoreflag commented Jul 14, 2020 • edited Loading

gurshafriri commented Jul 27, 2020

davewichers commented Jul 27, 2020

nahsra commented Aug 13, 2020

onemoreflag commented Aug 18, 2020

onemoreflag commented Aug 18, 2020

davewichers commented Nov 4, 2020

spassarop commented Nov 5, 2020

spassarop commented Nov 6, 2020 • edited Loading

Risks

Potential solution

davewichers commented Nov 6, 2020 • edited Loading

nahsra commented Nov 6, 2020

davewichers commented Nov 11, 2020

spassarop commented Nov 11, 2020

davewichers commented Nov 12, 2020

spassarop commented Nov 12, 2020

spassarop commented Nov 12, 2020

davewichers commented Nov 13, 2020

davewichers commented Nov 24, 2020

onemoreflag commented Jul 11, 2020 •

edited

Loading

onemoreflag commented Jul 14, 2020 •

edited

Loading

spassarop commented Nov 6, 2020 •

edited

Loading

davewichers commented Nov 6, 2020 •

edited

Loading