UnifiedPush: Rate limiting issues with matrix.gateway.unifiedpush.org and up.schildi.chat #144

binwiederhier · 2022-02-14T18:19:54Z

On ntfy.sh, I'm seeing the IPs of matrix.gateway.unifiedpush.org and up.schildi.chat being heavily rate limited. We need to find a solution; otherwise message delivery will continue to be severely impacted.

karmanyaahm · 2022-02-14T18:50:46Z

could you add a temporary exception for those two IPs as we work out a solution? perhaps just an if ip!=xyz { apply rate limit }
It's not a proper solution but should solve the problem while we develop a proper solution

Also, thanks for pointing out this issue, has it been happening for long or has it just started now?

karmanyaahm · 2022-02-14T18:54:40Z

To restate the problem here on GitHub:

They're the default gateways for ntfy.sh (for fluffychat and schildichat respectively), so all users' messages are being forwarded through those two. Having an exception for those two is probably the simplest solution; however, it's unsustainable as more apps are supported. Even ignoring the gateway, a server with a large number of users (like matrix.org) would start seeing similar problems soon.

The best solution imho is per-endpoint rate limiting, but that can be bypassed by creating multiple endpoints. This is a problem that was missed earlier, since topics in ntfy are not the property of authenticated users (like in Gotify/Nextpush) but rather are created on message send.

binwiederhier · 2022-02-14T19:05:43Z

Also, thanks for pointing out this issue, has it been happening for long or has it just started now?

From the logs it looks like for 2 weeks or so.

could you add a temporary exception for those two IPs as we work out a solution?

I think that's what it'll be. A rate-limits-exclude-ips setting in the config or something like that. It's not all that unsustainable if I only ever have to add an app every couple of weeks. It's not like ntfy is gonna be the new Google soon.
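As a rough illustration of such an exclude list, here is a minimal Python sketch. It is not ntfy's implementation: the names `EXEMPT_NETWORKS`, `is_exempt` and `allow_request` are made up for this example, and the addresses are documentation placeholders rather than the real gateway IPs.

```python
import ipaddress

# Hypothetical exempt list, as it might be loaded from a config setting.
# These are RFC 5737 documentation ranges, NOT the real gateway addresses.
EXEMPT_NETWORKS = [
    ipaddress.ip_network(entry)
    for entry in ("192.0.2.10/32", "198.51.100.0/24")
]


def is_exempt(remote_addr: str) -> bool:
    """Return True if the remote address falls inside an exempt network."""
    ip = ipaddress.ip_address(remote_addr)
    return any(ip in network for network in EXEMPT_NETWORKS)


def allow_request(remote_addr: str, limiter_allows) -> bool:
    """Skip the per-IP limiter for exempt addresses, apply it to everyone else.

    limiter_allows is any callable taking the IP string and returning a bool;
    it stands in for the normal per-IP rate limiter.
    """
    if is_exempt(remote_addr):
        return True
    return limiter_allows(remote_addr)
```

The per-IP limiter is passed in as a callable so the exemption check stays separate from whatever limiter is actually in use.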

It'd be cool if the gateway could use some sort of credentials, then I could tie the rate limits to the "app" as opposed to the IP address. But that's likely not in the spec.

The best solution imho is per-endpoint rate limiting

As you have stated, that's not sufficient, because people could just create many many topics and circumvent the limits.

karmanyaahm · 2022-02-14T20:10:11Z

gateway could use some sort of credentials, then I could tie the rate limits to the "app"

The endpoint URL is that credential.

One possible way to use that could be to check ratelimit := perEndpoint[endpoint] if (endpoint has been recently subscribed to) else global. This then restricts topics based on the subscription rate limits.
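Spelled out as a small Python sketch, the selection rule could look something like this. The 12-hour window, the `last_subscribed` mapping and the function name are assumptions for illustration only; nothing here is taken from ntfy.

```python
# Hypothetical definition of "recently subscribed": within the last 12 hours.
RECENT_WINDOW_SECONDS = 12 * 3600


def limiter_key(endpoint, remote_ip, last_subscribed, now):
    """Pick which rate-limit bucket a publish request should count against.

    last_subscribed maps endpoint -> timestamp (seconds) of the most recent
    subscription seen for it. Endpoints with a recent subscriber get their own
    bucket; everything else falls back to the global per-IP bucket.
    """
    seen = last_subscribed.get(endpoint)
    if seen is not None and now - seen <= RECENT_WINDOW_SECONDS:
        return ("endpoint", endpoint)
    return ("ip", remote_ip)
```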

binwiederhier · 2022-02-14T20:32:26Z

The endpoint URL is that credential.

I don't fully follow. Can you elaborate on that?

One possible way to use that could be to check ..

Because endpoints can be freely chosen, I have to maintain a global per-IP limit, because there is no auth.

Rate limiting on a per-endpoint basis can be an additional measure to prevent individual users from being abusive in UP-land (e.g. 5 messages per 10s for each topic), but overall this doesn't work even if the volume on the individual endpoints is low.

Example: If you have 6 endpoints, each publishing 1 message every 10 seconds, you've already reached the global rate limit of 1 message per 10s per IP (that's what ntfy.sh is set to; with an initial burst of 60 messages).
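To make the arithmetic concrete, here is a small Python simulation of a token bucket using the numbers quoted above (burst of 60, one token back every 10s) with 6 endpoints each publishing once per 10s from the same IP. The `TokenBucket` class and `first_rejection` helper are illustrative and assume a plain token-bucket limiter; they are not ntfy code.

```python
class TokenBucket:
    """Simple token bucket: `burst` tokens max, one token per refill interval."""

    def __init__(self, burst, refill_interval):
        self.capacity = burst
        self.tokens = float(burst)
        self.refill_interval = refill_interval
        self.last = 0.0

    def allow(self, now):
        # Refill proportionally to the elapsed time, capped at the burst size.
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed / self.refill_interval)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


def first_rejection(endpoints=6, period=10.0, burst=60, refill_interval=10.0, horizon=3600.0):
    """Return the time (seconds) of the first rejected message, or None."""
    bucket = TokenBucket(burst, refill_interval)
    t = 0.0
    while t <= horizon:
        # Every endpoint publishes one message in each round, all from one IP.
        for _ in range(endpoints):
            if not bucket.allow(t):
                return t
        t += period
    return None
```

Under these assumptions the burst absorbs the excess for a while: the bucket loses 5 tokens net per round, so `first_rejection()` returns 110.0, and from then on only about one message per 10s gets through. With a single endpoint the bucket never runs dry.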

binwiederhier · 2022-02-14T21:11:29Z

Here's a working implementation that I could put live tonight: https://github.com/binwiederhier/ntfy/pull/145/files

I'll likely also increase the other rate limits a bit, and update the docs accordingly.

karmanyaahm · 2022-02-15T05:26:56Z

I don't fully follow. Can you elaborate on that?
Because endpoints can be freely chosen, I have to maintain a global per-IP limit, because there is no auth.

I don't know how viable these techniques would be, but these are just my thoughts:
Idea 1
The goal is to simulate a list of "valid" topics. Then, rate limits could be per-endpoint (rather than global) for such topics; these limits could be more liberal than the combined global limit (say, 1msg / 10s, same amount but per-endpoint). Non-"valid" topics continue to exist and are treated with a simple global limit.

There are various ways to judge topics as "valid". Auth is a simple one, but one that core ntfy lacks (and that's good for usability).
Another method more applicable to ntfy to judge "valid" topics could be based on subscriptions. If >1 user is subscribed to the topic for a while (say, >12h/day), that means the topic is probably not spam. And since the number of subscriptions per IP is limited, that basically shifts the burden of rate limiting from the sender to the subscriber.

However, this means a potential abuser can send a lot more messages with just 1 IP - 30 subs * 1msg / 10sec / sub, rather than just 1msg/10sec.
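A rough Python sketch of Idea 1 and of the worst case just mentioned. The function names, the shape of the input, and the `min_subscribers` default are assumptions made for this example; the 12h threshold, 30 subscriptions and 1msg/10s figures are the ones used in this comment.

```python
def is_valid_topic(connected_seconds_by_subscriber, min_seconds=12 * 3600, min_subscribers=1):
    """Treat a topic as "valid" if enough subscribers stayed connected long enough.

    connected_seconds_by_subscriber maps a subscriber id to the number of
    seconds it was subscribed to the topic during the last 24 hours.
    """
    long_lived = [
        seconds
        for seconds in connected_seconds_by_subscriber.values()
        if seconds > min_seconds
    ]
    return len(long_lived) >= min_subscribers


def worst_case_msgs_per_sec(subs_per_ip=30, per_topic_interval=10.0):
    """Messages per second one IP could push if every subscription it holds
    turns a topic "valid" and each valid topic gets its own 1msg/10s limit."""
    return subs_per_ip / per_topic_interval
```

With the defaults, `worst_case_msgs_per_sec()` gives 3.0 messages per second, compared to 0.1 under the plain 1msg/10s per-IP limit.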

Idea 2
Process incoming messages with a much higher rate limit (say, 10msg / s / IP), but add rate limits to the subscribers (say, 1msg/sec?), and just drop any messages over the limit from being sent to that subscriber. This will limit overuse/abuse of the server for any practical purpose (since sent messages can't be received), but still allows for potential DOS sending attacks.
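A minimal Python sketch of the subscriber-side half of Idea 2, where messages over the limit are simply dropped rather than queued. `SubscriberLimiter` and `fan_out` are hypothetical names, and a fixed minimum gap between deliveries stands in for whatever limiter would really be used.

```python
class SubscriberLimiter:
    """Allow at most one delivery per `min_interval` seconds per subscriber."""

    def __init__(self, min_interval=1.0):
        self.min_interval = min_interval
        self.last_delivery = {}  # subscriber id -> time of last delivery

    def should_deliver(self, subscriber_id, now):
        last = self.last_delivery.get(subscriber_id)
        if last is not None and now - last < self.min_interval:
            return False  # over the limit: drop, do not buffer
        self.last_delivery[subscriber_id] = now
        return True


def fan_out(messages, subscribers, limiter):
    """Deliver (timestamp, body) messages to each subscriber, dropping extras.

    Returns a dict mapping each subscriber to the bodies it actually received.
    """
    delivered = {subscriber: [] for subscriber in subscribers}
    for timestamp, body in messages:
        for subscriber in subscribers:
            if limiter.should_deliver(subscriber, timestamp):
                delivered[subscriber].append(body)
    return delivered
```

Because nothing is stored, dropped messages are lost for that subscriber, and the publishing side still has to accept and process every incoming request.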

Overall
This problem is pretty complicated, but the above two are the simplest things that seem like they would work within these constraints, without having to add exceptions.

Exceptions (along with user-agent logging, which I'll work on for common-proxies soon) are probably the only simple solution though and unless ntfy+UnifiedPush sees crazy scale, exceptions will be the most practical. I'm sure there are some other advanced ways; I'll do research on how webpush servers handle this.

binwiederhier · 2022-02-16T20:57:53Z

re idea 1:

Auth is a simple one, but one that core ntfy lacks

ntfy has auth now, and I've actually thought about adding configurable limits to users; so this would be in line with the future strategy. The proxies could implement basic auth; though I do not know how much they are "aware" of what they are talking to.
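As a sketch of what tying limits to a user instead of an IP could look like, here is a small Python example that picks the rate-limit bucket from a basic auth header. It is an assumption-laden illustration, not ntfy's code: `auth_limiter_key` is a made-up name, and the sketch assumes the credentials are verified elsewhere before the result is trusted.

```python
import base64
import binascii


def auth_limiter_key(headers, remote_ip):
    """Choose a rate-limit bucket: per user if basic auth is present, else per IP.

    NOTE: this only extracts the username. It does NOT check the password;
    a real server must authenticate the request first.
    """
    auth = headers.get("Authorization", "")
    scheme, _, payload = auth.partition(" ")
    if scheme.lower() == "basic" and payload:
        try:
            decoded = base64.b64decode(payload, validate=True).decode("utf-8")
        except (binascii.Error, UnicodeDecodeError):
            return ("ip", remote_ip)  # malformed header: fall back to the IP
        user, separator, _password = decoded.partition(":")
        if separator and user:
            return ("user", user)
    return ("ip", remote_ip)
```

A gateway that sends credentials would then share one bucket across all of its outgoing IPs, and unauthenticated publishers keep the existing per-IP behaviour.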

If >1 user is subscribed to the topic for a while (say, >12h/day), that means the topic is probably not spam

That seems like a can of worms to me. I'd really rather not...

re idea 2:

Process incoming messages with a much higher rate limit (say, 10msg / s / IP), but add rate limits to the subscribers (say, 1msg/sec?)

I like your thinking there, but you are right; I'd have to buffer these and store them and that would be impractical. Also, messages to Firebase or outgoing email, for instance, are not buffered at all; they're just forwarded as I get them. So this option is sadly not feasible at all.

I think the thing I implemented (exemption based on IP) is alright for now. I'd love it if we could add auth-based exemption or limits instead, but you'll have to answer how feasible that is for the proxies.

I'll close this ticket for now, since the problem is solved, but we can keep discussing here.

bmarty · 2024-05-07T14:12:02Z

Hello,
It seems that ntfy.sh is rate-limiting requests from matrix.org.
From the matrix.org log:

Failed to push data to @UserRedacted:matrix.org/im.vector.app.android/https://ntfy.sh/upRPREDACTEDWI?up=1: <class 'synapse.http.RequestTimedOutError'> 504: Timeout connecting to remote server

Is there anything we can do to fix this?
Thanks!

binwiederhier · 2024-05-08T01:31:00Z

@bmarty If you provide the hostnames and IP addresses of the publishers, I am happy to whitelist them.

bmarty · 2024-05-13T07:33:49Z

Thanks @binwiederhier, here is an official request: #1106

binwiederhier added the 🪲 bug and server labels Feb 14, 2022

binwiederhier changed the title ~~Rate limiting issues with matrix.gateway.unifiedpush.org and up.schildi.chat~~ UnifiedPush: Rate limiting issues with matrix.gateway.unifiedpush.org and up.schildi.chat Feb 14, 2022

binwiederhier added the unified-push label Feb 14, 2022

binwiederhier pushed a commit that referenced this issue Feb 14, 2022

Rate limit exemption; relates to #144

2ad0802

binwiederhier mentioned this issue Feb 14, 2022

WIP: Rate limit exemption #145

Merged

binwiederhier closed this as completed Feb 16, 2022

MayeulC mentioned this issue Jun 11, 2022

Consider Including a Matrix Gateway endpoint as part of ntfy #319

Closed

bmarty mentioned this issue May 7, 2024

Allow configuring push notification provider element-hq/element-x-android#2340

Closed

bmarty mentioned this issue May 13, 2024

Add matrix.org pushers IP adresses to the allow list, to avoid rate limiting. #1106

Open
