Who should I send Delete activities to?

SorteKanin · March 24, 2025, 9:35pm

How to handle incoming Delete activities is already well-discussed elsewhere [1][2].

However, I have a hard time finding material on how to handle outgoing Delete activities. Specifically around this question: Who do I send such an activity to?

Naively, you might send the activity to the actor’s followers. But imagine this simple scenario:

Alice follows Bob.
Bob posts a Note that Alice receives.
Alice stops following Bob.
Bob deletes the Note.

If we only send Delete activities to the followers of the actor, Alice will never receive the Delete activity and the Note will not be deleted, which is clearly not what Bob intended.

Another strategy might be to keep a record of all historical followers of an actor, and then send Delete activities to current and past followers. However, I am worried that this is not good enough either:

Alice follows Bob and Bob follows Charlie.
Charlie posts a Note that Bob receives.
Bob shares (Announces) the Note, and Alice receives the Note too because she follows Bob.
Alice stops following Bob and Bob stops following Charlie.
Charlie deletes the Note.

I am a bit dumbfounded at what to do in this scenario. I might be missing something, but it seems impossible for Charlie to know that Alice has the Note. The only way I could possibly see this working is that Bob receives the Delete activity (as he is a past follower) and then graciously re-Announces the Delete activity to his followers, since he previously announced the deleted Note. Then Alice would also receive the Delete, as the re-Announce of the Delete would also need to be sent to all past followers.

However this seems complicated and relies on the good behaviour of Bob to re-Announce to his past followers, and I see no way to discover if Bob never re-Announced the deletion.

Another option is to send Delete activities to all known servers, but this doesn’t necessarily include Alice either I would think.

How is this done in existing implementations? This is making me think reliable deletions are impossible and I’d love to be proved wrong.

trwnh · March 24, 2025, 11:16pm

It is impossible, yes. You never know who has a copy, as people can fetch public things without authentication, but what you can do is use “best effort” delivery paths. By default, this is “every known actor”. If you maintain a little information ahead of time, such as disabling public access and enforcing some kind of authentication on fetch plus tracking of delivery recipients, you can reduce your total set from the entire known universe.

But this is assuming a certain worldview where everyone else is an “instance” that will syndicate a copy status on Create activities, which might not be the case if your Create is consumed as a regular notification. And of course people can ignore Delete activities even if they do have a copy, and so on. So this is very much “best effort” with an emphasis on “effort”. At best, you can only strive to notify of a deletion to anyone who was aware of a creation or its byproduct, assuming you track those. Otherwise, you just blast it out there for everyone, or accept that old copies may be floating around and send only to your followers, or some other heuristic or strategy.

SorteKanin · March 24, 2025, 11:22pm

I don’t think the problem goes away even if we consider non-public activities though. But it makes it easier perhaps.

Do you have any idea how established implementations tackle this? What you suggest with keeping track of what actors have fetched what posts seems cumbersome and complicated. I guess I may just settle for sending to all known instances, even if that seems awfully excessive.

trwnh · March 25, 2025, 12:53am

If you consider private activities only, then it is only an issue if they can be forwarded.

Mastodon et al’s “best effort” paths typically go to all known actors/servers, since they don’t track fetches. There’s a PR to add a new AUTHORIZED_FETCH=actors env var that starts tracking fetches in order to limit Deletes from going to everyone, but the PR has stalled for 2+ years: Use bloom filters to limit what servers account deletion notices are sent to by ClearlyClaire · Pull Request #22273 · mastodon/mastodon · GitHub

Pleroma/Akkoma I think don’t bother sending Deletes to everyone, they just accept the impossibility. IIRC there might be a patch floating around somewhere to do some kind of tracking on signed fetches, but it’s not part of mainline.

Misskey logic is here: misskey/packages/backend/src/core/NoteDeleteService.ts at 26b2cfe51877575631b4aa73b353cf7f415d6089 · misskey-dev/misskey · GitHub

followers
relays
mentioned users
anyone who boosted it
anyone who liked it

nightpool · March 25, 2025, 5:19am

A no op delete is generally a very inexpensive activity, so I don’t see a problem with federating it promiscuously. obviously, it does give away a little bit of metadata information such as the ID of an activity the remote server may not know about, but for public activities, the trade-off is generally worth it and servers could consider sending decoy delete activities if privacy is super important

trwnh · March 25, 2025, 8:12am

it might be “inexpensive” on its own, but it spams inboxes, fills databases, consumes federation workers, and represents significant noise for smaller or lesser-powered servers. it generally doesn’t make sense unless you assume everyone else is replicating some state machine. probably the logic that misskey uses is good enough, and sending Deletes to the entirety of the known universe is extraneous.

silverpill · March 25, 2025, 9:12am

I think delivering Delete to all actors who interacted with the object is optimal (=everyone who liked, reacted, replied or reposted). Same for Update activities.

SorteKanin · March 25, 2025, 3:06pm

Using a bloom filter is an interesting approach. But to @nightpool’s point, not sure it makes too much sense as an optimization. I imagine Delete activities already account for a very small percentage of the total activities sent, so sending a few too many probably doesn’t have a big impact.

Thanks for the discussion, this has been helpful.

nightpool · March 25, 2025, 4:10pm

how could a delete activity “fill” a database, except through implementation error? surely a delete activity should only ever have the ability to empty a database.

as for spamming inboxes and consuming federation workers, I believe I’ve already addressed that. Delete activities are rare and—if you don’t have any responsive content—extremely inexpensive.

SorteKanin · March 25, 2025, 4:21pm

There are some implementations that (perhaps unwisely? Who am I to judge) keep a record, or at least a cache, of all the incoming (and maybe outgoing) activities they receive. Or the “database” here could be a queue for processing incoming activities perhaps, where the delete would fill the queue.

Again, don’t think it’s a big concern but I can definitely see how even delete activities can cost storage and/or memory.

trwnh · March 25, 2025, 4:29pm

to reiterate, the idea that a Delete only ever “empties” a database is rooted in the idea that a Delete is a transient activity that is consumed upon receipt and handling of side effects. if you give a Delete an id, then it may reasonably be persisted. this may be done for several reasons, particularly for implementations that treat the inbox as an inbox and not as an RPC interface – the Delete is a notification message just like any other, and messages in an inbox shouldn’t generally randomly disappear. i think the case where an inbox is only ever read by a single client is common for fedi, but absolutely not a guarantee by the ActivityPub specification.

silverpill · March 25, 2025, 4:33pm

On smaller servers the majority of incoming activities are Delete, because Mastodon is flooding the network with them. Everyone hates it, but there is no way to stop it. I have inboxes that were deleted more than 3 years ago, and Mastodon servers still send thousands of Delete activities there every hour.

And this is only Delete(Actor) activities.

nightpool · May 22, 2025, 7:42pm

Well, have you sent those servers a Delete activity letting them know that your inbox doesn’t exist anymore?

SorteKanin · May 22, 2025, 10:09pm

In fairness, if you have sent activities to an inbox for 3 years and not gotten a successful HTTP status back (<400) in all that time, it’s probably time to consider it dead on your end.

But of course the other end should try its best to inform you as well. But you can’t always rely on this so you need the above logic anyway.

trwnh · May 24, 2025, 10:09am

How would anyone know to send such an Activity, and how should anyone know what shape it should have? Receiving a POST at a URI might let you infer it used to be an inbox, but it doesn’t let you know which actor used to declare that inbox, and sending a Delete(inbox Collection) activity will not be effective. Shared inbox also tends to make this worse, since you have even less information. In most cases regardless of the inbox, your only information is that activities are addressed to something that is ostensibly someone’s follower collection; you might be able to verify that an addressee == actor.followers.id, but you have no guaranteed way to know the contents of such a collection.

It unequivocally falls on the sending server (Mastodon or otherwise) to stop sending activities to URIs that continually report failures for an extended period of time.

silverpill1 · May 24, 2025, 6:13pm

I think Mastodon respecting 410 Gone would be the cleanest solution: https://github.com/mastodon/mastodon/issues/33290#issuecomment-2602875388

@proto-s2s