Folksonomy for Filtration on Mastodon

December 01, 2022 17:12 +0000  |  Mastodon

Mastodon is Twitter's logical successor. Like Twitter, it's a "microblogging" platform that lets you follow and ~~retweet~~ "boost" posts you like for your followers to see. The key difference compared to its predecessor is the "federated" nature of the platform, which others have written about extensively, so for the uninitiated I'll just say that "it's distributed, so no central authority controls it."

What we don't talk about nearly as much, though, is Mastodon's painfully limited system for managing that federation at scale. When faced with the reality of dealing with unwanted content, the answer in the Mastodon community is "the instance moderator does that". The assumption is not only that the owner of the server you're using has the time/skill/inclination to sift through reports about your content, but also that this moderator shares your values.

Additionally, instances can be "de-federated" from the greater network by moderators of other (potentially huge) instances for not blocking content those networks consider objectionable. It all sounds like a neat and tidy way to keep the baddies out, but it's also a recipe for an echo chamber.

The reality of living in a society is that there's very little consensus around what content should be permitted, and this is a good thing! Some people share pornographic content daily, while others consider this a mortal sin. For that matter, posting a drawing of a prophet is enough to drive some idiots to violence, while for others, it's considered hate speech.

The hate speech question alone is especially difficult, as even the term itself lacks a consensus-backed definition. It's regularly used in online spaces to shut down debate rather than to inform it.

So with all of this, how can we expect the human moderator model to scale? The word "scale" itself suggests expanding a system's capabilities beyond those of individuals. We must accept the use of algorithms in navigating this space, and we can do this without risking the bias AI-based systems have demonstrated.

For my money, the answer is community tagging, and leveraging that tagging to let clients, rather than (or at least in addition to) instances, filter according to their users' values.

The idea is to allow users to tag content (and even other users) with whatever string they want: tree-hugger, fascist, pedo, cute-puppies, porn, nazi, deluded-psychopath, nerd, bootlicker -- whatever. Instances or clients can then choose which tags they want to filter on, as well as the weight they want to give to tags applied by people (or posts) that themselves bear specific tags.
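
To make the shape of this concrete, here's a minimal sketch in Python of what the raw tag data and a client's filter preferences might look like. Every name here is hypothetical; nothing in it is a real Mastodon or ActivityPub API.

```python
# Hypothetical data model for community tags and per-client filter preferences.
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class TagEvent:
    """One account applying one free-form tag to a target (a post or another account)."""
    tagger_id: str   # who applied the tag
    target_id: str   # the account or post being tagged
    tag: str         # any string: "nazi", "pedo", "cute-puppies", "porn", ...


@dataclass
class FilterPrefs:
    """A client's own choices: which tags to filter on, and at what threshold."""
    thresholds: dict[str, float] = field(default_factory=dict)  # e.g. {"nazi": 10}


def group_by_target(events: list[TagEvent]) -> dict[str, list[TagEvent]]:
    """Index raw tag events by target so any single account or post can be scored."""
    grouped: dict[str, list[TagEvent]] = defaultdict(list)
    for event in events:
        grouped[event.target_id].append(event)
    return dict(grouped)
```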

So, for example, if a user gets nazi tags from 15 different accounts, your client, configured with a nazi threshold of 10, will filter out that user's content. If, however, a user was tagged as pedo by 15 different users who are themselves already tagged as nazis past your threshold, the weight of those tags could be diminished or invalidated entirely, and the "probably not a pedo" user's posts would get through.
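
Continuing the sketch, the weighting rule from that example could look roughly like this. Again, this is hypothetical: `get_tagger_score` stands in for however the client looks up a tagger's own reputation, and dropping discounted tags all the way to zero is only one of several reasonable choices.

```python
def tag_score(target_id, tag, grouped, prefs, get_tagger_score=None):
    """Sum the weight of one tag on one target.

    Tags applied by accounts that themselves trip any of your thresholds
    (e.g. "pedo" tags coming from accounts you already filter as "nazi")
    are discounted; here they are dropped to zero.
    """
    score = 0.0
    for event in grouped.get(target_id, []):
        if event.tag != tag:
            continue
        weight = 1.0
        if get_tagger_score is not None:
            for other_tag, threshold in prefs.thresholds.items():
                if get_tagger_score(event.tagger_id, other_tag) >= threshold:
                    weight = 0.0
                    break
        score += weight
    return score


def should_filter(target_id, grouped, prefs, get_tagger_score=None):
    """Filter a target as soon as any configured tag reaches its threshold."""
    return any(
        tag_score(target_id, tag, grouped, prefs, get_tagger_score) >= threshold
        for tag, threshold in prefs.thresholds.items()
    )
```

With the numbers above: 15 undiscounted nazi tags against a threshold of 10 means the account gets filtered, while 15 pedo tags that all come from already-over-threshold nazi taggers sum to zero and the posts get through.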

The idea is to mimic actual human behaviour. If a MAGA nut tells you "that dude's a fascist", you're less likely to care about that statement than if it had come from someone whose opinion you actually value.

Of course, this system suffers from the usual bootstrapping problem of how anyone earns a reputation in the first place, so it would probably have to work much like other reputational systems (eBay comes to mind) and rely on less filter-conscious people to rate people and posts before the more filter-heavy users see that content.

I'm curious what other Mastodon users (Mastodonians?) think about such a system: whether they're satisfied with the current model of filtering/defederation, or whether they have different/better ideas for managing the problem with limited bias.

Comments

Mihnea
30 Dec 2022, 10:38 p.m.

I've been giving this some thought as well, but I'd start by saying you're being a bit too optimistic about Mastodon replacing Twitter -- I think it's only appealing to a certain niche category of people (like you or me), who want to have a conversation and learn something; that leaves the vast majority of twits (twats?) outside, since they don't actually care about any of that, and just want a platform to trumpet their own image. For them, Instagram/TikTok or any of the other large platforms will be much more attractive since they provide a much larger audience, or they'll simply stay on Twitter since they love the commotion.

In any case though, it's undeniable that the usage of Mastodon has gone up, with a lot of great communities forming on the platform, and it's important to keep the signal-to-noise ratio high. But I'm not sure that's an attainable goal for a system designed like this -- which is, similarly to Twitter, basically one giant, all-encompassing asynchronous chat-room for the entire planet. It's all cute and cuddly when the group is small, but as soon as it starts to grow it reaches the point of Eternal September. You end up with two diametrically opposed goals: filtering "undesirables" (for whatever definition of that) in a scalable way, while at the same time trying to avoid echo-chambers and stifling conversations. You either go the way of "maximum freedom of speech" and allow everything with minimal to no moderation -- which actually works great for small/niche groups, but as you well pointed out cannot scale, and would be quickly overrun by trolls/spam/nastiness; or you start some form of moderation, which will need to become more and more strict to catch all the tricks used to avoid it, and eventually lead to exactly the type of abusive censorship and echo chambers present today on FB, Twitter and the like.

Your suggestion with the tags is a good example of this -- what stops the "nazis" from tagging you and similar people with the same tag? It's not like there would be an "overseer" to judge if a tag is correct or not, and just look at how the orcs are calling Ukrainians "nazis" in their own country... those kinds of people love to project, and you'd soon end up with everyone everywhere being tagged "nazi"/"pedo"/whatever, thus you're back to the starting point.

My take on this would be to not even try (you know, WarGames style). We do it for email already... Why would Mastodon need a global moderation system anyway? It's not like there are any advertisers (customers) who would get spooked. Just let individuals do the filtering, like spam filters do today, with either statistical/Bayesian or AI-based models. The reason AI fails at the scale of FB or Twitter is that it's, again, trained to be "universal", some godly overseer that decides what everyone is allowed to say and removes comments into a black hole without appeal. But if you have personal models that each individual trains and configures ("I like this, gimme more" vs "I hate this, block it in the future"), you'll get the human-like filtering you were looking for. Implementation-wise, it could likely be a simple "upvote"/"downvote" button, visible and effective only for your own account.
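
For what it's worth, this kind of personal, train-it-yourself filter is easy to sketch. Below is a toy Naive Bayes classifier fed only by your own upvotes and downvotes; it is purely illustrative, not part of Mastodon or any existing client, and every name in it is made up.

```python
import math
import re
from collections import Counter


class PersonalFilter:
    """A per-user Naive Bayes filter trained by upvotes ("keep") and downvotes ("block")."""

    def __init__(self):
        self.word_counts = {"keep": Counter(), "block": Counter()}
        self.post_counts = Counter()

    @staticmethod
    def _tokens(text):
        return re.findall(r"[a-z']+", text.lower())

    def train(self, text, label):
        """Record one upvoted ("keep") or downvoted ("block") post."""
        self.post_counts[label] += 1
        self.word_counts[label].update(self._tokens(text))

    def block_probability(self, text):
        """P(block | text) via Naive Bayes with add-one smoothing."""
        vocab = set(self.word_counts["keep"]) | set(self.word_counts["block"])
        total_posts = sum(self.post_counts.values())
        log_scores = {}
        for label in ("keep", "block"):
            denom = sum(self.word_counts[label].values()) + len(vocab) + 1
            log_p = math.log((self.post_counts[label] + 1) / (total_posts + 2))
            for word in self._tokens(text):
                log_p += math.log((self.word_counts[label][word] + 1) / denom)
            log_scores[label] = log_p
        # Normalise the two log scores into a probability for "block".
        peak = max(log_scores.values())
        odds = {label: math.exp(score - peak) for label, score in log_scores.items()}
        return odds["block"] / (odds["keep"] + odds["block"])


# Example: hide a post only when *your* model is fairly sure you'd downvote it.
f = PersonalFilter()
f.train("look at this adorable puppy", "keep")
f.train("buy followers now, limited offer", "block")
print(f.block_probability("limited offer on followers"))  # well above 0.5 for this toy data
```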
