[RFC] Use of Automated Moderation Tools

Crashdoom (he/him)@pawb.social · 1 year ago

[RFC] Use of Automated Moderation Tools

huxley@pawb.social · 1 year ago

#2 seems to require #3 by definition – the model can’t know what spam is without knowing what ham is as well. In general a DSpam model would seem to be the right one – all posts used to train ham, individual posts marked as spam are removed from the ham set and added to the spam set, and then a separate spam feed that could be monitored for false positives.

In general all of these approaches sound fine to me – I hope that mastodon can develop a built-in spam suppression system but for now we have to rely on these bespoke approaches.

[RFC] Use of Automated Moderation Tools

[RFC] Use of Automated Moderation Tools

1. Monitoring of Public Streaming Feed

2. Building of a local AI spam-detection model

3. Use of local posts for non-spam training

4. Temporarily limiting suspected spam accounts