This repository has been archived by the owner on Jan 13, 2022. It is now read-only.
[Feature] Use metadata keywords to help detect if something is NSFW #482
Labels
🙅 status: discontinued
Not suitable for work as repo is in maintenance
🏷 status: label work required
Needs proper labelling before it can be worked on
Problem Description
We are trying to make NSFW content in CC Search "opt-in". We can catch a lot of NSFW content by using API specific filters and relying on moderation "upstream" at the source, but sometimes things slip through.
Solution Description
One way we can help prevent this is scanning for NSFW profanity and slurs in the title/tags/artist name and settings
nsfw = True
in the metadata field if it fails the check. There are 3rd party lists of dirty words that can help us achieve this. In my experience moderating content on CC Search, this will help prevent a lot of embarrassment and indignant emails from teachers.We can do a one-time scan-and-filter relatively easily, but we will also need a way to filter new content as it is ingested.
Additional Context
The Scunthorpe Problem
The text was updated successfully, but these errors were encountered: