-
-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OpenAI check results get ignored #139
Comments
What is your OpenAI-related configuration for tg-spam? Actually, it would help to see the whole configuration. |
I have checked the code and don't really see how this is possible unless you run the system in dry or training mode. As long as the OpenAI check is invoked and spam is detected, it should treat the overall result as spam. There are multiples tests covering all the cases with OpenAI detection, and I can't see anything wrong here. |
The system was in the training mode. Still, the spam messages should've been marked, right? Without actually banning the users who posted them. version: '3.8'
services:
tg-spam:
image: umputun/tg-spam:latest
hostname: tg-spam
restart: always
container_name: tg-spam
user: <redacted>
environment:
- TZ=Europe/Belgrade
- TELEGRAM_TOKEN=<redacted>
- TELEGRAM_GROUP=<redacted>
- ADMIN_GROUP=<redacted>
- LOGGER_ENABLED=true
- LOGGER_FILE=/srv/log/tg-spam.log
- LOGGER_MAX_SIZE=5M
- FILES_DYNAMIC=/srv/var
- NO_SPAM_REPLY=true
- OPENAI_TOKEN=<redacted>
- OPENAI_VETO=true
- OPENAI_MODEL=gpt-4o
- MAX_EMOJI=-1
- MESSAGE_WARN=""
- TRAINING=true
- HISTORY_DURATION=24h
- HISTORY_MIN_SIZE=5000
volumes:
- ./log:/srv/log
- ./var:/srv/var
- ./data:/srv/data
command: --super=<redacted> --super=<redacted> Now I changed it from training mode to production mode with softbans and OpenAI veto disabled (because of #138). Will probably see if it worked in a few hours when all the spam bots are back to work 😄 |
There was something that shouldn't have affected the outcome in any way, but still. We started to get those missed spam messages (I mean, spam messages not being classified as spam) after a couple more admins had joined the admin group (actually, we haven't received a single message initiated by the bot since that happened, despite the bot's docker container being restarted a couple of times). I've removed and added the bot to the admin group again, just out of a superstition. Probably just a coincidence, but giving it here for the sake of context completeness. |
Another thing worth mentioning is how I found out about OpenAI's classification:
So there's no evidence that openai check was even performed when the message first reached the bot. For all I know (without understanding much of the code), the check could've happened only after I marked the message as spam manually. Or at some time in-between, but after the initial "ham" conclusion was made by the bot. |
The full message was this (after replying "spam" to the spammer):
I assumed that "original detection results for" are the cached results from the first, automated run of the detection (in that case they are strange as the message would've been detected because of the openai's positive result). But are they really? |
according to docs: "--training - if set, the bot will not ban users and delete messages but will learn from them. This is useful for training purposes.". Those detected spam messages should be forwarded to your admin group. I'm not sure what you meant by "marked," but it won't remove the message by itself in this mode. |
No, this is not cached in any way. The moment you send the |
You can try setting DEBUG=true in the compose environment and check the container's log at the moment the missing spam occurred. This may give us some clues. |
So the problem is not that messages are marked by OpenAI check as spam but ignored as I thought initially. The likely problem is that OpenAI check wasn't invoked at all, or returned an error, so the message was marked as ham. And when I mark the message as spam manually, it goes through all the checks again, and this time OpenAI's classification gets invoked properly (but doesn't affect anything at this point). |
Yeah, this could be correct. Retrying on OpenAI may help to minimize the issue #140 |
released with v1.14.0 |
@umputun By the way, I can't find the results of OpenAI check on messages not detected as spam.
OpenAI should have been invoked, but the results are not logged. |
I'm seeing a consistent pattern of messages, classified as spam by OpenAI checker, not being flagged automatically.
After marking the message as spam manually, I see things like this:
So, it should've been marked by the bot automatically, but it haven't.
I'm not sure how to debug this further.
The text was updated successfully, but these errors were encountered: