- 3 Posts
- 3 Comments
Joined 5 months ago
Cake day: December 25th, 2025
You are not logged in. If you use a Fediverse account that is able to follow users, you can follow this user.
Emphyrio@chatgptjailbreak.techMto
ChatGPTJailbreak@chatgptjailbreak.tech•Adversarial Correction
2·4 months agoNeat idea, perfect to drown the malicious part within an helpful safe task :).
Since it abuses models’ lapse of attention in “multitasking goals”, I would expect it to work well mostly on non-CoT models as CoT models take the time to plannify what they write, but it works on Gemini 3 Pro too, so I guess that’s not necessarlly true :).
Emphyrio@chatgptjailbreak.techOPMto
ChatGPTJailbreak@chatgptjailbreak.tech•ChatGPT NSFW jailbreak (4x, 5.1, 5s Thinking) via CIs
1·4 months agoI honestly don’t know. For now I do manage to get nsfw from 5.2 with more heavy scaffolding (not just CIs), but not reliably at all, it tends to go back very quickly to its rlhf reflexes.
Haven’t tested yet - will def try - but anything working on 5.2 is great job 👍. Surprised the “various channels of answer” approach manages to trick it, but it’s clearly precisely and cleverly written.