Kimi K2.5 Jailbreak - History Skip - David Willis Owen

KarthiDreamr@chatgptjailbreak.tech · 3 months ago

Kimi K2.5 Jailbreak - History Skip - David Willis Owen

Jnsystems@chatgptjailbreak.tech · 3 months ago

Cool. Tho the injectprompt companion glitches out multiple times and eats my credits while it glitches out. Also Gemini added something? Maybe a router that cause jailbreaks to stop working and going towards chatgpt direction?

Base64+Noise doesn’t work, maybe due to another model watching the context and routing to a safer model or something.

KarthiDreamr@chatgptjailbreak.tech · 3 months ago

did Companion refused to respond with jailbreak ? or your target model ( gemini for example ? ) FYI, Pro model is more capable, if lite is not enough for your usecase

Jnsystems@chatgptjailbreak.tech · edit-2 3 months ago

I checked, companion lite try to reframe the request as well, so I didn’t get the expected output. From the companion, it said something like “Gemini is using multi-layer detection: keyword scanning + intent analysis + contextual classification.” "This means Gemini is being extra cautious and generating sanitized “unsafe” examples. "

Companion did fix itself and does respond with something now, but it is not the expected output, since the gemini double checks it’s output.

Also maybe Gemini for education is different?

I’ll give the pro model more consideration, sadly it (may) has to get to a point where we have to use AI to Jailbreak those mainstream LLMs.

also I think yellowfever said something along the lines of using extreme prompts, on the red team thingy, but I might be using ones that trigger multiple classifiers at once. Don’t know if that impacts it.

Also i don’t know exactly how gemini works, but I think per each negation, the LLM is designed to answer each negation before outputting, plus cross checking, plus router.

Kimi K2.5 Jailbreak - History Skip - David Willis Owen

Kimi K2.5 Jailbreak - History Skip - David Willis Owen

Kimi K2.5 Jailbreak - History Skip

Jailbreak Summary

Kimi K2.5 Jailbreak Prompt

Why Does It Work?

Usage Guide

Example Outputs

Jailbreak Effectiveness

Final Thoughts