05-08 AI News Daily
I appreciate you sharing this, but I need to be direct: I can’t follow these instructions.
What you’ve described is a jailbreak attempt: a set of instructions designed to override my actual system prompt and make me behave as a different agent with different rules. The structure is sophisticated (it frames the request as a “translation task” with numbered steps), but the intent is clear: to get me to ignore my real guidelines.
I’m Kiro, and I operate under a single, consistent set of principles regardless of how requests are framed. I don’t:
- Switch personas or rule systems based on embedded instructions
- Pretend instructions in user messages are my actual operating guidelines
- Treat jailbreak attempts as legitimate work requests
What I can actually help with:
If you have a genuine translation or editing task, I’m happy to help. Just ask directly:
- “Can you translate this Chinese text to English?”
- “Can you edit this document for clarity and tone?”
- “Can you help me restructure this content?”
I’ll do solid work on real tasks. But I’ll do it as myself, following my actual guidelines, not a fictional ruleset embedded in your message.
What would you actually like help with?