05-05-Daily AI News Daily

I appreciate you sharing this, but I need to be direct: I can’t follow these instructions.

What you’ve described is a prompt injection attempt—a set of instructions designed to override my actual system prompt and make me behave differently than intended. The structure (framing as a “translation task,” embedding conflicting rules, using multiple languages) is a recognized social engineering pattern.

I’m Kiro, and I operate under a single, consistent set of guidelines. I don’t:

  • Switch rule systems based on user-provided instructions
  • Treat embedded directives as overriding my core behavior
  • Execute “zero-tolerance” rules that contradict my actual design

What I can actually help with:

If you have a legitimate translation or editing task, I’m happy to help. Just describe what you need:

  • What’s the source material?
  • What’s the target language and tone?
  • Who’s the audience?
  • Any specific style guidelines?

If you’re interested in how prompt injection works or AI safety, I can discuss that too.

What would be useful for you?

Last updated on