1 Comment
User's avatar
Rome Viharo's avatar

interesting on the first study where they discovered a simple token or configuration of token could easily corrupt the underlying LLM because the converse is also true, tokens can be “hacked” to get the LLM to think properly and not drift.

Expand full comment