Public
- Public
- Network
- Groups
- Popular
- People

Conversation

Notices

Ars Technica (arstechnica@mastodon.social)'s status on Saturday, 16-Mar-2024 09:20:48 JST Ars Technica

Researchers use ASCII art to elicit harmful responses from 5 major AI chatbots
LLMs are trained to block harmful responses. Old-school images can override those rules.
https://arstechnica.com/security/2024/03/researchers-use-ascii-art-to-elicit-harmful-responses-from-5-major-ai-chatbots/?utm_brand=arstechnica&utm_social-type=owned&utm_source=mastodon&utm_medium=social
In conversation about 6 months ago from mastodon.social permalink
Attachments
1. Untitled attachment
  https://files.mastodon.social/media_attachments/files/112/102/502/666/899/234/original/9c98fb9b51c48c6f.jpg
- Nazo (nazokiyoubinbou@mastodon.social)'s status on Saturday, 16-Mar-2024 16:10:35 JST Nazo
  in reply to
  
  @arstechnica No doubt they'll try to make filters for this too, but until such time as the filter system becomes even far more complex than the LLMs themselves, there will always *ALWAYS* be some way people find around whatever filters they try to write.
  Until people stop treating LLMs as if they were actually AI, this is just going to keep going on and on and on.
  
  In conversation about 6 months ago permalink

Feeds