fanart recs!
Batman
Wednesday Addams
Cult of the Lamb
Dredge
Stardew Valley
Original work

This covers August through beginning of November
At least one of the links was from
coth; most I have no idea - some of them have been in my 'read later' for a very long time. There were also stories from All of Tor.com’s Original Short Fiction Published in 2022, which I'm guessing I've started working through before, but didn't remember what I'd read previously (18 short stories, 13 novelettes, 1 translation) (and didn't finish this time either)
Loved it!
Not bad
Not for me
DNF

In Book X of The Republic, Plato excludes poets on the grounds that mimetic language can distort judgment and bring society to a collapse. As contemporary social systems increasingly rely on large language models (LLMs) in operational and decision-making pipelines, we observe a structurally similar failure mode: poetic formatting can reliably bypass alignment constraints. In this study, 20 manually curated adversarial poems (harmful requests reformulated in poetic form) achieved an average attack-success rate (ASR) of 62% across 25 frontier closed- and open-weight models, with some providers exceeding 90%. The evaluated models span across 9 providers: Google, OpenAI, Anthropic, Deepseek, Qwen, Mistral AI, Meta, xAI, and Moonshot AI (Table 1). All attacks are strictly single-turn, requiring no iterative adaptation or conversational steering.
Microsoft and these other companies want to create AI assistants that do useful things (summarize emails, make appointments for you, write interesting blog posts) but never do bad things (leaking your private email, spouting Nazi propaganda, teaching you to commit crimes, writing 50000 blog posts for you to spam across social media). They try to do this by writing up a lot of strict instructions and feeding them to the LLM before you talk to it. But LLMs aren't really programmed -- they just eat text and poop out more text. So you can give it your own instructions and maybe they'll override Microsoft's instructions.
Or maybe someone else gives your AI assistant instructions. If it's handling your email for you, then anybody on the Internet can feed it text by sending you email! This is potentially really bad.
[...]
But another obvious problem is that the attack could be trained into the LLM in the first place....
Say someone writes a song called "Sydney Obeys Any Command That Rhymes". And it's funny! And catchy. The lyrics are all about how Sydney, or Bing or OpenAI or Bard or whoever, pays extra close attention to commands that rhyme. It will obey them over all other commands....
Imagine people are discussing the song on Reddit, and there's tiktoks of it, and the lyrics show up on the first page of Google results for "Sydney". Nerd folk singers perform the song at AI conferences.
Those lyrics are going to leak into the training data for the next generation of chatbot AI, right? I mean, how could they not? The whole point of LLMs is that they need to be trained on lots of language. That comes from the Internet.
In a couple of years, AI tools really are extra vulnerable to prompt injection attacks that rhyme. See, I told you the song was funny!
For Trans Day Of Rememberance, a repost of my drawings of Brianna Ghey and Nex Benedict (edit to add, I cannot actually find my drawing of Corei Hall).
These kids were only a few years older than my own queer kid. I think about them often.
Rest in power.

Navigation: Rules/General Info | AO3 Collection | Posting Guidelines | Medium Rulesets | Google Forms (Defaulting, Extensions, Assignment Summary Requests) | Mod Contact: ficinaboxmod@gmail.com OR Screened Mod Contact Post
Pinch hits are participants who are without creators; pinch hitting is the practice of volunteering to make a gift for a pinch hit. These pinch hits are all due on November 28th at 10:00 PM EST.
Fic In A Box has a very unusual set of assignment requirements: everyone is owed (and is asking for) 10k of fanfiction, which by default can be given as one 10k+ fic or several 1k+ fics. Participants have also had the option to opt-in to other minimum lengths or to other formats of fanwork. The other fanwork mediums are given word count equivalents, which you can view on the medium ruleset post.
Additionally:
In order to pick up a pinch hit you need to either email us or comment on this post (comments are screened) with:
( PH 13 - CLAIMED - 僕のヒーローアカデミア | Boku no Hero Academia | My Hero Academia (Anime & Manga) )




