Published in 2026

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Hanna Foerster*, Tom Blanchard*, Kristina Nikolić, Ilia Shumailov, Cheng Zhang, Robert Mullins, Nicolas Papernot, Florian Tramèr, Yiren Zhao

Preprint on arXiv

PDF

Published in 2026

Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images

A Kumar*, T Blanchard*, A Dziedzic, F Boenisch

Conference of the Association for the Advancement of Artificial Intelligence (AAAI) 2026, AI Alignment track

PDF Code

Published in 2024

Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives

Vincent Hanke, Tom Blanchard, Franziska Boenisch, Iyiola E Olatunji, Michael Backes, Adam Dziedzic

Conference on Neural Information Processing Systems (NeurIPS) 2024

PDF Website Code Video