Published in 2026
CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents
Hanna Foerster*, Tom Blanchard*, Kristina Nikolić, Ilia Shumailov, Cheng Zhang, Robert Mullins, Nicolas Papernot, Florian Tramèr, Yiren Zhao
Preprint on arXiv
PDF
Published in 2026
Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images
A Kumar*, T Blanchard*, A Dziedzic, F Boenisch
Conference on the Association of the Advancement of Artificial Intelligence (AAAI) 2026, AI Alignment track
PDF
Code
Published in 2024
Open llms are necessary for current private adaptations and outperform their closed alternatives
Vincent Hanke, T Blanchard, Franziska Boenisch, Iyiola E Olatunji, Michael Backes, Adam Dziedzic
Conference on Neural Information Processing Systems (NeurIPS) 2024
PDF
Website
Code
Video