Inlay

A method for testing AI safety by using human-like concepts to trick generative models into making mistakes has been developed by Hub researchers. The paper, entitled "Concept-based Adversarial Attack: A Probabilistic Perspective", is summarised on our website: www.genai.ac.uk/news/hub-res...

3mo

Many thanks to the UKRI AI Hub in Generative Models for featuring our work and for their support of this research.

Gen AI - the AI Hub in Generative Models

3mo

Andi Zhang
