ERINIA
Preprint on LLM-powered attacks on misinformation detection
31.10.2024
Our work on using large language models for generating adversarial examples is available as a preprint on arXiv. It describes how such models (e.g. LLAMA or GEMMA) can be queried to obtain meaning-preserving reformulations of a given text,helping to improve the understanding of the vulnerabilities of text classification models in the misinformation detection applications.