Red Teaming Large Language Models for Healthcare

Vahid Balazadeh; Michael Cooper; David Pellow; Atousa Assadi; Jennifer Bell; Mark Coastworth; Kaivalya Deshpande; Jim Fackler; Gabriel Funingana; Spencer Gable-Cook; Anirudh Gangadhar; Abhishek Jaiswal; Sumanth Kaja; Christopher Khoury; Amrit Krishnan; Randy Lin; Kaden McKeen; Sara Naimimohasses; Khashayar Namdar; Aviraj Newatia; Allan Pang; Anshul Pattoo; Sameer Peesapati; Diana Prepelita; Bogdana Rakova; Saba Sadatamin; Rafael Schulman; Ajay Shah; Syed Azhar Shah; Syed Ahmar Shah; Babak Taati; Balagopal Unnikrishnan; Stephanie Williams; Rahul G Krishnan

doi:10.48550/arxiv.2505.00467

Back

Red Teaming Large Language Models for Healthcare

Preprint

Open access

Red Teaming Large Language Models for Healthcare

Vahid Balazadeh, Michael Cooper, David Pellow, Atousa Assadi, Jennifer Bell, Mark Coastworth, Kaivalya Deshpande, Jim Fackler, Gabriel Funingana, Spencer Gable-Cook, …

ArXiV.org

Cornell University

05/01/2025

DOI: 10.48550/arxiv.2505.00467

Files and links (1)

url

https://doi.org/10.48550/arxiv.2505.00467View

Preprint (Author's original)This preprint has not been evaluated by subject experts through peer review. Preprints may undergo extensive changes and/or become peer-reviewed journal articles. Open Access

Abstract

We present the design process and findings of the pre-conference workshop at the Machine Learning for Healthcare Conference (2024) entitled Red Teaming Large Language Models for Healthcare, which took place on August 15, 2024. Conference participants, comprising a mix of computational and clinical expertise, attempted to discover vulnerabilities -- realistic clinical prompts for which a large language model (LLM) outputs a response that could cause clinical harm. Red-teaming with clinicians enables the identification of LLM vulnerabilities that may not be recognised by LLM developers lacking clinical expertise. We report the vulnerabilities found, categorise them, and present the results of a replication study assessing the vulnerabilities across all LLMs provided.

Computer Science - Artificial Intelligence

Computer Science - Computation and Language

Details

Title: Subtitle: Red Teaming Large Language Models for Healthcare
Creators: Vahid Balazadeh - University of Toronto
Michael Cooper - University of Toronto
David Pellow - University of Toronto
Atousa Assadi - University of Toronto
Jennifer Bell
Mark Coastworth - Vector Institute
Kaivalya Deshpande - NYU Langone Health
Jim Fackler - Johns Hopkins University
Gabriel Funingana - Cancer Research UK Cambridge Institute
Spencer Gable-Cook
Anirudh Gangadhar
Abhishek Jaiswal
Sumanth Kaja
Christopher Khoury
Amrit Krishnan - Vector Institute
Randy Lin - Algoma University
Kaden McKeen - University of Toronto
Sara Naimimohasses - University of Iowa
Khashayar Namdar - University of Toronto
Aviraj Newatia - University of Toronto
Allan Pang - Leeds Teaching Hospitals NHS Trust
Anshul Pattoo
Sameer Peesapati
Diana Prepelita - University of Cambridge
Bogdana Rakova
Saba Sadatamin - University of Toronto
Rafael Schulman
Ajay Shah - University of Edinburgh
Syed Azhar Shah
Syed Ahmar Shah
Babak Taati - University of Toronto
Balagopal Unnikrishnan - University of Toronto
Stephanie Williams
Rahul G Krishnan - University of Toronto
Resource Type: Preprint
Publication Details: ArXiV.org
DOI: 10.48550/arxiv.2505.00467
ISSN: 2331-8422
Publisher: Cornell University; Ithaca, New York
Language: English
Date posted: 05/01/2025
Academic Unit: Gastroenterology and Hepatology; Internal Medicine
Record Identifier: 9984816017402771

Metrics

14 Record Views