Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models

Gang Li; Wendi Yu; Yao Yao; Wei Tong; Yingbin Liang; Qihang Lin; Tianbao Yang

doi:10.48550/arxiv.2410.03955

Back

Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models

Preprint

Open access

Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models

Gang Li, Wendi Yu, Yao Yao, Wei Tong, Yingbin Liang, Qihang Lin and Tianbao Yang

ArXiv.org

Cornell University

10/04/2024

DOI: 10.48550/arxiv.2410.03955

Files and links (1)

url

https://doi.org/10.48550/arxiv.2410.03955View

Preprint (Author's original)This preprint has not been evaluated by subject experts through peer review. Preprints may undergo extensive changes and/or become peer-reviewed journal articles. Open Access

Abstract

In the real world, a learning-enabled system usually undergoes multiple cycles of model development to enhance the system's ability to handle difficult or emerging tasks. This continual model development process raises a significant issue that the model development for acquiring new or improving existing capabilities may inadvertently lose capabilities of the old model, also known as catastrophic forgetting. Existing continual learning studies focus on mitigating catastrophic forgetting by trading off performance on previous tasks and new tasks to ensure good average performance. However, they are inadequate for many applications especially in safety-critical domains, as failure to strictly preserve the performance of the old model not only introduces safety risks and uncertainties but also imposes substantial expenses in the re-improving and re-validation of existing properties. To address this issue, we introduce model developmental safety as a guarantee of a learning system such that in the model development process the new model should strictly preserve the existing protected capabilities of the old model while improving its performance on target tasks. To ensure the model developmental safety, we present a safety-centric framework by formulating the model developmental safety as data-dependent constraints. Under this framework, we study how to develop a pretrained vision-language model (aka the CLIP model) for acquiring new capabilities or improving existing capabilities of image classification. We propose an efficient constrained optimization algorithm with theoretical guarantee and use its insights to finetune a CLIP model with task-dependent heads for promoting the model developmental safety. Our experiments on improving vision perception capabilities on autonomous driving and scene recognition datasets demonstrate the efficacy of the proposed approach.

Computer Science - Artificial Intelligence

Computer Science - Learning

Mathematics - Optimization and Control

Statistics - Machine Learning

Details

Title: Subtitle: Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models
Creators: Gang Li - Texas A&M University
Wendi Yu - Texas A&M University
Yao Yao - University of Iowa
Wei Tong - General Motors (United States)
Yingbin Liang - The Ohio State University
Qihang Lin - University of Iowa
Tianbao Yang - Texas A&M University
Resource Type: Preprint
Publication Details: ArXiv.org
DOI: 10.48550/arxiv.2410.03955
ISSN: 2331-8422
Publisher: Cornell University; Ithaca, New York
Language: English
Date posted: 10/04/2024
Academic Unit: Computer Science; Business Analytics
Record Identifier: 9984723056402771

Metrics

14 Record Views