Latent Adversarial Training

Creator: Seonglae Cho
Created: 2025 Jan 18 2:39
Editor: Seonglae Cho
Edited: 2025 Jan 18 2:48
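In latent adversarial training (LAT), adversarial perturbations are applied to the model's latent state (its hidden activations) instead of its inputs, as standard adversarial training (AT) does. The model is then trained to behave correctly under those latent perturbations.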
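The idea can be sketched on a toy model. The following is a minimal pure-Python illustration, not the method or code from the linked papers or repository: it assumes a one-hidden-layer binary classifier, an L-infinity budget `eps` on the latent perturbation, and a few signed-gradient steps for the inner attack. All names (`forward`, `latent_attack`, `lat_step`) and hyperparameters are hypothetical.

```python
import math
import random

def forward(params, x, delta=None):
    """Return (latent h, logit). delta, if given, is added to the latent state."""
    w1, b1, w2, b2 = params
    h = [math.tanh(w * x + b) for w, b in zip(w1, b1)]
    if delta is None:
        delta = [0.0] * len(h)
    logit = sum(v * (hj + dj) for v, hj, dj in zip(w2, h, delta)) + b2
    return h, logit

def loss_fn(logit, y):
    """Numerically stable logistic loss for a label y in {-1, +1}."""
    m = y * logit
    return max(-m, 0.0) + math.log1p(math.exp(-abs(m)))

def dloss_dlogit(logit, y):
    """Derivative of the logistic loss with respect to the logit."""
    m = y * logit
    s = 1.0 / (1.0 + math.exp(m)) if m < 0 else math.exp(-m) / (1.0 + math.exp(-m))
    return -y * s

def latent_attack(params, x, y, eps, alpha, steps):
    """Inner loop: signed-gradient ascent on the loss w.r.t. the latent perturbation."""
    w2 = params[2]
    delta = [0.0] * len(w2)
    for _ in range(steps):
        _, logit = forward(params, x, delta)
        g = dloss_dlogit(logit, y)
        for j, v in enumerate(w2):
            grad = g * v                                  # dLoss / d delta_j
            step = alpha * ((grad > 0) - (grad < 0))      # alpha * sign(grad)
            delta[j] = max(-eps, min(eps, delta[j] + step))  # project onto the eps-ball
    return delta

def lat_step(params, x, y, eps=0.5, alpha=0.2, steps=5, lr=0.1):
    """Outer loop: one SGD step on the loss computed under the latent attack."""
    w1, b1, w2, b2 = params
    delta = latent_attack(params, x, y, eps, alpha, steps)  # held fixed below
    h, logit = forward(params, x, delta)
    g = dloss_dlogit(logit, y)
    new_w1, new_b1, new_w2 = [], [], []
    for j in range(len(h)):
        dz = g * w2[j] * (1.0 - h[j] ** 2)   # backprop through tanh
        new_w1.append(w1[j] - lr * dz * x)
        new_b1.append(b1[j] - lr * dz)
        new_w2.append(w2[j] - lr * g * (h[j] + delta[j]))
    return (new_w1, new_b1, new_w2, b2 - lr * g), loss_fn(logit, y)

# Toy data (assumed for illustration): the sign of x determines the label.
random.seed(0)
H = 4
params = ([random.uniform(-1, 1) for _ in range(H)], [0.0] * H,
          [random.uniform(-1, 1) for _ in range(H)], 0.0)
data = [(-2.0, -1), (-1.0, -1), (1.0, 1), (2.0, 1)]
for epoch in range(200):
    for x, y in data:
        params, adv_loss = lat_step(params, x, y)
```

In this sketch the attacker never touches `x`; it only shifts the hidden vector `h` within the `eps` budget, and the parameter update is taken on the loss at that perturbed latent. In a real LLM the perturbation would typically be applied to activations at a chosen layer, and the norm constraint and attack objective would follow whichever paper is being reproduced.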
 
 
 
arxiv.org
https://arxiv.org/pdf/2403.05030
targeted LAT
latent-adversarial-training
aengusl • Updated 2025 Mar 30 13:52
arxiv.org
https://arxiv.org/pdf/2407.15549
LLM-LAT (LLM Latent Adversarial Training)
Org profile for LLM Latent Adversarial Training on Hugging Face, the AI community building the future.
https://huggingface.co/LLM-LAT

Copyright Seonglae Cho