PerturbationCATCAPO (DPO)Continuous-Adversarial Preference Optimization Continuous-AdvTrainsophie-xhonneux • Updated 2024 Dec 14 8:8arxiv.orghttps://arxiv.org/pdf/2405.15589