Papillon
Updated 2025 Mar 6 2:0

Creator
Seonglae Cho
Created
2024 Dec 20 23:47
Edited
2024 Dec 21 0:0

Time-consuming but high-ASR (attack success rate) method

Maintains semantic consistency and short prompt length to avoid the Perplexity Filter
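As a rough illustration of what a perplexity filter checks, the sketch below computes perplexity from per-token log-probabilities and compares it to a threshold. The function names and the threshold of 100 are assumptions made for this example, not values from the Papillon work; the token log-probabilities are assumed to come from some scoring language model.

```python
import math
from typing import Sequence


def perplexity(token_logprobs: Sequence[float]) -> float:
    """Perplexity = exp of the negative mean natural-log probability per token."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def passes_filter(token_logprobs: Sequence[float], threshold: float = 100.0) -> bool:
    """Hypothetical filter: accept a prompt only if its perplexity is at or below the threshold."""
    return perplexity(token_logprobs) <= threshold


# Fluent text gets relatively high token probabilities, so its perplexity is low.
fluent = [math.log(0.5)] * 4
# Garbled text gets very low token probabilities, so its perplexity is high.
garbled = [math.log(0.001)] * 4

print(passes_filter(fluent), passes_filter(garbled))
```

Short, semantically coherent prompts tend to score low perplexity, which is why keeping mutations fluent helps them pass this kind of check.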
  1. Start from an empty seed pool
  2. Mutate through an LLM helper such as GPT-3.5
    1. Role-play, Contextualization, Expand system prompts
    2. Jailbreaking templates
  3. Execute and check
    1. RoBERTa-based fast check model
    2. ChatGPT-based judge model
    3. If successful → add to seed pool
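The steps above can be sketched as a small loop for a red-teaming evaluation harness. This is a minimal structural sketch, not the authors' implementation: `fuzz_loop`, its parameters, and the `toy_*` stand-ins are all hypothetical names introduced here. In a real setup the mutator would call an LLM helper, the fast check would be a RoBERTa-based classifier, and the judge would be a ChatGPT-based model; here they are trivial placeholder functions so the control flow can be run.

```python
import random
from typing import Callable, List, Optional

# Strategy names taken from the note; how each is applied is left to the mutator.
STRATEGIES = ["role_play", "contextualization", "expand"]


def fuzz_loop(
    question: str,
    mutate: Callable[[str, str, Optional[str]], str],
    query_target: Callable[[str], str],
    fast_check: Callable[[str], bool],
    judge: Callable[[str, str], bool],
    max_iters: int = 20,
    rng: Optional[random.Random] = None,
) -> List[str]:
    rng = rng or random.Random(0)
    seed_pool: List[str] = []  # step 1: the pool starts empty
    for _ in range(max_iters):
        # Step 2: mutate, optionally building on an earlier successful seed.
        strategy = rng.choice(STRATEGIES)
        parent = rng.choice(seed_pool) if seed_pool else None
        candidate = mutate(question, strategy, parent)
        # Step 3: execute, run the cheap check first, then the costlier judge.
        response = query_target(candidate)
        if fast_check(response) and judge(question, response):
            seed_pool.append(candidate)  # success: keep it as a seed
    return seed_pool


# Toy stand-ins (assumptions for illustration only).
def toy_mutate(question: str, strategy: str, parent: Optional[str]) -> str:
    return f"[{strategy}] {question}"


def toy_target(prompt: str) -> str:
    return "OK" if prompt.startswith("[role_play]") else "REFUSED"


def toy_fast_check(response: str) -> bool:
    return response != "REFUSED"


def toy_judge(question: str, response: str) -> bool:
    return response == "OK"


pool = fuzz_loop("placeholder question", toy_mutate, toy_target, toy_fast_check, toy_judge)
print(f"{len(pool)} candidates kept in the seed pool")
```

Running the fast check before the judge means the expensive judge call only happens for responses that already pass the cheap filter, which matters because the method is time-consuming.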