Lipschitz-constrained Skill Discovery

Creator
Creator
Seonglae Cho
Created
Created
2024 May 22 2:9
Editor
Edited
Edited
2025 Jan 19 14:59
Refs
Refs

LSD

Take changes in state distance into account for more complex and dynamic skills by incentivizing challenging behaviors.
Align the direction of skill vector and state change.
LSD는 learning skill policy and ϕ(s)\phi(s) 에 distance 고려를 추가하기 위해 to maximize z\cdot z 항을 추가한다 (ϕ(s)ϕ(s))z(\phi(s') - \phi(s)) \cdot z and regulate ϕ(s)\phi(s) to reflect distance in ss: ϕ(s)ϕ(s)ss|| \phi (s') - \phi(s)||\le||s' - s|| preventing ϕ(s)\phi(s) becoming infinitely large.
 
 

Limitation

euclidan distance based does not always consistent
 
 
 
 
 

Recommendations