Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Risk/AI Alignment/AI Control/Activation Steering/
Weight Steering
Loading views...
Search

Weight Steering

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2026 Jun 19 0:13
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2026 Jun 19 0:13
Refs
Refs
 
 
Steer2Edit
Steer2Edit: From Activation Steering to Component-Level Editing
Steering methods influence Large Language Model behavior by identifying semantic directions in hidden representations, but are typically realized through inference-time activation interventions...
Steer2Edit: From Activation Steering to Component-Level Editing
https://arxiv.org/abs/2602.09870
Steer2Edit: From Activation Steering to Component-Level Editing
From Weights to Activations: Is Steering the Next Frontier of Adaptation?
Post-training adaptation of language models is commonly achieved through parameter updates or input-based methods such as fine-tuning, parameter-efficient adaptation, and prompting. In parallel, a...
From Weights to Activations: Is Steering the Next Frontier of Adaptation?
https://arxiv.org/abs/2604.14090v1
From Weights to Activations: Is Steering the Next Frontier of Adaptation?
Weight Arithmetics Steering
Steering Language Models with Weight Arithmetic
Providing high-quality feedback to Large Language Models (LLMs) on a diverse training distribution can be difficult and expensive, and providing feedback only on a narrow distribution can result...
Steering Language Models with Weight Arithmetic
https://arxiv.org/abs/2511.05408
Steering Language Models with Weight Arithmetic
 
 

 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Risk/AI Alignment/AI Control/Activation Steering/
Weight Steering
Copyright Seonglae Cho