source
Arena
Observed user preference when models edit the same source image under the same instruction. How well edits preserve intent while landing visually preferred outputs.
Pixel-level preservation fidelity. Objective adherence scoring for localized edits or safety policies.