I don't want you to provide anything, and I'm not the one that needs to be convinced. I'm content with either decision. I'm saying that the people who decide which version of code to run should demand more than an intuitive demonstration that the change will make things better.
What does better mean? In curation, "better" means that it is more likely to rank a set of posts in the correct order, according to user preferences. So, it seems to me that the witnesses who will run the code should ask whoever is proposing the change to provide some level of evidence that the post ranking after the change is likely to be more correct (closer to matching user preferences) than post ranking before the change.
Again, the way to realign incentives is not through Law but the enforcement of law
Your argument seems self-contradictory. On one hand, you say that the rules don't matter - and we need to just depend on curators to downvote, but you're making that argument in support of a rule change. If we can't solve the problem of incorrect ranking of posts by changing the rules of the game, then why are we having this conversation at all?