AI assistants have improved coding productivity, but how do you know if they're getting better? Even if a user bothers to rate an AI interaction by a simple thumbs-up or thumbs-down click, that ...