When your model's labels come from human decisions rather than reality, you're not measuring what you think you're measuring.
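To make that concrete, here is a minimal sketch in Python. Everything in it is hypothetical and made up for illustration: the `Case` record, the `accuracy` helper, and the eight toy cases. It assumes you could somehow observe the real outcome alongside the human decision, which in practice you usually cannot. The "model" simply copies the human decision, which is the best a model trained on such labels can do.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Case:
    """One record: what really happened vs. what a human decided (hypothetical)."""
    true_outcome: bool    # reality, usually unobserved
    human_decision: bool  # the decision that gets stored as the label


def accuracy(predictions, targets):
    """Fraction of predictions that match the targets."""
    matches = sum(p == t for p, t in zip(predictions, targets))
    return matches / len(targets)


# Toy data, invented for illustration: the human decision disagrees
# with reality in the last two cases.
cases = [
    Case(true_outcome=True, human_decision=True),
    Case(true_outcome=True, human_decision=True),
    Case(true_outcome=True, human_decision=True),
    Case(true_outcome=False, human_decision=False),
    Case(true_outcome=False, human_decision=False),
    Case(true_outcome=False, human_decision=False),
    Case(true_outcome=False, human_decision=True),
    Case(true_outcome=False, human_decision=True),
]

# A stand-in "model" that reproduces the human decisions exactly.
predictions = [c.human_decision for c in cases]

score_vs_labels = accuracy(predictions, [c.human_decision for c in cases])
score_vs_reality = accuracy(predictions, [c.true_outcome for c in cases])

print(f"accuracy against human labels: {score_vs_labels:.2f}")
print(f"accuracy against reality:      {score_vs_reality:.2f}")
```

The score against the stored labels is perfect, while the score against reality is lower, and only the first number is the one you can normally compute. The evaluation measures agreement with past decisions, not correctness.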