School trains you to equate difficulty with value
School is a closed-world domain: crisply-defined problems, single-dimension grading, low performance ceiling. The only form of progression is harder courses. Spend 15 years in this system and you internalize one heuristic: harder = more valuable. Given the reward function, this isn't irrational — the reward function is just wrong.
Elite recruiting culture reinforces it. "What are you looking for?" "Hard technical problems." Companies say it back in job ads. Interview questions reward the answer. By graduation, difficulty-seeking feels like ambition rather than a trained reflex.
When you enter the real world, you carry this heuristic into a domain where it doesn't apply. The problem isn't ability — it's that the heuristic was trained on the wrong environment.
School never asked you to find the problem, only to solve it. That's the gap.
Source claim: School's reward structure trains people to optimize for difficulty rather than importance, and that habit survives graduation.