Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training

Publication
Under review

Related