Pioneering Work on Enhancing Code Reasoning with Search and Self-Correction
Large language models (LLMs) for code generation and understanding can be boosted by training on search-like reasoning traces and by self-correction mechanisms. Recent work has framed code tasks as iterative decision processes, using fine-tuning or r...