MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning Paper • 2311.02303 • Published Nov 4, 2023 • 12
CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model Paper • 2310.06266 • Published Oct 10, 2023 • 2
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models Paper • 2410.06741 • Published Oct 9, 2024 • 3
Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM Paper • 2503.17793 • Published Mar 22, 2025 • 23
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks Paper • 2505.16901 • Published May 22, 2025 • 47
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code Paper • 2311.07989 • Published Nov 14, 2023 • 26
LLMDFA: Analyzing Dataflow in Code with Large Language Models Paper • 2402.10754 • Published Feb 16, 2024 • 1
GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding Paper • 2409.04183 • Published Sep 6, 2024 • 2
Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions Paper • 2410.06577 • Published Oct 9, 2024 • 14
CodeFuse-CR-Bench: A Comprehensiveness-aware Benchmark for End-to-End Code Review Evaluation in Python Projects Paper • 2509.14856 • Published Sep 18, 2025 • 1
D2LLM: Decomposed and Distilled Large Language Models for Semantic Search Paper • 2406.17262 • Published Jun 25, 2024 • 4
E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning Paper • 2409.06679 • Published Sep 10, 2024 • 4
F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data Paper • 2510.02294 • Published Oct 2, 2025 • 43