FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning Paper • 2506.16123 • Published Jun 19 • 8
Typhoon 2.1 Collection Typhoon 2.1 Text ThaiLLM release by SCB 10X. • 7 items • Updated 13 days ago • 3
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18 • 19
European LLMs Collection Large language models for European languages (multilingual and monolingual) • 13 items • Updated May 26, 2024 • 3
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published Feb 13 • 31
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 99
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 285
Typhoon 2 Text Collection Typhoon 2 Text ThaiLLM release by SCB 10X. • 20 items • Updated 13 days ago • 5
Typhoon 2 Multimodal Collection Latest Official Multimodal ThaiLLM release by SCB 10X. • 3 items • Updated 13 days ago • 4
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch Paper • 2410.18693 • Published Oct 24, 2024 • 42
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 236
Probably function calling datasets Collection Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 39
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13, 2024 • 27
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 248
Interesting Datasets Collection A collection of datasets that I come across • 9 items • Updated Jan 4, 2024 • 5