mradermacher/ASTRA-14B-Thinking-v1-GGUF Reinforcement Learning • 15B • Updated about 2 hours ago • 572