ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models Paper • 2510.21450 • Published Oct 24 • 4