Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI Paper • 2401.14019 • Published Jan 25, 2024
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants Paper • 2308.16884 • Published Aug 31, 2023
Genie: Achieving Human Parity in Content-Grounded Datasets Generation Paper • 2401.14367 • Published Jan 25, 2024