OpenDataArena

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

LHL3341 authored a paper 3 days ago

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

apeters submitted a paper 8 days ago

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

LHL3341 authored a paper 2 months ago

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

View all activity

LHL3341

authored a paper 3 days ago

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Paper • 2512.14051 • Published 9 days ago • 38

apeters

submitted a paper to Daily Papers 8 days ago

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Paper • 2512.14051 • Published 9 days ago • 38

LHL3341

authored 5 papers 2 months ago

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Paper • 2410.09732 • Published Oct 13, 2024 • 54

A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis

Paper • 2504.12322 • Published Apr 11 • 28

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published Jul 23 • 36

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning

Paper • 2510.04081 • Published Oct 5 • 23

Where am I? Cross-View Geo-localization with Natural Language Descriptions

Paper • 2412.17007 • Published Dec 22, 2024

apeters

authored 13 papers 3 months ago

FABind: Fast and Accurate Protein-Ligand Binding

Paper • 2310.06763 • Published Oct 10, 2023

BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations

Paper • 2310.07276 • Published Oct 11, 2023 • 5

MolXPT: Wrapping Molecules with Text for Generative Pre-training

Paper • 2305.10688 • Published May 18, 2023 • 1

BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning

Paper • 2402.17810 • Published Feb 27, 2024 • 1

Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey

Paper • 2403.01528 • Published Mar 3, 2024 • 1

SSM-DTA: Breaking the Barriers of Data Scarcity in Drug-Target Affinity Prediction

Paper • 2206.09818 • Published Jun 20, 2022

3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization

Paper • 2406.05797 • Published Jun 9, 2024 • 2

Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining

Paper • 2410.08102 • Published Oct 10, 2024 • 21

Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change

Paper • 2210.17127 • Published Oct 31, 2022 • 1

NatureLM: Deciphering the Language of Nature for Scientific Discovery

Paper • 2502.07527 • Published Feb 11 • 20

LEMMA: Learning from Errors for MatheMatical Advancement in LLMs

Paper • 2503.17439 • Published Mar 21 • 15

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 306

A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis

Paper • 2504.12322 • Published Apr 11 • 28

AI & ML interests

Recent Activity

Team members 3

OpenDataArena-Community's activity