LogicalLargeLanguageModels

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

cedzhang authored a paper about 2 months ago

Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic Models

cedzhang authored a paper about 2 months ago

On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts

cedzhang authored a paper about 2 months ago

Evaluating Language Models' Evaluations of Games

View all activity

cedzhang

authored 4 papers about 2 months ago

Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic Models

Paper • 2507.12547 • Published Jul 16, 2025

On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts

Paper • 2509.06952 • Published Sep 8, 2025

Evaluating Language Models' Evaluations of Games

Paper • 2510.10930 • Published Oct 13, 2025

Code-enabled language models can outperform reasoning models on diverse tasks

Paper • 2510.20909 • Published Oct 23, 2025 • 1

cedzhang

authored a paper 2 months ago

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

Paper • 2310.15164 • Published Oct 23, 2023 • 3

benlipkin

authored 3 papers 9 months ago

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

Paper • 2310.15164 • Published Oct 23, 2023 • 3

Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

Paper • 2405.09605 • Published May 15, 2024

Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling

Paper • 2504.05410 • Published Apr 7, 2025 • 2

minimario

authored 5 papers almost 2 years ago

SantaCoder: don't reach for the stars!

Paper • 2301.03988 • Published Jan 9, 2023 • 7

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

Paper • 2310.15164 • Published Oct 23, 2023 • 3

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Paper • 2403.07974 • Published Mar 12, 2024 • 3

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 152

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Paper • 2401.03065 • Published Jan 5, 2024 • 11

minimario

authored a paper over 2 years ago

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Paper • 2306.15626 • Published Jun 27, 2023 • 17

benlipkin

authored a paper over 2 years ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 31

minimario

authored a paper over 2 years ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 31

AI & ML interests

Recent Activity

Team members 3

L3M's activity