Kishan

kishanpb

https://sites.google.com/a/tamu.edu/kpb/home

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values

upvoted a paper 3 days ago

TARDIS STRIDE: A Spatio-Temporal Road Image Dataset for Exploration and Autonomy

upvoted a paper 3 days ago

Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values

View all activity

Organizations

None yet

authored a paper 3 days ago

Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values

Paper • 2510.20187 • Published 4 days ago • 17

upvoted 2 papers 3 days ago

TARDIS STRIDE: A Spatio-Temporal Road Image Dataset for Exploration and Autonomy

Paper • 2506.11302 • Published Jun 12 • 3

Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values

Paper • 2510.20187 • Published 4 days ago • 17

upvoted a paper 24 days ago

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Paper • 2510.01591 • Published 25 days ago • 26

authored 2 papers about 1 month ago

TARDIS STRIDE: A Spatio-Temporal Road Image Dataset for Exploration and Autonomy

Paper • 2506.11302 • Published Jun 12 • 3

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18 • 33

upvoted a paper about 1 month ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18 • 33

upvoted an article 3 months ago

Article

<p style="text-align:center;"> Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs)</p>

and 2 others •

Jul 13

• 11

upvoted a collection 4 months ago

Reward Models

Collection

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 5 days ago • 21

liked a model 5 months ago

sarvamai/sarvam-m

Text Generation • 24B • Updated May 28 • 2.97k • 311

liked a dataset 5 months ago

Tera-AI/STRIDE

Viewer • Updated Aug 11 • 1.7M • 830 • 4

Kishan

AI & ML interests

Recent Activity

Organizations

kishanpb's activity

<p style="text-align:center;"> Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs)</p>