Pala Tej Deep

Tej3

Tej-Deep

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

From Perception to Action: An Interactive Benchmark for Vision Reasoning

Paper • 2602.21015 • Published Feb 24 • 23

authored 2 papers about 2 months ago

Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned

Paper • 2509.23250 • Published Sep 27, 2025 • 6

LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

Paper • 2508.18321 • Published Aug 24, 2025 • 2

updated a dataset about 2 months ago

Tej3/GAPO_data_phi4

Viewer • Updated Feb 19 • 9.66k • 5

published a dataset about 2 months ago

Tej3/GAPO_data_phi4

Viewer • Updated Feb 19 • 9.66k • 5

updated a dataset about 2 months ago

Tej3/GAPO_data_qwen3

Viewer • Updated Feb 19 • 9.66k • 5

published a dataset about 2 months ago

Tej3/GAPO_data_qwen3

Viewer • Updated Feb 19 • 9.66k • 5

updated a dataset about 2 months ago

Tej3/GAPO_data_llama32

Viewer • Updated Feb 19 • 9.66k • 5

published a dataset about 2 months ago

Tej3/GAPO_data_llama32

Viewer • Updated Feb 19 • 9.66k • 5

upvoted a paper 4 months ago

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

upvoted a paper 5 months ago

NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards

Paper • 2511.14659 • Published Nov 18, 2025 • 13

upvoted a paper 6 months ago

Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned

Paper • 2509.23250 • Published Sep 27, 2025 • 6

updated a model 11 months ago

Tej3/trustalign_qwen3_4b_dpo

4B • Updated May 30, 2025

published a model 11 months ago

Tej3/trustalign_qwen3_4b_dpo

4B • Updated May 30, 2025

updated a model 11 months ago

Tej3/trustalign_llama3.1_8b_dpo

8B • Updated May 30, 2025

published a model 11 months ago

Tej3/trustalign_llama3.1_8b_dpo

8B • Updated May 30, 2025

authored 2 papers 11 months ago

Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision

Paper • 2505.19706 • Published May 26, 2025 • 3

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Paper • 2412.11974 • Published Dec 16, 2024 • 10

upvoted a paper 11 months ago

Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision

Paper • 2505.19706 • Published May 26, 2025 • 3

New activity in declare-lab/PathFinder-600K 11 months ago

Update task category

#1 opened 11 months ago by nielsr
