5 7 39

Nikolai Skripko

NikolaiSkripko

https://github.com/Skripkon

skripkon

AI & ML interests

LLMs, Data Mining

Recent Activity

upvoted a paper 14 days ago

Instruction-Following Evaluation in Function Calling for Large Language Models

liked a model 17 days ago

ai-sage/GigaChat3.1-10B-A1.8B-GGUF

upvoted a collection 17 days ago

GigaChat3

View all activity

Organizations

None yet

upvoted a paper 14 days ago

Instruction-Following Evaluation in Function Calling for Large Language Models

Paper • 2509.18420 • Published Sep 22, 2025 • 3

liked a model 17 days ago

ai-sage/GigaChat3.1-10B-A1.8B-GGUF

Text Generation • 11B • Updated 20 days ago • 16.9k • 58

upvoted a collection 17 days ago

GigaChat3

Collection

6 items • Updated Dec 5, 2025 • 18

liked 2 models 19 days ago

#12 opened 22 days ago by

NikolaiSkripko

liked a model 20 days ago

ai-sage/GigaChat3.1-10B-A1.8B

Text Generation • 11B • Updated 20 days ago • 2.35k • 27

upvoted a collection 21 days ago

GigaChat 3.1

Collection

6 items • Updated 22 days ago • 58

liked a model about 1 month ago

Qwen/Qwen3-4B-Thinking-2507

Text Generation • 4B • Updated Aug 6, 2025 • 1.71M • • 577

liked a dataset about 1 month ago

zai-org/ComplexFuncBench

Updated Jan 22, 2025 • 147 • 14

upvoted a collection about 1 month ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.51k

New activity in Qwen/Qwen3-4B-Instruct-2507 2 months ago

Discrepancy in benchmark score (BFCL-v3)

#18 opened 5 months ago by

mmrbulbul

liked a model 3 months ago

Qwen/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated Sep 17, 2025 • 7.31M • • 808

upvoted a paper 3 months ago

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Paper • 2601.18137 • Published Jan 26 • 35

liked a dataset 3 months ago

Qwen/DeepPlanning

Viewer • Updated Mar 3 • 2.14k • 646 • 194

liked 2 models 3 months ago

Skywork/Skywork-Reward-V2-Qwen3-8B

Text Classification • 8B • Updated Jul 6, 2025 • 8.84k • 24

Skywork/Skywork-Reward-V2-Llama-3.1-8B

Text Classification • 8B • Updated Jul 6, 2025 • 48.3k • 42

liked a dataset 3 months ago

RioLee/TRBench-BFCL

Viewer • Updated Jan 14 • 11.9k • 44 • 3

upvoted a paper 3 months ago

One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning

Paper • 2510.26167 • Published Oct 30, 2025 • 3

New activity in RioLee/ToolRM-Gen-Qwen3-4B-Thinking-2507 3 months ago

Weak spot of the model

#1 opened 3 months ago by

NikolaiSkripko

Nikolai Skripko

AI & ML interests

Recent Activity

Organizations

NikolaiSkripko's activity

⚠️ Benchmark Leaks

Discrepancy in benchmark score (BFCL-v3)

Weak spot of the model