Text Generation
Transformers
Safetensors
qwen3
conversational
text-generation-inference
konstantgr nielsr HF Staff committed on
Commit 0f145e4 · verified · 1 Parent(s): 100f15d

Improve model card: Add paper and code badges, update datasets metadata (#1)


- Improve model card: Add paper and code badges, update datasets metadata (8749329d1cb741c183f50c5dfde43081306e8b56)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +10 -5
README.md CHANGED
@@ -1,12 +1,15 @@
  ---
- library_name: transformers
- datasets:
- - JetBrains-Research/envbench-zeroshot-rl
  base_model:
  - JetBrains-Research/Qwen3-8B-am
- pipeline_tag: text-generation
  license: mit
  ---
  <img src="https://github.com/JetBrains-Research/PIPer/blob/main/misc/piper-logo.png?raw=true" alt="PIPer Mascot" style="height: 6em">
  <h1>
  PIPer: On-Device Environment Setup via Online Reinforcement Learning
@@ -15,6 +18,8 @@ license: mit

  <div align="center">

  [![Models](https://img.shields.io/badge/🤗%20Hugging%20Face-Models-orange.svg)](https://jb.gg/PIPer)
  [![Dataset](https://img.shields.io/badge/🤗%20Hugging%20Face-Dataset-green.svg)](https://huggingface.co/datasets/JetBrains-Research/PIPer-envbench-zeroshot-rl)
  [![License](https://img.shields.io/badge/License-MIT-red.svg)](LICENSE)
@@ -96,4 +101,4 @@ uv run piper/hparams_entrypoint.py +experiment=llm-reward --info config

  ## 📄 License

- This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
 
  ---
  base_model:
  - JetBrains-Research/Qwen3-8B-am
+ datasets:
+ - JetBrains-Research/PIPer-envbench-zeroshot-rl
+ - JetBrains-Research/PIPer-SFT-2500-sharegpt
+ - JetBrains-Research/PIPer-eval
+ library_name: transformers
  license: mit
+ pipeline_tag: text-generation
  ---
+
  <img src="https://github.com/JetBrains-Research/PIPer/blob/main/misc/piper-logo.png?raw=true" alt="PIPer Mascot" style="height: 6em">
  <h1>
  PIPer: On-Device Environment Setup via Online Reinforcement Learning
 

  <div align="center">

+ [![Paper](https://img.shields.io/badge/📖-Paper-b31b1b.svg)](https://huggingface.co/papers/2509.25455)
+ [![Code](https://img.shields.io/badge/💻-Code-blue.svg)](https://github.com/JetBrains-Research/PIPer)
  [![Models](https://img.shields.io/badge/🤗%20Hugging%20Face-Models-orange.svg)](https://jb.gg/PIPer)
  [![Dataset](https://img.shields.io/badge/🤗%20Hugging%20Face-Dataset-green.svg)](https://huggingface.co/datasets/JetBrains-Research/PIPer-envbench-zeroshot-rl)
  [![License](https://img.shields.io/badge/License-MIT-red.svg)](LICENSE)
 

  ## 📄 License

+ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.