Spaces:
Running
Running
title: Evaluation Dataset Quiz | |
emoji: π§ | |
colorFrom: blue | |
colorTo: green | |
sdk: gradio | |
sdk_version: 4.19.2 | |
app_file: app.py | |
pinned: false | |
license: mit | |
# HuggingFace Evaluation Dataset Quiz | |
Test your knowledge with questions from popular evaluation datasets! | |
## Features | |
- π― Interactive quiz interface built with Gradio | |
- π 8 popular evaluation datasets including: | |
- GSM8K (Grade School Math) | |
- MMLU (Massive Multitask Language Understanding) | |
- AI2 ARC (Science Questions) | |
- HellaSwag (Commonsense NLI) | |
- WinoGrande (Winograd Schema) | |
- BoolQ (Boolean Questions) | |
- SQuAD (Reading Comprehension) | |
- PIQA (Physical Reasoning) | |
- π² Random question selection | |
- β Immediate feedback on answers | |
- π Score tracking | |
- π Support for multiple question formats: | |
- Multiple choice | |
- True/False | |
- Text input for QA tasks | |
## How to Use | |
1. **Select a Dataset**: Choose from the available evaluation datasets | |
2. **Choose Number of Questions**: Select how many questions you want (5-20) | |
3. **Start Quiz**: Click "Start Quiz" to begin | |
4. **Answer Questions**: Select or type your answer and click "Submit Answer" | |
5. **Get Feedback**: See if you got it right and learn the correct answer | |
6. **Continue**: Click "Next Question" to proceed | |
7. **View Score**: See your final score at the end | |
## Local Development | |
```bash | |
# Clone the repository | |
git clone <your-repo-url> | |
cd eval_quiz_app | |
# Install dependencies | |
pip install -r requirements.txt | |
# Run the app | |
python app.py | |
``` | |
## Deployment | |
This app is designed to run on HuggingFace Spaces. Simply push to your Space repository and it will deploy automatically. | |
## Contributing | |
Feel free to add more datasets or improve the quiz functionality! |