README / README.md
davanstrien's picture
davanstrien HF Staff
update readme
a2c1456
metadata
title: README
emoji: πŸ“š
colorFrom: red
colorTo: indigo
sdk: static
pinned: false

UV Scripts

Ready-to-run ML tools powered by UV - zero setup, maximum power

Run state-of-the-art ML workflows with a single command. From OCR to classification, all scripts work instantly with uv run.

What are UV scripts?

UV scripts are self-contained Python scripts that use inline metadata to specify dependencies. Just uv run script.py and everything installs automatically.

Perfect for:

  • πŸš€ GPU workflows on HF Jobs
  • πŸ’» Local processing on your machine
  • πŸ”„ Reproducible pipelines that work anywhere

πŸš€ Quick Example

# Extract text from images with state-of-the-art OCR (no local GPU needed!)
hf jobs uv run --flavor l4x1 \
  https://huggingface.co/datasets/uv-scripts/ocr/raw/main/nanonets-ocr.py \
  your-images your-extracted-text

πŸ“š Browse Scripts

Script Collection Description GPU Required
ocr Extract text from images with VLMs (LaTeX, tables, forms) βœ…
classification Text classification with guaranteed valid outputs βœ…
dataset-creation Create datasets from PDFs and files ❌
vllm High-performance inference with vLLM βœ…

🎯 Why UV Scripts?

Zero Setup

No virtual environments, no dependency conflicts, no installation steps. UV handles everything automatically when you run the script.

GPU Optimized

Seamlessly run on local GPUs or scale to cloud with HF Jobs. Same script, different compute.

🌟 Featured Scripts

OCR Any Document Dataset

Extract text from images with state-of-the-art accuracy:

# Handles LaTeX, tables, forms, handwriting
hf jobs uv run --flavor l4x1 \
  https://huggingface.co/datasets/uv-scripts/ocr/raw/main/nanonets-ocr.py \
  your-images extracted-text

Classify with Guaranteed Valid Outputs

Text classification that always returns valid labels:

# Uses vLLM's structured generation - no invalid outputs!
hf jobs uv run --flavor l4x1 \
  https://huggingface.co/datasets/uv-scripts/classification/raw/main/classify-dataset.py \
  --input-dataset imdb --column text \
  --labels "positive,negative" --output-dataset imdb-classified

πŸš€ Getting Started with HF Jobs

Run any UV script on GPU infrastructure:

hf jobs uv run --flavor l4x1 \
  https://huggingface.co/datasets/uv-scripts/[collection]/raw/main/[script].py \
  [args]

Choose your GPU flavor:

  • l4x1 - Good balance for most tasks
  • a10g-large - More memory for larger models
  • a100-large - Maximum performance

πŸ“– Learn More


UV Scripts is a community project showcasing the power of UV for ML workflows.