File size: 1,379 Bytes
f56ede2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
# Download Model Checkpoints

This script downloads model checkpoints from the Hugging Face Hub based on configurations specified in a YAML file.

## Functionality
- **Load Configuration**: Reads a YAML configuration file to get model details.
- **Download Model**: Downloads files for specified models from the Hugging Face Hub to a local directory.
  - Checks for a valid `local_dir` in the configuration; skips download if `local_dir` is null.
  - Creates the local directory if it doesn't exist.
  - Supports `allow` and `deny` patterns to filter files:
    - If `allow` patterns are specified, only those files are downloaded.
    - If no `allow` patterns are provided, all files are downloaded except those matching `deny` patterns.
  - Uses `hf_hub_download` from the `huggingface_hub` library with symlinks disabled.

## Command-Line Arguments
- `--config_path`: Path to the YAML configuration file (defaults to `configs/model_ckpts.yaml`).

## Dependencies
- `argparse`: For parsing command-line arguments.
- `os`: For directory creation.
- `yaml`: For reading the configuration file.
- `huggingface_hub`: For downloading files from the Hugging Face Hub.

## Usage
Run the script with:
```bash
python scripts/download_ckpts.py --config_path <path_to_yaml>
```
The script processes each model in the configuration file, printing the model ID and local directory for each.