LYL1015 commited on
Commit
a367400
·
1 Parent(s): 5545a01

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +70 -1
README.md CHANGED
@@ -7,4 +7,73 @@ tags:
7
  description: |
8
  This is the weights repository for CVPR 2025 JarvisIR paper.
9
  Contains all pretrained model weights used in the paper.
10
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  description: |
8
  This is the weights repository for CVPR 2025 JarvisIR paper.
9
  Contains all pretrained model weights used in the paper.
10
+ ---
11
+
12
+ # JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration
13
+
14
+ ## Model Description
15
+
16
+ JarvisIR is a novel vision-language model (VLM) based intelligent image restoration system designed for autonomous driving perception under adverse weather conditions. The system uses a VLM as a central controller to dynamically coordinate multiple expert restoration models for handling complex weather degradations including rain, fog, night scenes, and snow.
17
+
18
+ ## Key Features
19
+
20
+ - **VLM-based Controller**: First framework to use vision-language models for controlling image restoration workflows
21
+ - **Multi-Expert Coordination**: Dynamic scheduling of specialized restoration models (denoising, super-resolution, deraining, etc.)
22
+ - **Weather-Adaptive**: Handles multiple weather degradations: night/low-light, rain, fog, snow scenarios
23
+ - **Two-Stage Training**: Supervised Fine-Tuning (SFT) + Mixed-Rank Reward-based Human Feedback (MRRHF) alignment
24
+
25
+ ## Model Architecture
26
+
27
+ The system consists of:
28
+ 1. **VLM Controller**: Based on LLaVA-v1.5-7B for task planning and model selection
29
+ 2. **Expert Models**: Specialized restoration networks for different degradation types
30
+ 3. **Reward Models**: Multiple IQA models for quality assessment and alignment
31
+
32
+ ## Training Data
33
+
34
+ - **CleanBench-Synthetic**: 150K synthetic degraded images with annotations
35
+ - **CleanBench-Real**: 80K real-world adverse weather images for alignment training
36
+ - **Coverage**: Four main weather scenarios (night, rain, fog, snow) with multiple degradation combinations
37
+
38
+ ## Performance
39
+
40
+ - **50% average improvement** in perception metrics on CleanBench-Real compared to existing all-in-one methods
41
+ - Superior performance across all weather conditions tested
42
+ - Enhanced robustness and generalization to real-world scenarios
43
+
44
+ ## Intended Use
45
+
46
+ **Primary Applications:**
47
+ - Autonomous driving perception systems
48
+ - Multi-weather image restoration pipelines
49
+ - Research in vision-language model applications
50
+
51
+
52
+ ## Model Checkpoints
53
+
54
+ This repository contains weights for:
55
+ - `jarvisir`: Model after supervised fine-tuning and MRRHF alignment stage
56
+ - `expert-tools/`: Individual specialist restoration model weights
57
+
58
+
59
+ ## Citation
60
+
61
+ ```bibtex
62
+ @inproceedings{jarvisir2025,
63
+ title={JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration},
64
+ author={Lin, Yunlong and Lin, Zixu and Chen, Haoyu and Pan, Panwang and Li, Chenxin and Chen, Sixiang and Kairun, Wen and Jin, Yeying and Li, Wenbo and Ding, Xinghao},
65
+ booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
66
+ year={2025}
67
+ }
68
+ ```
69
+
70
+ ## Related Resources
71
+
72
+ - **Project Page**: https://cvpr2025-jarvisir.github.io/
73
+ - **Code Repository**: https://github.com/LYL1015/JarvisIR
74
+ - **Paper**: https://arxiv.org/pdf/2504.04158
75
+
76
+ ## Acknowledgments
77
+
78
+ This work advances the field of intelligent image restoration by combining vision-language models with expert system coordination, specifically targeting autonomous driving applications under challenging weather conditions.
79
+