---
license: apache-2.0
pipeline_tag: depth-estimation
---

This repository contains the Camera Depth Model (CDM) from the paper [Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots](https://huggingface.co/papers/2509.02530).

Camera Depth Models (CDMs) are proposed as a simple plugin for everyday depth cameras. A CDM takes an RGB image and the camera's raw depth signal as input and outputs denoised, accurate metric depth. This gives robots accurate geometry perception and helps bridge the sim-to-real gap for manipulation tasks.

Project page: https://manipulation-as-in-simulation.github.io/
Code: https://github.com/ByteDance-Seed/manip-as-in-sim-suite

## Sample Usage

To run depth inference on RGB-D camera data, use the `infer.py` script in the `cdm` directory of the code repository linked above. Replace the `/path/to/...` placeholders with your own model checkpoint and image files.

```bash
cd cdm
python infer.py \
    --encoder vitl \
    --model-path /path/to/model.pth \
    --rgb-image /path/to/rgb.jpg \
    --depth-image /path/to/depth.png \
    --output result.png
```
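
To process many frames, the command above can be wrapped in a small Python helper. The following is a minimal sketch, not part of the repository: it uses only the flags shown above, and the function names (`build_infer_command`, `pair_frames`, `run_batch`), the directory layout, and the rule that an RGB `.jpg` and a depth `.png` belong together when they share a file stem are all assumptions made for illustration.

```python
from pathlib import Path
import subprocess
import sys


def build_infer_command(rgb_path, depth_path, output_path, model_path, encoder="vitl"):
    """Return the argv list for one infer.py call, using only the flags shown above."""
    return [
        sys.executable, "infer.py",
        "--encoder", encoder,
        "--model-path", str(model_path),
        "--rgb-image", str(rgb_path),
        "--depth-image", str(depth_path),
        "--output", str(output_path),
    ]


def pair_frames(rgb_dir, depth_dir):
    """Pair RGB and depth files that share a file stem (assumed naming scheme)."""
    depth_by_stem = {p.stem: p for p in Path(depth_dir).glob("*.png")}
    pairs = []
    for rgb in sorted(Path(rgb_dir).glob("*.jpg")):
        depth = depth_by_stem.get(rgb.stem)
        if depth is not None:
            pairs.append((rgb, depth))
    return pairs


def run_batch(rgb_dir, depth_dir, out_dir, model_path, cdm_dir="cdm"):
    """Run infer.py once per matched pair, from inside the cdm directory."""
    out_dir = Path(out_dir).resolve()
    out_dir.mkdir(parents=True, exist_ok=True)
    model_path = Path(model_path).resolve()
    for rgb, depth in pair_frames(rgb_dir, depth_dir):
        # Paths are made absolute because the subprocess runs with cwd=cdm_dir.
        cmd = build_infer_command(
            rgb.resolve(), depth.resolve(), out_dir / f"{rgb.stem}.png", model_path
        )
        subprocess.run(cmd, cwd=cdm_dir, check=True)
```

A call such as `run_batch("frames/rgb", "frames/depth", "frames/out", "/path/to/model.pth")` would then invoke `infer.py` once per matched pair and stop at the first failing frame, since `check=True` raises on a non-zero exit code. The directory names here are placeholders.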