DPT/README.md at main · ranftlr/DPT

Vision Transformers for Dense Prediction

This repository contains code and models for:

Vision Transformers for Dense Prediction

Monodepth:

Segmentation:

Place one or more input images in the folder input.
Run a monocular depth estimation model:
```
python run_monodepth.py
```
Or run a semantic segmentation model:
```
python run_segmentation.py
```
The results are written to the folder output_monodepth and output_segmentation, respectively.

Use the flag -t to switch between different models. Possible options are dpt_hybrid (default) and dpt_large.