maitrix-org / Pandora
- понедельник, 27 мая 2024 г. в 00:00:09
Pandora: Towards General World Model with Natural Language Actions and Video States
We introduce Pandora, a step towards a General World Model (GWM) that:
Please refer to world-model.ai for results.
[Website] [Paper] [Model] [Gallery]
conda create -n pandora python=3.12.3 nvidia/label/cuda-12.1.0::cuda-toolkit -y
conda activate pandora
pip install torch torchvision torchaudio
bash build_envs.sh
If your GPU doesn't support CUDA 12.1, you can also install with CUDA 11.8:
conda create -n pandora python=3.12.3 nvidia/label/cuda-11.8.0::cuda-toolkit -y
conda activate pandora
pip install torch torchvision torchaudio
bash build_envs.sh
CUDA_VISIBLE_DEVICES={cuda_id} python gradio_app.py --ckpt_path {path_to_ckpt}
Then you can interact with the model through gradio interface.
@article{xiang2024pandora,
title={Pandora: Towards General World Model with Natural Language Actions and Video States},
author={Jiannan Xiang and Guangyi Liu and Yi Gu and Qiyue Gao and Yuting Ning and Yuheng Zha and Zeyu Feng and Tianhua Tao and Shibo Hao and Yemin Shi and Zhengzhong Liu and Eric P. Xing and Zhiting Hu},
year={2024}
}