microsoft / NUWA
- воскресенье, 28 ноября 2021 г. в 00:31:12
A unified 3D Transformer Pipeline for visual synthesis
This is the official repo for the paper: NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion.
NÜWA is a unified multimodal pre-trained model that can generate new or manipulate existing visual data (i.e., images and videos) for 8 visual synthesis tasks (as shown above).