Abstract: Latent Diffusion Models have emerged as an efficient alternative to conventional diffusion approaches by compressing high-dimensional images into a lower-dimensional latent space using a ...
Abstract: Generalizing to unseen scenes presents a significant challenge to semantic segmentation models, as it aims to replicate the human adaptability. Existing deep learning models still suffer ...
We introduce a video diffusion transformer to design metasurfaces with a given Eletromagnetic response via generating current distributions at different frequencies. To use the pretained models, start ...
Important Note: This repository implements SVG-T2I, a text-to-image diffusion framework that performs visual generation directly in Visual Foundation Model (VFM) representation space, rather than ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results