Basic English Language Video

Photorealistic fire scene video generation via multimodal large language model and pre-trained video diffusion model

Abstract: Text-to-video diffusion models have made significant progress. However, there is still a lack of dedicated research on generating fire scene videos with physical realism and visual fidelity.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Photorealistic fire scene video generation via multimodal large language model and pre-trained video diffusion model

Trending now