LSDM
Language-driven Scene Synthesis using Multi-conditional Diffusion Model
Description
Our main contribution is the Guiding Points Network, where we integrate all information from the conditions to generate guiding points. By applying transformation matrices to scene entities (human/objects) with attention weighting, we can forecast the spanning of the target object.