Text2Place: Affordance Aware Text Guided Human Placement
[ECCV 2024]

Rishubh Parihar, Harsh Gupta, Sachidanand VS, Venkatesh Babu

Vision and AI Lab, IISc Bangalore

Setup Environment

Clone the repository:

git clone https://github.com/Harsh-Gupta9897/Text2Place

Create a virtual environment (vnv) and activate it:
```
python3 -m venv vnv
source vnv/bin/activate
```
Install the required dependencies from requirements.txt:
```
pip install -r requirements.txt
```

Generating Mask

In this section, we will generate a mask using the modified code from stable_diffusion_guidance.py and mask_generator.py files.

Open stable_diffusion_guidance.py located at Text2Place/threestudio/threestudio/models/guidance/stable_diffusion_guidance.py and make the following changes (it is optional):

# Add your code changes here
# for multiple persons(n_persons), 
# n= total number of blobs required, default=5 for each person
self.n_persons = 1
self.n = 5 * self.n_persons      

self.sigmoid = nn.Sigmoid()
self.tanh = nn.Tanh()

# Centers for each blob, shape: n x 2, in case for static fixed centers
# self.center = nn.Parameter(torch.tensor([[-1.0,-3.0],[-1.0,-1.0],[-1.0,-1.0],[-1.0,-3.0],[-1.0,2.0]]))

# Centers for each blob, shape: 1 x 2, in case for dynamic centers
self.center = nn.Parameter(torch.tensor([[-1.0,-3.0]]))

# parameters for each blob
self.scale = nn.Parameter(torch.ones(self.n) * 0.6)
self.aspect_ratio = nn.Parameter(torch.ones(self.n) * 2.0)  # a ∈ R for each blob
self.angle = nn.Parameter(torch.ones(self.n))  # θ ∈ [−π, π] for each blob

# relative angles 
self.relative_angles = nn.Parameter(torch.ones(self.n))
self.radius = 0.1  # 0.07, 0.216, 0.13 - change this to play with the blob parameters

Open mask_generator.py and make the following changes:

dump_path = '/path/to/dump_dir'
input_path = '/path/to/folder/images_folder_name_present'   # input_dir
output_path = 'path/to/output_dir'   # output_dir
output_folder_name = 'output_dir_name'  # output_dir_name
input_folder_name = 'images_folder_name'  # input_bg_image_folder_path
save_video_frames = False  # to save video frames in case to visualize it
n_iters = 1000  # number of iterations to generate mask
thresold = 0.2
guidance_scale = 200.
one_person = True

Run the modified code to generate the mask:
```
python mask_generator.py
```

That's it! You have successfully generated a mask using the modified code.

Inpainting with the above Generated mask.

Go Inside the inpainting_pipeline folder. cd inpainting_pipeline

Generate the textual_embedding for particular subject by using textual_inversion.py file:
```
    bash generate_ti_run.sh
```
after getting the embedding, provide the path in inpaint_run.sh file, and run it
```
    bash inpaint_run.sh
```

After running the file , you will get the result in same output folder

Acknowledgements

This project uses code from the threestudio repository. We have made modifications to their original code to suit our specific requirements.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
inpainting_pipeline		inpainting_pipeline
threestudio		threestudio
Readme.md		Readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text2Place: Affordance Aware Text Guided Human Placement
[ECCV 2024]

Setup Environment

Generating Mask

Inpainting with the above Generated mask.

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

Harsh-Gupta9897/Text2Place

Folders and files

Latest commit

History

Repository files navigation

Text2Place: Affordance Aware Text Guided Human Placement [ECCV 2024]

Setup Environment

Generating Mask

Inpainting with the above Generated mask.

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Text2Place: Affordance Aware Text Guided Human Placement
[ECCV 2024]

Packages