A Collection of Text-to-Image Generation Studies

This GitHub repository summarizes papers and resources related to the text-to-image (T2I) generation task.

Note

This document serves as the homepage of the whole GitHub repo. Papers are summarized according to different research directions, published years, and conferences.

The topics section summarizes papers that are highly related to T2I generation according to different properties, e.g., prerequisites of T2I generation, diffusion models with other techniques (e.g., Diffusion Transformer, LLMs, Mamba, etc.), and diffusion models for other tasks.

If you have any suggestions about this repository, please feel free to start a new issue or pull requests.

Recent news of this GitHub repo are listed as follows.

🔥 [Nov. 19th] We have released our latest paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing", with the correponding code, model weights, and a testing benchmark DAVIS-Edit open-sourced. Feel free to check them out from the links!

Click to see more information.

[Apr. 26th] Update a new topic: Diffusion Models Meet Federated Learning. See the topics section for more details!
[Mar. 28th] The official AAAI 2024 paper list are released! Official version of PDFs and BibTeX references are updated accordingly.
[Mar. 21th] The topics section has been updated. This section aims to offer paper lists that are summarized according to other properties of diffusion models, e.g., Diffusion Transformer-based methods, diffusion models for NLP, diffusion models integrated with LLMs, etc. The corresponding references of these papers are also concluded in reference.bib.
[Mar. 7th] All available CVPR, ICLR, and AAAI 2024 papers and references are updated.
[Mar. 1st] Websites of the off-the-shelf text-to-image generation products and toolkits are summarized.

To-Do Lists

Published Papers on Conferences
- Update NeurIPS 2024 Papers
- Update ECCV 2024 Papers
- Update CVPR 2024 Papers
  - Update ⚠️ Papers and References
  - Update arXiv References into the Official Version
- Update AAAI 2024 Papers
  - Update ⚠️ Papers and References
  - Update arXiv References into the Official Version
- Update ICLR 2024 Papers
- Update NeurIPS 2023 Papers
Regular Maintenance of Preprint arXiv Papers and Missed Papers

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
github-materials		github-materials
topics		topics
LICENSE		LICENSE
README.md		README.md
reference.bib		reference.bib

Name	Year	Website	Specialties
Stable Diffusion 3	2024	link	Diffusion Transformer-based Stable Diffusion
Stable Video	2024	link	High-quality high-resolution images
DALL-E 3	2023	link	Collaborate with ChatGPT
Ideogram	2023	link	Text images
Playground	2023	link	Athestic images
HiDream.ai	2023	link	-
Dashtoon	2023	link	Text-to-Comic Generation
WHEE	2023	link	WHEE is an online AI generation tool, which can be applied for T2I generation, I2I generation, SR, inpainting, outpainting, image variation, virtural try-on, etc.
Vega AI	2023	link	Vega AI is an online AI generation tool, which can be applied for T2I generation, I2I generation, SR, T2V generation, I2V generation, etc.
Wujie AI	2022	link	The Chinese name is "无界AI", offering AIGC resources and online services
Midjourney	2022	link	Powerful close-sourced generation tool

Name	Website	Description
Stable Diffusion WebUI	link	Built based on Gradio, deployed locally to run Stable Diffusion checkpoints, LoRA weights, ControlNet weights, etc.
Stable Diffusion WebUI-forge	link	Built based on Gradio, deployed locally to run Stable Diffusion checkpoints, LoRA weights, ControlNet weights, etc.
Fooocus	link	Built based on Gradio, offline, open source, and free. The manual tweaking is not needed, and users only need to focus on the prompts and images.
ComfyUI	link	Deployed locally to enable customized workflows with Stable Diffusion
Civitai	link	Websites for community Stable Diffusion and LoRA checkpoints

License

AlonzoLeeeooo/awesome-text-to-image-studies

Folders and files

Latest commit

History

Repository files navigation

A Collection of Text-to-Image Generation Studies

Contents

To-Do Lists

Products

Papers

Survey Papers

Text-to-Image Generation

Conditional Text-to-Image Generation

Personalized Text-to-Image Generation

Text-Guided Image Editing

Text Image Generation

Datasets

Toolkits

Q&A

References

Star History

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages