- 1301.3781 - Efficient Estimation of Word Representations in Vector Space
- 1406.2661 - Generative Adversarial Nets
- 1502.03167 - Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
- 1706.03762 - Attention is All You Need
- 1906.04358 - Weight Agnostic Neural Networks
- 2010.11929 - An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
- 2112.10752 - High-Resolution Image Synthesis with Latent Diffusion Models
- 2201.11903 - Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
- 2203.14465 - STaR: Bootstrapping Reasoning With Reasoning
- 2207.12598 - Classifier-Free Diffusion Guidance
- 2212.09748 - Scalable Diffusion Models with Transformers
- 2305.18290 - Direct Preference Optimization: Your Language Model is Secretly a Reward Model
- 2312.09390 - Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
- 2402.06782 - Debating with More Persuasive Language Models Leads to More Truthful Answers
- 2403.13187 - Evolutionary Optimization of Model Merging Recipes
- 2406.04692 - Mixture-of-Agents Enhances Large Language Model Capabilities
- 2408.06292 - The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
demo
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||