OpenAI's DALL-E explained. How GPT-3 creates images from descriptions.

Jun 26, 2021 | 76 views
How can GPT-3 create an avocado armchair? Have a look at DALL·E, OpenAI's amazing new text-to-image generator. This video gives a high-level explanation of how it works and why it works this well.

📄 DALL-E blog, not a paper (yet): https://openai.com/blog/dall-e/
Play around with many input combinations! This is impressive.

📺 Ms. Coffee Bean's GPT-3 video: https://youtu.be/5fqxPOaaqi0

Outline:
* 00:00 DALL-E is here
* 02:26 How can it work?
* 04:00 Why does it work?
* 05:36 OpenAI is showing off ;)
* 08:25 Multimodality

📄 Image-GPT: Chen, M., Radford, A., Child, R., Wu, J., Jun, H., Luan, D., & Sutskever, I. (2020, November). Generative pretraining from pixels. In International Conference on Machine Learning (pp. 1691-1703). PMLR. http://proceedings.mlr.press/v119/chen20s/chen20s.pdf

📄 StackGAN++: Zhang, H., Xu, T., Li, H., Zhang, S., Wang, X., Huang, X., & Metaxas, D. N. (2018). StackGAN++: Realistic image synthesis with stacked generative adversarial networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(8), 1947-1962. https://arxiv.org/pdf/1710.10916v3.pdf

📄 StyleGAN2: Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., & Aila, T. (2020). Analyzing and improving the image quality of StyleGAN. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 8110-8119). https://arxiv.org/pdf/1912.04958.pdf

🔗 Links:
YouTube: https://www.youtube.com/AICoffeeBreak
Twitter: https://twitter.com/AICoffeeBreak
Reddit: https://www.reddit.com/r/AICoffeeBreak/

#AICoffeeBreak #MsCoffeeBean #OpenAI #DALL-E #MachineLearning #AI #research
