The talk will cover generative modeling for multimodal input (image and text) in the context of product retrieval in fashion/e-commerce. The presentation will include examples of applying GAN architectures for image generation with multimodal query using architectures derived from StackGAN, AttnGAN, as well as author's own recent research.
Download the slides for this talk.Download ( PDF, 3142.71 MB)