Name:
Kilichbek_Haydarov_MS_Thesis_final.pdf
Size:
5.258Mb
Format:
PDF
Description:
MS Thesis for Kilichbek Haydarov
Type
ThesisAuthors
Haydarov, Kilichbek
Advisors
Elhoseiny, Mohamed
Committee members
Wonka, Peter
Michels, Dominik
Program
Computer ScienceDate
2021-11-30Permanent link to this record
http://hdl.handle.net/10754/673850
Metadata
Show full item recordAbstract
Developing intelligent systems that can recognize and express human affects is essential to bridge the gap between human and artificial intelligence. This thesis explores the creative and emotional frontiers of artificial intelligence. Specifically, in this thesis, we investigate the relation between the affective impact of visual stimuli and natural language by collecting and analyzing a new dataset called ArtEmis. Furthermore, capitalizing on this dataset, we demonstrate affective AI models that can emotionally talk about artwork and generate them given their affective descriptions. In text-to-image generation task, we present HyperCGAN: a conceptually simple and general approach for text-to-image synthesis that uses hypernetworks to condition a GAN model on text. In our setting, the generator and the discriminator weights are controlled by their corresponding hypernetworks, which modulate weight parameters based on the provided text query. We explore different mechanisms to modulate the layers depending on the underlying architecture of a target network and the structure of the conditioning variable.Citation
Haydarov, K. (2021). Towards Affective Vision and Language. KAUST Research Repository. https://doi.org/10.25781/KAUST-18B50ae974a485f413a2113503eed53cd6c53
10.25781/KAUST-18B50