Interpolating the Text-to-Image Correspondence Based on Phonetic and Phonological Similarities for Nonword-to-Image Generation
Text-to-Image (T2I) generation is the task of synthesizing images corresponding to a Gift Card given text input.The recent innovations in artificial intelligence have enhanced the capacity of conventional T2I generation, yielding more and more powerful models day by day.However, their behavior is known to become unstable in the face of text inputs