BRIDGING LANGUAGE AND VISUALS THROUGH AI
Keywords:
Keywords: computer vision, machine learning, content personalization, accessibility, communication, interactive video, real-time generation, ethical AI,automated content creation, algorithmic video productionAbstract
Absrtact: It employs natural language processing and multimedia tools to select visuals, generate audio, and assemble videos automatically. Despite its potential for content creation and accessibility, challenges persist in accurately interpreting complex text and maintaining creative quality. Anticipated advancements include enhanced personalization and real-time generation, fostering applications in education, marketing, and beyond.
References
"Text-to-Video: Generating Descriptive Video Content from Text"Authors: Haoran Wang, Yueh-Hua Wu, Ming-Ting Sun.Published in: 2019 IEEE International Conference on Multimedia and Expo (ICME)
"Text-to-Video Synthesis via Adversarial Cross-Modal Retrieval"Authors: Yitong Li, Chenliang Xu.Published in: 2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
"Generative Adversarial Text-to-Image Synthesis",Authors: Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee,Published in: 2016 Proceedings of The 33rd International Conference on Machine Learning
"Learning to Generate Images from Text Descriptions",Authors: Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee,Published in: 2016 arXiv preprint arXiv:1511.02793
"A Review on Text to Video Synthesis Techniques",Authors: Renuka Devi K., S. Kalaivani,Published in: 2020 International Conference on Communication and Electronics Systems (ICCES)
"Deep Text-to-Video Synthesis with Natural Language",Authors: Hsuan-I Ho, Wei-Chen Chiu, Yu-Sheng Chen, Chu-Song Chen,Published in: 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)