在多模态生成视频或3D视频的领域,以下是一些相关的研究论文和文献:
- “Text-to-Video Synthesis with Cascaded Generative Adversarial Networks” by Jiwei Li et al. (2018)
- “Talking Face Generation by Adversarially Disentangled Audio-Visual Representation” by Hyeongwoo Kim et al. (2018)
- “Neural Voice Puppetry: Audio-driven Facial Reenactment” by Eli Shlizerman et al. (2017)
- “Generative Adversarial Text to Image Synthesis” by Scott Reed et al. (2016)
- “Deep Visual-Semantic Alignments for Generating Image Descriptions” by Andrej Karpathy and Li Fei-Fei (2015)
- “Sequence to Sequence – Video to Text” by Subhashini Venugopalan et al. (2015)
这只是一小部分相关研究的例子。您可以通过学术搜索引擎(如Google Scholar)进一步探索这个领域,并找到更多相关的论文和研究成果。
内容由零声教学AI助手提供,问题来源于学员提问