Search filters

List of works by Liqiang Nie

Adapting Generative Pretrained Language Model for Open-domain Multimodal Sentence Summarization

scientific article published on 19 July 2023

Attribute-driven Disentangled Representation Learning for Multimodal Recommendation

scientific article published on 26 October 2024

Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning

scientific article published on 26 October 2024

Diffusion Facial Forgery Detection

scientific article published on 26 October 2024

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR

scientific article published on 27 October 2023

Explicit Granularity and Implicit Scale Correspondence Learning for Point-Supervised Video Moment Localization

scientific article published on 26 October 2024

Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning

scientific article published on 27 October 2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

scientific article published on 27 October 2023

Let Me Show You Step by Step: An Interpretable Graph Routing Network for Knowledge-based Visual Question Answering

scientific article published on 11 July 2024

MIS '24: 1st ACM Multimedia Workshop on Multi-modal Misinformation Governance in the Era of Foundation Models

scientific article published on 17 October 2024

Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model

scientific article published on 06 October 2023

NovaChart: A Large-scale Dataset towards Chart Understanding and Generation of Multimodal Large Language Models

scientific article published on 26 October 2024

Revisiting Unsupervised Temporal Action Localization: The Primacy of High-Quality Actionness and Pseudolabels

scientific article published on 26 October 2024

TME: Tree-guided Multi-task Embedding Learning towards Semantic Venue Annotation

scientific article published on 01 February 2023