I am a researcher working at the intersection of NLP, computer vision, and multimodal learning, with a focus on large multimodal models and video understanding. I received my PhD from the National University of Singapore in September 2025, advised by Professor See-Kiong Ng and Professor Anh-Tuan Luu. My research focuses on building and understanding multimodal models for language, vision, and video.
Specifically, my research program centers on grounding multimodal models in time — building systems that base their predictions on concrete visual evidence in dynamic environments, rather than producing fluent but ungrounded answers. It is organized along three directions:
-
Building and benchmarking temporal grounding systems. Architectures and benchmarks that force models to recover the temporal evidence supporting their predictions, not just produce plausible final answers. [DemaFormer (EMNLP’23)] [Motion-aware Contrastive (AAAI’25)] [Multi-Scale Contrastive (AAAI’25)] [STEMO (arXiv’26)]
-
Learning from partial temporal supervision. Treating noisy, incomplete video–text alignment as a core modeling challenge so that models avoid spurious correlations and adapt efficiently under realistic data. [MAMA (ECCV’24)] [READ (AAAI’24)]
-
Visual foresight. Anticipating future visual states via retrieval-grounded prediction and physics-aware motion-guided animation, rather than free-form generative hallucination. [Eulerian Motion Guidance (arXiv’26)]
My broader vision is to build the temporal infrastructure that makes autonomous multimodal agents trustworthy in dynamic environments — turning temporal grounding from a niche task into a core architectural requirement for reliable multimodal intelligence.
I am open to opportunities for collaboration and new research ideas. If you are interested in working with me, please feel free to contact via my email.
News
| Mar 24, 2026 | Invited to serve as Area Chair for NeurIPS 2026 |
|---|---|
| Sep 2, 2025 | Succesfully defended my PhD thesis: Video Understanding - Through a Temporal Lens |
| Apr 9, 2025 | Invited to serve as Area Chair for NeurIPS 2025. |
| Dec 10, 2024 | Two papers accepted to AAAI 2025. |
| Oct 25, 2024 | Invited to serve as Area Chair for NAACL 2025. |
| Sep 26, 2024 | One paper accepted to NeurIPS 2024. |
| Sep 20, 2024 | One paper accepted to EMNLP 2024. |
| Jul 2, 2024 | One paper accepted to ECCV 2024. |
| Jun 21, 2024 | Invited to serve as Area Chair for EMNLP 2024. |
| May 16, 2024 | Two papers accepted to ACL 2024. |
| Mar 14, 2024 | One paper accepted to NAACL 2024. |
| Dec 20, 2023 | One paper accepted to Artificial Intelligence Review. |
| Dec 10, 2023 | Two papers accepted to AAAI 2024. |
Selected Publications
For the full list of my publications: Publications, Google Scholar.
2026
2025
- In Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI 2025)
- In Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI 2025)
2024
- In Advances in Neural Information Processing Systems (NeurIPS 2024)
- In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)
- In Proceedings of the 18th European Conference on Computer Vision (ECCV 2024)
- In Proceedings of Findings of 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 Findings)
- In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)
- In Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
- In Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024)
2023
- In Proceedings of Findings of 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023 Findings)
- In Proceedings of the 40th International Conference on Machine Learning (ICML 2023)
- In Proceedings of Findings of 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023 Findings)
- In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI 2023)
- In Proceedings of Findings of 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023 Findings)
2022
- In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)
- In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)
2021
- In Advances in Neural Information Processing Systems (NeurIPS 2021)
- In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)