Track Chair: Asst. Prof. Dr. Maleerat Maliyaem, King Mongkut's University of Technology North Bangkok, Thailand
The rapid advancement of AI has made the integration of natural language and visual understanding a critical frontier in research. This track explores cutting-edge research at the intersection of natural language processing (NLP) and computer vision (CV). As multimodal AI systems become increasingly pivotal in real-world applications, from generative AI and robotics to healthcare and human-computer interaction, the track will foster discussion of innovative methodologies, challenges, and future directions for unifying linguistic and visual intelligence.
Topics
- Multimodal Learning and Fusion
- Image Captioning and Text-to-Image Generation
- Visual Question Answering and Cross-Modal Reasoning
- Multimodal Applications and Datasets
- Efficiency and Optimization of Multimodal Models
IMPORTANT DATES
- Submission opening: 11/01/2024
- Submission deadline: 04/25/2025
- Acceptance notification: 05/10/2025
- Camera-ready paper: 05/20/2025
- Registration and payment: 05/25/2025
* You are welcome to submit papers to NLPAI 2025 through the Electronic Submission System or by email to the conference mailbox: nlpai@cbees.net. (A full paper is required for publication; an abstract is sufficient for presentation only, without publication.)
* You are welcome to join NLPAI 2025 as a listener if you do not wish to publish a paper or present at the conference. Registration must be completed through the Online Registration System before the registration deadline.