Workshop Abstract
Comics are a uniquely compelling visual storytelling medium, blending images and text to convey intricate narratives. Unlike other visual media such as photographs or videos, comics rely on discrete panels, stylized characters, and implicit transitions that require readers to infer context and causality. The interplay between visual elements, speech bubbles, and captions enables rich, multimodal communication, making comics both a fascinating artistic domain and a challenging testbed for AI.
Despite rapid progress in vision-language models, AI systems continue to struggle with comic understanding. Unlike natural images, which depict real-world scenes, or structured documents, which follow rigid layouts, comics present highly abstract and diverse representations. Tasks such as panel sequencing, entity tracking, and cross-panel reasoning remain difficult even for state-of-the-art models. Current approaches often fail to handle the complexities of character consistency across panels, implicit storytelling gaps, and the multimodal fusion of text and imagery.
This workshop will bring together researchers from computer vision, cognitive science, and multimedia analysis to advance AI-driven comic understanding. Through invited talks, discussions, and presentations, we will explore new methodologies for multimodal reasoning, self-supervised learning, and dataset curation.
Workshop Schedule
📅 Sunday, October 19th, 2025 | 🕐 13:30 - 17:00 (Hawaii Time)
📍 Venue: Room 305 A | 🌐 Virtual: Join on Zoom
PDT: 4:30 PM - 8:00 PM (Sunday, Oct 19th)
(15 minutes)
Opening Remarks
Introduction and workshop overview
(30 minutes)
(30 minutes)
(15 minutes)
Coffee Break
Networking and informal discussions
(15 minutes)
Oral Presentation #1: MangaVQA
Accepted paper presentation
(15 minutes)
Oral Presentation #2: Onomatopoeia Generation
Accepted paper presentation
(30 minutes)
(30 minutes)
(20 minutes)
Panel Discussion
Open discussion with speakers and attendees
(10 minutes)
Closing Remarks
Workshop wrap-up and future directions
Invited Speakers
Call for Papers
Paper submissions are now closed. We received excellent submissions related to comic understanding, multimodal analysis, and AI-driven comic analysis. Accepted papers will be presented during the workshop, with two exceptional contributions selected for oral presentations. Please note that the accepted papers will NOT appear in conference proceedings.
Paper Format
2-4 pages in ICCV format, including references. Papers will not appear in ICCV proceedings but will be showcased to domain experts.
Topics of Interest
Comic understanding, panel analysis, character recognition, multimodal reasoning, visual storytelling, accessibility, and related AI applications.
Presentation Formats
Accepted papers will be presented as posters. Outstanding contributions may be selected for oral presentations during the workshop.
Important Dates
Paper Submission Opens
Submission system opens for 2-4 page papers
Submission Deadline
Final deadline for paper submissions
Notification of Acceptance
Authors will be notified of acceptance and presentation format
Workshop & Presentations
Poster and oral presentations at ICCV 2025 workshop
Submit Your Paper
Ready to submit? Use our submission form to upload your 2-4 page paper and participate in this exciting workshop. The review process is single-blind.
Organizers
Deblina Bhattacharjee
University of Bath