COMIQ: Comic Intelligence Quotient

Workshop Abstract

Comics are a uniquely compelling visual storytelling medium, blending images and text to convey intricate narratives. Unlike other visual media such as photographs or videos, comics rely on discrete panels, stylized characters, and implicit transitions that require readers to infer context and causality. The interplay between visual elements, speech bubbles, and captions enables rich, multimodal communication, making comics both a fascinating artistic domain and a challenging testbed for AI.

Despite rapid progress in vision-language models, AI systems continue to struggle with comic understanding. Unlike natural images, which depict real-world scenes, or structured documents, which follow rigid layouts, comics present highly abstract and diverse representations. Tasks such as panel sequencing, entity tracking, and cross-panel reasoning remain difficult even for state-of-the-art models. Current approaches often fail to handle character consistency across panels, implicit storytelling gaps, and the multimodal fusion of text and imagery.

This workshop will bring together researchers from computer vision, cognitive science, and multimedia analysis to advance AI-driven comic understanding. Through invited talks, discussions, and presentations, we will explore new methodologies for multimodal reasoning, self-supervised learning, and dataset curation.

Workshop Schedule

📅 Sunday, October 19th, 2025 | 🕐 13:30 - 17:00 (Hawaii Time)

📍 Venue: Room 305 A | 🌐 Virtual: Join on Zoom

PDT: 4:30 PM - 8:00 PM (Sunday, Oct 19th)

13:30 - 13:45
(15 minutes)

Opening Remarks

Introduction and workshop overview

13:45 - 14:15
(30 minutes)

Prof. Kiyoharu Aizawa

Manga Analysis and Benchmarks

14:15 - 14:45
(30 minutes)

Dr. Christophe Rigaud

Comics Accessibility

14:45 - 15:00
(15 minutes)

Coffee Break

Networking and informal discussions

15:00 - 15:15
(15 minutes)

Oral Presentation #1: MangaVQA

Accepted paper presentation

15:15 - 15:30
(15 minutes)

Oral Presentation #2: Onomatopoeia Generation

Accepted paper presentation

15:30 - 16:00
(30 minutes)

Dr. Yael Vinker

Visual Abstraction, Symbolism, and Narrative: Lessons from Comics for AI

16:00 - 16:30
(30 minutes)

Emanuele Vivoli

Past, present and future of Comic: a Vision-Language perspective

16:30 - 16:50
(20 minutes)

Panel Discussion

Open discussion with speakers and attendees

16:50 - 17:00
(10 minutes)

Closing Remarks

Workshop wrap-up and future directions

Invited Speakers

Prof. Kiyoharu Aizawa

University of Tokyo

Manga Analysis and Benchmarks

Google Scholar

Dr. Christophe Rigaud

University of La Rochelle

Comics Accessibility

Google Scholar

Dr. Yael Vinker

MIT CSAIL

Visual Abstraction, Symbolism, and Narrative: Lessons from Comics for AI

Google Scholar

Call for Papers

Paper submissions are now closed. We received excellent submissions related to comic understanding, multimodal analysis, and AI-driven comic analysis. Accepted papers will be presented during the workshop, with two exceptional contributions selected for oral presentations. Please note that the accepted papers will NOT appear in conference proceedings.
