PosterSum: A multimodal benchmark for scientific poster summarization

By Miniml Research, February 24, 2025

Scientific posters pack dense visuals and terse text into a single canvas, which makes summarization harder than standard document settings. PosterSum targets this gap with a multimodal benchmark for generating paper abstracts from poster images.

The dataset includes 16,305 posters covering multiple fields, and the evaluation compares end-to-end approaches with segment-then-summarize pipelines. The results highlight that layout-aware segmentation is still critical for strong performance.

PosterSum provides a practical testbed for multimodal summarization systems that need to reason over images, text blocks, and layout structure simultaneously.

Paper: https://aclanthology.org/2025.findings-ijcnlp.114/

Stay ahead with research-backed solutions

From papers to production, we translate cutting-edge AI research into practical systems that give your business a competitive edge.

See how we work