Carmine Zaccagnino

I.

Editing what's already in the image

Most of my present-day work is on teaching diffusion and autoregressive image models to change what's already there in an image — swap a label, fix a typo, alter a price — without redrawing everything around it. It's harder than it sounds: type and graphic context have a font, a weight, an alignment, a shadow, and these must survive the edit unchanged. The method is called IDAttn and the paper is going to ICML, on multi-instance editing in general, with infographics as the central application.

Shifting the breaking point of flow matching for multi-instance editingICML 2026 · arXiv 2602.08749

II.

Handwriting that looks like a person's

Closely adjacent: getting models to write, in image, in a particular hand. Autoregressive image generators are not naturally good at being stylistically consistent or reliably legible at the same time; persuading them to be both is what Eruku, our WACV paper, is about.

Autoregressive styled text image generation, but make it reliableWACV 2026

III.

Parsing historical documents

Earlier work, still occasionally ongoing, on parsing and transcribing historical documents. The µgat paper (ECCV-W 2024) gives document parsers multi-page context for reading collections of medieval papal letters — Regesta Pontificum Romanorum. SCAM (ICDAR 2026) approaches a harder version of the same general problem: a new line-level HTR dataset built from an 11th-century Coptic manuscript in an extinct language, across leaves scattered across two archives and digitized under different conditions — a realistic benchmark for low-resource HTR.

µgat: Improving single-page document parsing by providing multi-page contextECCV-W 2024 · Springer A text recognition dataset from Sahidic Coptic ancient manuscriptsICDAR 2026

IV.

And a side strand: petrol stations

With friends in another lab I work on something altogether different — algorithms for routing drivers along the cheapest, fastest combinations of refueling stops on their itinerary. It started small and became PIENO, IEEE Access journal article, then a pair of follow-ups the year after.

P. I. E. N. O. — Petrol-filling Itinerary Estimation aNd OptimizationIEEE Access · 2024 A workflow for cost- and time-aware refueling itinerary optimizationCCNC 2026 RI-PIENO: Revised and improved PIENOCCNC 2026

What I'm up to.

Editing what's already in the image

Handwriting that looks like a person's

Parsing historical documents

And a side strand: petrol stations

Two books, both about Flutter.

Programming Flutter

Flutter — Guida allo sviluppo

My old Blog.

In town.

Flutter Modena