cz.

Carmine Zaccagnino

Researching document analysis and Generative AI for images — mostly working at their intersection. Working in Dr. Silvia Cascianelli's team.

🕊︎
AImageLab
Università di Modena
e Reggio Emilia

Read on
02 — Work

What I'm up to.

I.

Editing what's already in the image

Most of my present-day work is on teaching diffusion and autoregressive image models to change what's already there in an image — swap a label, fix a typo, alter a price — without redrawing everything around it. It's harder than it sounds: type and graphic context have a font, a weight, an alignment, a shadow, and these must survive the edit unchanged. The method is called IDAttn and the paper is going to ICML, on multi-instance editing in general, with infographics as the central application.

II.

Handwriting that looks like a person's

Closely adjacent: getting models to write, in image, in a particular hand. Autoregressive image generators are not naturally good at being stylistically consistent or reliably legible at the same time; persuading them to be both is what Eruku, our WACV paper, is about.

III.

Parsing historical documents

Earlier work, still occasionally ongoing, on parsing and transcribing historical documents. The µgat paper (ECCV-W 2024) gives document parsers multi-page context for reading collections of medieval papal letters — Regesta Pontificum Romanorum. SCAM (ICDAR 2026) approaches a harder version of the same general problem: a new line-level HTR dataset built from an 11th-century Coptic manuscript in an extinct language, across leaves scattered across two archives and digitized under different conditions — a realistic benchmark for low-resource HTR.

IV.

And a side strand: petrol stations

With friends in another lab I work on something altogether different — algorithms for routing drivers along the cheapest, fastest combinations of refueling stops on their itinerary. It started small and became PIENO, IEEE Access journal article, then a pair of follow-ups the year after.

03 — Books

Two books, both about Flutter.

Before computer vision was the day job there was a lot of cross-platform mobile development. Two books came out of those years — one in English with the Pragmatic Bookshelf, and an Italian translation with Hoepli.

Programming Flutter — cover, Pragmatic Bookshelf, 2020

Programming Flutter

Pragmatic Bookshelf · 2020 · ISBN 9781680506952

An English-language guide to building native cross-platform mobile apps with Flutter — published by the Pragmatic Bookshelf, edited by Michael Swaine.

Flutter — Guida allo sviluppo di app performanti e cross-platform, Hoepli, 2020

Flutter — Guida allo sviluppo

Hoepli · Milan, 2020 · ISBN 9788820399481

An Italian-language companion volume on building performant, cross-platform applications with Flutter — published by Hoepli.

04 — Commonplace

My old Blog.

carmine.dev — est. 2018

A long-running blog on Flutter, Linux, Kubernetes & the quieter workings of DevOps — kept somewhat irregularly, in the manner of a journeyman's notebook.

Some pieces have appeared elsewhere too — on Smashing Magazine and other publications — gathered under the “Articles I've Published Elsewhere” section of the blog.

Visit the blog
05 — Around & About

In town.

Since 2022 Meetup · Modena

Flutter Modena

A bi-monthly meetup for the local Flutter community — usually twenty to thirty people, usually at the same kind of place. I help run it.