Tens of thousands of duplicate images are clogging Melbourne's public digital archives, slowing researcher access and distorting collection counts at institutions across the city — and the people responsible for fixing it cannot agree on a single standard for how to do it.
The issue has moved from a back-office annoyance to a genuine policy flashpoint in 2026, partly because several Victorian government agencies are mid-way through digitisation projects that are actively creating new duplicates faster than old ones can be removed. The Victorian Managed Insurance Authority's digital asset framework, updated in March 2026, flagged duplicate imagery as a cost and liability concern across government-held collections, accelerating pressure on cultural institutions to act before further public money is committed to storage infrastructure.
The State Library of Victoria, on Swanston Street in the CBD, holds one of the largest photographic collections in the southern hemisphere — more than two million images by its own published count. Archivists and researchers who work with the collection regularly note that automated scanning programs generate multiple near-identical files from a single physical item, particularly when older negatives are run through high-resolution equipment more than once. The Library has not publicly stated what proportion of its digital holdings are duplicates, and declined to provide a figure when contacted this week.
Who Owns the Problem
The City of Melbourne's urban planning portal, which holds georeferenced imagery tied to heritage overlays across suburbs from Carlton to Southbank, ran a deduplication audit across part of its holdings in the second half of 2025. Planning professionals using the portal have described finding multiple versions of the same heritage streetscape photo attached to different property records — a problem that can affect development applications if conflicting image dates are cited in submissions.
Digital preservation specialists at RMIT University's Digital Studio, based on the Bundoora campus, have been vocal about the absence of a shared Victorian protocol. Without a mandated hash-based deduplication standard — a technical method that assigns each image a unique fingerprint to identify exact or near-exact copies — individual institutions are spending money on incompatible proprietary solutions. The cost of commercial deduplication software licences for a mid-sized cultural institution ranges from roughly $18,000 to $65,000 annually, depending on collection size and vendor, according to publicly available pricing from suppliers active in the Australian market.
Museums Victoria, headquartered at Carlton Gardens, manages collections across multiple venues including Immigration Museum on Flinders Street and Melbourne Museum itself. The organisation has previously published open-access guidance on digital collection standards but has not released a specific policy addressing duplicate image replacement as a standalone issue. Technology staff at organisations working with Museums Victoria's Collections Online platform have noted the system flags probable duplicates but does not automatically remove or replace them — that decision remains with individual curators.
What Comes Next
Arts Access Victoria and several smaller community heritage groups in Melbourne's inner north — including organisations preserving migrant community photographic records in Fitzroy and Brunswick — have raised a different concern. For collections digitised by volunteers over many years, duplicate images are often not errors at all. They represent different moments of community custody, different annotations, different donors. Replacing one with another without a clear policy risks erasing that provenance entirely.
The Victorian Department of Creative Industries is expected to release updated digital collections guidance before the end of the 2026 calendar year, according to its published workplan. That guidance is widely anticipated to address deduplication for the first time as a standalone issue rather than folding it under broader data governance policy. Whether it will mandate a particular technical standard or leave discretion to individual institutions remains the central unresolved question.
For researchers, the practical advice right now is straightforward: when pulling images from any Victorian government or cultural institution portal, check file metadata for creation dates and source scanner IDs before citing a specific image in academic or planning work. Two files that look identical may carry different rights statuses. Until a unified standard is in place, that due diligence falls on the person doing the downloading.