Victoria's public image archives — spanning everything from State Library Victoria's photographic holdings to the City of Melbourne's urban heritage records — are carrying a significant hidden problem. Years of piecemeal digitisation programs, rushed funding rounds, and incompatible cataloguing systems have produced collections where the same image can exist dozens of times under different file names, different metadata, and in some cases, different rights classifications. Institutions are now actively confronting the issue through what archivists call duplicate image replacement: the process of identifying redundant files, selecting a canonical master version, and retiring the rest.
The timing matters. The Victorian Government's Digital Assets Policy, which took effect in late 2024, imposed new storage efficiency requirements on public agencies, including cultural institutions. Under that framework, agencies were asked to audit their digital holdings and reduce unnecessary redundancy. For image-heavy collections, that directive landed with particular force.
How the Duplication Problem Grew
The roots run back to the early 2000s. When institutions like Museum Victoria — now Museums Victoria — first began scanning physical photographs and slides, there was no shared state-wide standard for file resolution, naming convention, or database format. The Public Record Office Victoria, based in North Melbourne on Macarthur Place, ran its own digitisation stream. The City of Melbourne Archives on Little Collins Street ran another. State Library Victoria on Swanston Street ran a third. Each used different software, different vendor contracts, and different staff training regimes.
When collections were later migrated to newer content management systems, automated batch imports frequently duplicated files rather than merging them. A single glass-plate negative of Flinders Street Station, for example, could generate an original high-resolution TIFF, a web-optimised JPEG, a thumbnail, a re-scan correcting for colour shift, and then all four of those again when the record was migrated a second time. Multiply that pattern across hundreds of thousands of items and the scale of the problem becomes clear.
The problem was compounded by grant-funded community digitisation projects. Programs through Creative Victoria and the former Regional Arts Victoria encouraged local historical societies — including groups in Fitzroy, Coburg, and the inner west — to scan and upload local photographic collections to shared state platforms. Without strict deduplication protocols at the point of upload, community-sourced images frequently duplicated material already held professionally.
The Clean-Up and What Comes Next
Duplicate image replacement is not simply deletion. The process requires archivists to assess each cluster of duplicates, verify which version carries the highest resolution and the most complete provenance metadata, designate that file as the master record, update all linked catalogue references to point to the master, and then remove or formally retire the redundant copies. Done carelessly, the process can strip a collection of contextually valuable variant scans that, while technically duplicates, capture different states of a physical object's deterioration.
Museums Victoria has publicly committed to completing a first-pass audit of its digital image holdings by the end of calendar year 2026. The institution manages more than 17 million objects across Melbourne Museum in Carlton, Scienceworks in Spotswood, and the Immigration Museum on Flinders Street. Its digital collection infrastructure is substantial, and the deduplication work is being handled in stages rather than as a single event.
State Library Victoria is approaching the problem differently. Working with the Trove platform — the National Library of Australia's aggregator — the Library is using algorithmic matching tools to flag probable duplicates for human review rather than automated deletion. That human-in-the-loop approach is slower but reduces the risk of losing genuinely distinct versions of contested historical images.
For researchers and members of the public who rely on these collections, the practical advice is straightforward. If you are currently downloading or citing images from State Library Victoria, Museums Victoria, or the City of Melbourne Archives, record the persistent URL and the catalogue identifier, not just the file name. Catalogue identifiers survive the replacement process; file names and folder paths often do not. The replacement programs are live now, and URLs tied to redundant files may eventually resolve to a master record — or, if the process is imperfect, resolve to nothing at all.