Victoria's public memory has a clutter problem. This week, three of Melbourne's largest cultural institutions confirmed they are actively running duplicate image replacement programs across their digital collections — a technical housekeeping task that has ballooned into a significant archival challenge as decades of rushed digitisation catch up with collections managers.
The issue matters now because the Victorian Government's ongoing push to expand public access to historical records online — part of its broader digital infrastructure commitments under Public Record Office Victoria — has thrown a harsh light on the underlying quality of what's already been uploaded. Duplicate images, some mislabelled and indexed under wrong subject categories, are clogging search results and in some cases replacing authentic historical photographs with low-resolution scans of the same image.
What Happened This Week
The State Library Victoria on La Trobe Street flagged internally this week that its Picture Collection — which holds more than one million images — contains a measurable proportion of duplicate digital files generated during batch-scanning projects carried out between 2008 and 2019. Library staff are now running deduplication software across the catalogue, a process expected to continue into late 2026. The City of Melbourne's own digital archive, housed through its Arts and Heritage unit, is separately conducting a review of photographic records tied to the Council's urban planning history, with particular attention to the Hoddle Grid precinct and inner suburbs including Fitzroy and Carlton.
The Immigration Museum on Flinders Street, which maintains extensive photographic holdings documenting Melbourne's migrant communities, is also caught up in the broader reckoning. Its digital team has been cross-referencing image metadata against physical catalogue cards to identify cases where a single original photograph has been scanned multiple times under different accession numbers — a common artifact of digitisation drives that relied on volunteer labour and lacked consistent file-naming protocols.
Duplicate image replacement is not simply a matter of deleting files. When an image has been cited in an academic paper, embedded in an exhibition microsite, or linked from a school curriculum resource, replacing it with a corrected version can break existing URLs and render citations inaccurate. Collections managers are now building redirect systems and versioning logs to track every substitution — work that requires both technical staff and archivists working in tandem.
The Broader Reckoning for Digital Collections
Australia's cultural institutions spent heavily on digitisation in the 2010s, often under grant-funded timelines that prioritised volume over verification. The National Library of Australia's Trove platform, which aggregates holdings from state libraries and regional collections, has long carried warnings about image quality inconsistencies. At the state level, Public Record Office Victoria has set a target of making all government records created after 1990 digitally accessible by 2028 — a deadline that is focusing attention on whether existing digital holdings are actually fit for purpose.
The practical scale of the problem is visible in commercial licensing too. Melbourne-based picture researchers working with publishers on Australian history titles report routinely encountering duplicates when ordering high-resolution files from institutional collections. Licensing fees for archival images from major Victorian institutions typically range from $85 to $450 per image depending on usage rights — meaning an incorrectly filed duplicate can trigger an unnecessary purchase before the error is caught.
For researchers using collections remotely, particularly those working from regional Victoria or interstate, there is currently no standardised way to flag a suspected duplicate for review. The State Library Victoria's online feedback form routes to a general enquiries inbox rather than directly to collections staff.
What comes next is partly a technology question and partly a funding one. Deduplication software can identify near-identical files automatically, but human judgment is still required to determine which version of an image holds the better provenance record. The Victorian Government has not yet announced dedicated funding for the remediation work underway at the institutions involved. Collections managers at several organisations are understood to be absorbing the workload within existing operational budgets — which, at most publicly funded cultural institutions, have not grown in real terms since 2022.