The Case Studies
1: Ngā Ripo pilot project
The problem: The Alexander Heritage & Research Library at Wanganui has a huge historical index of people and subjects, covering the period from 1840 to 2002, Volunteers for many years have been indexing birth, marriage and death notices from local newspapers and other sources onto ‘index cards’. The Library wanted to make the information contained on the cards accessible online and made a plan to digitise the cards. But that left them with a pile of images that were not searchable. Worse, because a lot of the information on the cards was handwritten, rather than typed, OCR wouldn’t work for more than a percentage of them. Paying for professional transcription was beyond the budget. So what to do? The solution Create an online tool using the Recollect platform that harness the power of enthusiastic volunteers. Because this meant the crowd was ‘wild’ a short online tutorial was developed to train the volunteers to capture the desired content in the correct way. The tutorial had to be successfully completed before they were ‘let loose’ on real data. Cards were displayed one at a time for the volunteers to transcribe, with each card presented and transcribed at least twice by different people. The transcription entries were compared programmatically, and if they met a prescribed level of accuracy the card was ‘accepted’ and made live. Progress graphs and charts were included on the project dashboard. This allowed the Library team to track the overall project, and also showed the volunteers see how many cards they had done, how many were live, and where they sat on the leader board. The pilot and this project was a trial run on over 1000 cards. The project was a major success, with all index cards successfully transcribed by volunteers in a matter of weeks.2: Antarctica NZ’s ADAM slide collection
The problem Antarctica NZ had an enormous slide collection – around 25,000 images – that they wanted to digitise before depositing them with Archives New Zealand. Information about each slide was written by hand around the slide mount – including subject, photographer and dates. This provided two challenges: The challenge for digitisation was to capture both the entire mount – so the handwritten metadata could be transcribed AND capture the image at a high enough resolution. The other was to find a cost effective method to check and transcribe the digital images, as well as supply additional metadata to enhance the discoverability of the photographs. The solution. NZMS developed an innovative solution that allowed both slide image and slide mount to be captured at once. The Recollect team then set up the crowdsourcing module for ADAM’s ‘tame’ crowd to:- Transcribe the metadata from the mount into prescribed fields
- Apply new categories of metadata including subject tags and voting an image into the “Gallery of Awesomeness”
- Hand crop the photo portion from the slide mount to create the final digital image (rotating and flipping the final image where necessary).