Transcription Update – 14 October to 10 November 2017

By Louise Seaward, on 14 November 2017

Hello and welcome to the latest statistics update from Transcribe Bentham.

We’re happy to report that our volunteers have been contributing transcripts steadily over the past four weeks. We want to take this moment to say a huge thank you to everyone who has spent some time working on the site! We are also planning some improvements to our Transcription Desk, which will hopefully make it more user-friendly. These developments are based on the results of our recent user survey and the ideas generated at the Bentham Hackathon we hosted in association with IBM. More information coming soon!

Back to the statistics – these are the latest statistics as of 10 November 2017.

19,249 manuscript pages have now been transcribed or partially transcribed. Of these transcripts, 18,478 (95%) have been checked and approved by TB staff.

Over the past four weeks, volunteers have worked on a total of 113 manuscript pages. This means that an average of 28 pages have been transcribed each week during the past month.

Check out the Benthamometer for more information on how much has been transcribed from each box of Bentham’s papers!

Project Update – the Bentham Hackathon, a weekend well spent

By Louise Seaward, on 24 October 2017

The Bentham Project is tired (but happy!) this week, as we spent the weekend taking part in our first Hackathon.  It was an inspiring few days and we came away hugely impressed by the useful and creative digital research tools that our hackers produced over the course of a weekend.

The Bentham Hackathon was held in partnership with the technology company IBM, along with the support of UCL Centre for Digital Humanities and UCL Innovation and Enterprise.  It was designed as a collaborative and open event where participants could work together to explore how digital tools can help us to research Bentham’s philosophy.

The Hackathon took place over one evening and two full days between 20 and 22 October 2017 and brought together coders, developers, computer scientists, digital humanists, humanities researchers and some of the volunteer transcribers from Transcribe Bentham.

By the Saturday morning, the participants had formed six teams who were ready to #hackBentham. They were working on the following challenges set out by the Bentham Project:

  1. How can we use keyword searching to explore Bentham’s writings?
  2. Can we use technology to decipher Bentham’s difficult handwriting?
  3. Can we build a user-friendly interface for navigating and transcribing documents?
  4. Can we build a more user-friendly version of the Transcribe Bentham crowdsourcing platform?

The attendees had a large amount of data to work with: thousands of images of Bentham’s manuscripts and transcripts of their content; metadata for the entirety of the Bentham papers held at both UCL and the British Library; and various printed editions of Bentham’s writings and correspondence.

IBM provided access to their Bluemix platform where the hackers could experiment with the Object Store, Watson Knowledge Studio and Node-RED applications.  IBM also used this platform to pre-process some of the Bentham data so that the participants could get to work quickly.

The teams worked diligently all weekend, with the support of members of the Bentham Project and developers from IBM. Coding and discussion went on until 8:30pm on the Saturday evening, fuelled by pizza, coffee and Coca-Cola!

On Sunday afternoon it was time for the teams to submit and present their final outputs.  IBM generously provided prize money of £1000 for the event and it was up to a panel of judges from the Bentham Project, IBM and UCL Innovation and Enterprise to award the spoils!

First up was the ‘Bencharms’ team, who used IBM Cloudant to produce a more attractive version of the Transcribe Bentham Transcription Desk, with enhanced functionality such as letting users see more easily whether a page has already been transcribed. They also had the idea of a mobile app where users could contribute to Transcribe Bentham by transcribing single words.

[Image: Presentation from the ‘Bencharms’ team.]

Team ‘XScribe’ put together a searching interface for the Bentham papers, where users would be able to look for keywords but also see whether certain manuscripts have already been transcribed.  They also worked on image extraction and segmentation to make it easier for transcribers to match the line of their text transcription to the corresponding line in the image.  Again, these ideas have the potential to speed up the transcription process significantly.

[Image: Presentation from the ‘XScribe’ team.]

Two teams, ‘Bentham Budds’ and ‘Benthamligraphy’, chose to work on a language model that could predict the words that Bentham would be most likely to use. They used TensorFlow and IBM’s Node-RED software for machine learning to train a model using a sample of transcripts of Bentham material. Such a model could increase the productivity of Transcribe Bentham volunteers and Bentham Project researchers, because Bentham’s handwriting is often so difficult to read.

[Image: Presentation from the ‘Benthamligraphy’ team.]

The ‘QSP’ team, which included two volunteer transcribers from Transcribe Bentham, decided to work on a sandpit area to help orientate new users of the platform. Their ‘Box 999’ area included helpful videos and links for new transcribers and also allowed users to practise transcribing pages and get immediate feedback on any errors. This was a fitting suggestion as we find it difficult to attract new volunteers to Transcribe Bentham, possibly because people can be daunted by the prospect of transcribing a complete page on their own.

[Image: Presentation from the ‘QSP’ team.]

But the winning team was ‘Bentham’s Head’!  Their fantastic site called Locate Bentham not only has the potential to facilitate existing research questions but could also generate new areas of enquiry.  The team created an interface where users can perform keyword searches on Bentham transcripts, view a Google map of the places mentioned in Bentham’s correspondence, trace the development of Bentham’s ideas over time, examine Bentham’s social network based on his list of correspondents and even analyse Bentham’s personality using IBM Watson Personality Insights.  This was an amazing breadth of resources, embedded in a functional and attractive interface.  Well done team!

[Image: Presentation from the ‘Bentham’s Head’ team.]

The Bentham Project had little idea what could happen at a Hackathon but we were struck by the concentration and creativity of all the teams.  A big thank you to everyone who took part and to our partners at IBM, UCL Centre for Digital Humanities and UCL Innovation and Enterprise.

We want to continue to develop some of the ideas and connections made at the Hackathon, to improve both Transcribe Bentham and the digital research tools at the Bentham Project’s disposal. IBM have kindly allowed participants continued access to the Bluemix platform in the short term and we are planning to get involved in the upcoming Learn Hack at UCL on 24-26 November. Watch this space for more info!

Transcription Update – 16 September to 13 October 2017

By Louise Seaward, on 13 October 2017

Hello!  We’re here with some amazing news for this month’s statistics update.  Volunteers have now transcribed more than 19,000 pages on our site – a phenomenal effort for which we are hugely grateful.  We look forward to the next big milestone – 20,000 pages, the transcribers are coming for you!

We’re attending a conference about crowdsourcing at the University of Angers next week. We’ll be speaking about the results of our latest user survey and suggesting how we hope to use this feedback to make Transcribe Bentham more enjoyable and efficient for users.  Look out for a report in our next blog post!

Back to the statistics – these are the latest statistics as of 13 October 2017.

19,136 manuscript pages have now been transcribed or partially transcribed. Of these transcripts, 18,327 (95%) have been checked and approved by TB staff.

Over the past four weeks, volunteers have worked on a total of 180 manuscript pages.  This means that an average of 45 pages have been transcribed each week during the past month.

Check out the Benthamometer for more information on how much has been transcribed from each box of Bentham’s papers!

Transcription Update – 19 August to 15 September 2017

By Louise Seaward, on 25 September 2017

Hi everyone! We’re here with a quick update on the latest statistics to showcase the hard work that our transcribers have put in over the past month. We continue to be amazed by the efforts of our volunteers and we owe them enormous thanks!

These are the latest statistics as of 15 September 2017.

18,956 manuscript pages have now been transcribed or partially transcribed. Of these transcripts, 18,027 (95%) have been checked and approved by TB staff.

Over the past four weeks, volunteers have worked on a total of 181 manuscript pages.  This means that an average of 45 pages have been transcribed each week during the past month.

Check out the Benthamometer for more information on how much has been transcribed from each box of Bentham’s papers!

Project update – join us at the Bentham Hackathon with IBM

By Louise Seaward, on 23 August 2017

We’re here with news of an exciting event which will take place in October 2017. UCL have teamed up with the technology company IBM to organise a ‘Bentham Hackathon’, where participants can work together to explore how digital tools can help us to research Bentham’s philosophy.

For anyone unfamiliar with the term, a hackathon is a portmanteau of the words ‘hack’ and ‘marathon’. It originally referred to an intensive meeting where groups of computer developers collaborated on software projects. The meaning of the term has since expanded and it is often applied to cultural or educational events with a technical element, which are designed to generate new ideas and collaborations. For more on hackathons, have a look at Wikipedia or the useful ‘How to Guide for hackathons in the cultural sector’ produced by the Europeana Space project.

The Bentham Hackathon will take place over the weekend of 20-22 October 2017 at UCL BaseKX.  The Bentham Project, in association with UCL Centre for Digital Humanities and UCL Innovation and Enterprise, will be working with IBM to explore the following question:

How can digital technologies help us to research Bentham’s philosophy?

The Bentham Hackathon is an intriguing opportunity for participants to play around with thousands upon thousands of images, transcripts and texts of Bentham’s writings, many of which have been produced in the course of the Transcribe Bentham crowdsourcing initiative.  Let’s see how these amazing resources can be explored and analysed with IBM’s cutting-edge technologies!

We have set four suggested challenges for participants in the Hackathon to work on – although other ideas may emerge in the course of the event.

  1. How can we use keyword searching to explore Bentham’s writings?
  2. Can we use technology to decipher Bentham’s difficult handwriting?
  3. Can we build a user-friendly interface for navigating and transcribing documents?
  4. Can we build a more user-friendly version of the Transcribe Bentham crowdsourcing platform?

Anyone interested in these questions is very welcome to join us at the Bentham Hackathon. The Hackathon is a free event and there are no prerequisites for participation.

For technical types, this is a great chance to work with IBM and learn new skills.  Those interested in history, philosophy and Bentham can also give their input to help ensure that digital tools work to enhance learning and research in the humanities. Any Transcribe Bentham volunteers who are close to London might also find the event interesting – your knowledge of Bentham and the process of transcription would be invaluable!

The Hackathon will last for the weekend, starting with an evening presentation on Friday 20 October.  Catering will be provided and participants can get involved in the whole weekend, or just pop in for a while.

The Bentham Hackathon will help us to showcase Bentham’s enormous contribution to philosophical thought, including the way in which his ideas on education inspired the founders of UCL.  And we are hopeful that the innovations developed over the course of this weekend will suggest some new ways to use digital technologies in humanities research.

For more information, check out the Bentham Hackathon webpage or contact us.