Archive for the 'Digitisation' Category

New manuscripts uploaded to the Transcription Desk

By Tim Causer, on 7 February 2011

Four boxes worth of material – comprising of around 1,200 manuscript images – have been uploaded to the Transcription Desk and are now available for transcription.

They contain material written between 1776 and 1826, and cover a number of subjects including the codification of laws, jury trials, legal procedure, and the Panopticon. Bentham refers to works by William Paley and William Blackstone, comments on Publicans’ Laws, and shows us the progress of his thought regarding the Panopticon prison and his frustrations as it becomes clear that it will never be built.

We hope users will enjoy delving into these works, and look forward to hearing how you get on in the discussion forum.

Happy transcribing!

The Bentham Project’s ‘Legacy Transcripts’

By Tim Causer, on 12 November 2010

Vital to the Bentham Project’s aim of producing a new and authoritative edition of The Collected Works of Jeremy Bentham are transcriptions of manuscripts written by the man himself. As noted in an earlier post, the volume of material produced by Bentham is daunting – UCL’s Special Collections holds around 60,000 folios across 176 boxes, while the British Library holds another 12,500 folios. Thanks to the indefatigable Bentham Project staff past and present, around 20,000 folios have been transcribed during the last 25 years.

Transcription has taken various forms at the Bentham Project. At first, it was done by hand and the transcripts typed up on a mechanical typewriter, until the Project’s staff found themselves on the technological cutting-edge upon the arrival of an electronic typewriter. For younger scholars, it is hard to imagine how problematic and time-consuming transcription can be without our trusty laptops.

More recently, this work has been carried out using a variety of word processors, such as Corel WordPerfect, and now Microsoft Word. However, unlike those produced through Transcribe Bentham, Word-produced transcriptions tend not to replicate all of the numerous deletions, insertions and marginalia which litter Bentham’s manuscripts and make them such complex documents. This is with good reason: the Bentham Project’s intention is to produce a Collected Works which are as close to Bentham’s intended vision as possible and which read coherently, to which purpose such markings are usually not conducive.

An example of a transcription from which the Bentham Project editors work.

While they suit the purposes of the Bentham Project editors admirably, these Word files do have drawbacks: they are produced using proprietary software and are thus are not as flexible as XML files, and they do not represent a faithful rendering of the original manuscript. The Word transcripts do not, for instance, tend to reproduce some of the most important and basic TEI elements used by Transcribe Bentham, such as line and page breaks, paragraphing and underlining, as they are unnecessary thanks to Word’s what-you-see-is-what-you-get interface.

As we intend to create a digital repository of Bentham’s works there is, therefore, a real need to take these Word documents – or ‘Legacy Transcripts’, as we’ve become wont to describe them – and encode them just as we would through Transcribe Bentham. This clearly necessitates a great deal of work, which was begun by our colleague Justin Tonra, and continues in earnest. It is complicated by there being 55 boxes worth of transcriptions which were saved using now defunct formats in large files documents; these transcriptions need to be salvaged using Word and separated into individual folios.

Once we have the Word files the real work can begin, though we were faced with an immediate dilemma about how to proceed. We could check and revise each transcription against the original folio and mark them up using all the Transcribe Bentham tags, but by doing so we might only complete a handful of boxes over the next few months. Instead, it was decided that we would aim to get as much material converted into XML as possible, thereby getting a larger amount of material into such a state that it will be ready to be made accessible to the public. This will be done by lightly encoding each transcript using mostly only header and paragraph tags.

A lightly-encoded Legacy Transcript, showing header, paragraph and underlining tags.

We will create valid TEI-compliant XML documents, an example of which can be seen below. This clearly does not look as sophisticated as manuscripts encoded through Transcribe Bentham, but such light encoding provides a working XML file, an extremely useful and valuable starting point, and something which can be edited in more detail at a later date.

The same lightly-encoded Legacy Transcript, rendered in Firefox.

A manuscript encoded through Transcribe Bentham, rendered in Firefox.

The conversion of our Legacy Transcripts complements the great labours of Transcribe Bentham‘s contributors, and it might even be possible to ask volunteers to add more thorough coding at some point in the future. The work will also contribute to the ultimate aim of producing a full-text searchable digital catalogue of the Bentham Papers, open to all, which will be housed in UCL’s Digital Collections.

Update: a small Legacy Transcript experiment is now running at the Transcription Desk

There is plenty to do, so wish us luck!

Two new boxes available!!

By Valerie H Wallace, on 25 October 2010

Two new boxes have now been uploaded to the Transcription Desk. We now have a total of six boxes and 2073 manuscripts available to view. Our latest additions – boxes 73 and 96 – contain manuscripts written in the mid-1770s on such themes as legislation and law-making, crime and punishment and religion. Bentham discusses famous figures such as Voltaire, William Blackstone, David Hume and Francis Hutchison, and he muses on drunkenness, swearing and adultery, as well as suicide, infanticide and homicide. Visit the Transcription Desk and become the first to transcribe Bentham’s potentially thrilling writings on these hot topics. Make sure and share your findings with other users by contributing to the discussion forum. Happy transcribing!

Transcribe Bentham has launched!

By Valerie H Wallace, on 7 September 2010

The Transcribe Bentham crowdsourcing initiative officially launched today. Our Transcription Desk is now open to the public and we encourage everyone to have a go at transcribing Jeremy Bentham’s papers! We welcome all contributions and all thoughts on anything relating to Bentham and the project in general. You are warmly encouraged to explore the site, create a profile and post comments on our discussion board. Tell us about your favourite Bentham quote or invented word! Gain points for your contributions to move up our progress ladder and become a transcribing prodigy! Track the progress of transcription by viewing the Benthamometer.

This is the first major crowdsourcing transcription project, from which we will learn much about the nature of community engagement and social media. We will share these results with you in the near future. In the meantime we hope you enjoy the site!

Have fun!

Beta testing

By Valerie H Wallace, on 25 August 2010

We are pleased to announce that Transcribe Bentham has moved into a beta testing phase! Our Transcription Desk can be viewed and tested here. Please report any bugs by e-mailing us at transcribe.bentham [at] ucl.ac.uk. We welcome all comments! Our formal launch is scheduled for September 8 2010. Stay tuned!