X Close

Open@UCL Blog

Home

Menu

Open Source Software Design for Academia

By Kirsty, on 27 August 2024

Guest post by Julie Fabre, PhD candidate in Systems Neuroscience at UCL. 

As a neuroscientist who has designed several open source software projects, I’ve experienced firsthand both the power and pitfalls of the process. Many researchers, myself included, have learned to code on the job, and there’s often a significant gap between writing functional code and designing robust software systems. This gap becomes especially apparent when developing tools for the scientific community, where reliability, usability, and maintainability are crucial.

My journey in open source software development has led to the creation of several tools that have gained traction in the neuroscience community. One such project is bombcell: a software designed to assess the quality of recorded neural units. This tool replaces what was once a laborious manual process and is now used in over 30 labs worldwide. Additionally, I’ve developed other smaller toolboxes for neuroscience:

These efforts were recognized last year when I received an honourable mention in the UCL Open Science and Scholarship Awards.

In this post, I’ll share insights gained from these experiences. I’ll cover, with some simplified examples from my toolboxes:

  1. Core design principles
  2. Open source best practices for academia

Disclaimer: I am not claiming to be an expert. Don’t view this as a definitive guide, but rather as a conversation starter.


Follow Julie’s lead: Whether you’re directly involved in open source software development or any other aspect of open science and scholarship, or if you simply know someone who has made important contributions, consider applying yourself or nominating a colleague for this year’s UCL Open Science and Scholarship Awards to gain recognition for outstanding work!


Part 1: Core Design Principles

As researchers, we often focus on getting our code to work, but good software design goes beyond just functionality. In order to maintain and build upon your software, following a few principles from the get go will elevate software from “it works” to “it’s a joy to use, maintain and contribute to”.

1. Complexity is the enemy

A primary goal of good software design is to reduce complexity. One effective way to simplify complex functions with many parameters is to use configuration objects. This approach not only reduces parameter clutter but also makes functions more flexible and maintainable. Additionally, breaking down large functions into smaller, more manageable pieces can significantly reduce overall complexity.

Example: Simplifying a data analysis function

For instance, in bombcell we run many different quality metrics, and each quality metric is associated with several other parameters. In the main function, instead of inputting all the different parameters independently:

[qMetric, unitType] = runAllQualityMetrics(plotDetails, plotGlobal, verbose, reExtractRaw, saveAsTSV, removeDuplicateSpikes, duplicateSpikeWindow_s, detrendWaveform, nRawSpikesToExtract, spikeWidth, computeSpatialDecay, probeType, waveformBaselineNoiseWindow, tauR_values, tauC, computeTimeChunks, deltaTimeChunks, presenceRatioBinSize, driftBinSize, ephys_sample_rate, nChannelsIsoDist, normalizeSpDecay, (... many many more parameters ...), rawData, savePath);

they are all stored in a ‘param’ object that is passed onto the function:

[qMetric, unitType] = runAllQualityMetrics(param, rawData, savePath);

This approach reduces parameter clutter and makes the function more flexible and maintainable.

 2. Design for change

Research software often needs to adapt to new hypotheses or methodologies. When writing a function, ask yourself “what additional functionalities might I need in the future?” and design your code accordingly. Implementing modular designs allows for easy modification and extension as research requirements evolve. Consider using dependency injection to make components more flexible and testable. This approach separates the creation of objects from their usage, making it easier to swap out implementations or add new features without affecting existing code.

Example: Modular design for a data processing pipeline

Instead of a monolithic script:

function runAllQualityMetrics(param, rawData, savePath)
% Hundreds of lines of code doing many different things
(...)
end

Create a modular pipeline that separates each quality metric into a different function:

function qMetric = runAllQualityMetrics(param, rawData, savePath)
nUnits = length(rawData);
for iUnit = 1:nUnits
% step 1: calculate percentage spikes missing
qMetric.percSpikesMissing(iUnit) = bc.qm.percSpikesMissing(param, rawData);
% step 2: calculate fraction refractory period violations
qMetric.fractionRPviolations(iUnit) = bc.qm.fractionRPviolations(param, rawData);
% step 3: calculate presence ratio
qMetric.presenceRatio(iUnit) = bc.qm.presenceRatio(param, rawData);
(...)
% step n: calculate distance metrics
qMetric.distanceMetric(iUnit) = bc.qm.getDistanceMetric(param, rawData);
end
bc.qm.saveQMetrics(qMetric, savePath)
end

This structure allows for easy modification of individual steps or addition of new steps without affecting the entire pipeline.

In addition, this structure allows us to define new parameters easily that can then modify the behavior of the subfunctions. For instance we can add different methods (such as adding the ‘gaussian’ option below) without changing how any of the functions are called!

param.percSpikesMissingMethod = 'gaussian';
qMetric.percSpikesMissing(iUnit) = bc.qm.percSpikesMissing(param, rawData);

and then, inside the function:

function percSpikesMissing = percSpikesMissing(param, rawData);
if param.percSpikesMissingMethod == 'gaussian'
(...)
else
(...)
end
end

3. Hide complexity

Expose only what’s necessary to use a module or function, hiding the complex implementation details. Use abstraction layers to separate interface from implementation, providing clear and concise public APIs while keeping complex logic private. This approach not only makes your software easier to use but also allows you to refactor and optimize internal implementations without affecting users of your code.

Example: Complex algorithm with a simple interface

For instance, in bombcell there are many parameters. When we run the main script that calls all quality metrics, we also want to ensure all parameters are present and are in a correct format.

function qMetric = runAllQualityMetrics(param, rawData, savePath)
% Complex input validation that is hidden to the user
param_complete = bc.qm.checkParameterFields(param);

% Core function that calcvulates all quality metrics
nUnits = length(rawData);

for iUnit = 1:nUnits
% steps 1 to n
(...)
end

end

Users of this function don’t need to know about the input validation or other complex calculations. They just need to provide input and options.

4. Write clear code

Clear code reduces the need for extensive documentation and makes your software more accessible to collaborators. Use descriptive and consistent variable names throughout your codebase. When dealing with specific quantities, consider adding units to variable names (e.g., ‘time_ms’ for milliseconds) to improve clarity. You can add comments to explain non-obvious logic and to add general outlines of the steps in your code. Following consistent coding style and formatting guidelines across your project also contributes to overall clarity.

Example: Improving clarity in a data processing function

Instead of an entirely mysterious function

function [ns, sr] = ns(st, t)
ns = numel(st);
sr = ns/t;

Add more descriptive variable and function names and add function headers:

function [nSpikes, spikeRate] = numberSpikes(theseSpikeTimes, totalTime_s)
% Count the number of spikes for the current unit
% ------
% Inputs
% ------
% theseSpikeTimes: [nSpikesforThisUnit × 1 double vector] of time in seconds of each of the unit's spikes.
% totalTime_s: [double] of the total recording time, in seconds.
% ------
% Outputs
% ------
% nSpikes: [double] number of spikes for current unit.
% spikeRate_s : [double] spiking rare for current unit, in seconds.
% ------
nSpikes = numel(theseSpikeTimes);
spikeRate_s = nSpikes/totalTime_s;
end

5. Design for testing

Incorporate testing into your design process from the beginning. This not only catches bugs early but also encourages modular, well-defined components.

Example: Testable design for a data analysis function

For the simple ‘numberSpikes’ function we define above, we can have a few tests to cover various scenarios and edge cases to ensure the function works correctly. For instance, we can test a normal case with a few spikes and an empty spike times input.

function testNormalCase(testCase)
theseSpikeTimes = [0.1, 0.2, 0.3, 0.4, 0.5]; totalTime_s = 1;
[nSpikes, spikeRate] = numberSpikes(theseSpikeTimes, totalTime_s);
verifyEqual(testCase, nSpikes, 5, 'Number of spikes should be 5');
verifyEqual(testCase, spikeRate, 5, 'Spike rate should be 5 Hz');
end

function testEmptySpikeTimes(testCase)
theseSpikeTimes = [];
totalTime_s = 1;
[nSpikes, spikeRate] = numberSpikes(theseSpikeTimes, totalTime_s);
verifyEqual(testCase, nSpikes, 0, 'Number of spikes should be 0 for empty input');
verifyEqual(testCase, spikeRate, 0, 'Spike rate should be 0 for empty input');
end

This design allows for easy unit testing of individual components of the analysis pipeline.

Part 2: Open Source Best Practices for Academia

While using version control and having a README, documentation, license, and contribution guidelines are essential, I have found that these practices have the most impact:

Example Scripts and Toy Data

I have found that the most useful thing you can provide with your software are example scripts, and even better, provide toy data that loads in your example script. Users can then quickly test your software and see how to use it on their own data — and are then more likely to adopt it. If possible, package the example scripts in Jupyter notebooks/MATLAB live scripts (or equivalent) demonstrating key use cases. In bombcell, we provide a small dataset (Bombcell Toy Data on GitHub) and a MATLAB live script that runs bombcell on this small toy dataset (Getting Started with Bombcell on GitHub). 

Issue-Driven Improvement

To manage user feedback effectively, enforce the use of an issue tracker (like GitHub Issues) for all communications. This approach ensures that other users can benefit from conversations and reduces repetitive work. When addressing questions or bugs, consider if there are ways to improve documentation or add safeguards to prevent similar issues in the future. This iterative process leads to more robust and intuitive software.

Citing

Make your software citable quickly. Before (or instead) of publishing, you can generate a citable DOI using software like Zenodo. Consider also publishing in the Journal of Open Source Software (JOSS) for light peer review. Clearly outline how users should cite your software in their publications to ensure proper recognition of your work.

Conclusion

These practices can help create popular, user-friendly, and robust academic software. Remember that good software design is an iterative process, and continuously seeking feedback and improving your codebase (and sometimes entirely rewriting/refactoring parts) will lead to more robust code.

To go deeper into principles of software design, I highly recommend reading “A Philosophy of Software Design” by John Ousterhout or “The Good Research Code Handbook” by Patrick J. Mineault.

Get involved! 

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Join our mailing list, and follow us on X, formerly Twitter and LinkedIn, to stay connected for updates, events, and opportunities.

 

 

 

Copyright and AI, Part 2: Perceived Challenges, Suggested Approaches and the Role of Copyright literacy

By Rafael, on 15 August 2024

Guest post by Christine Daoutis (UCL), Alex Fenlon (University of Birmingham) and Erica Levi (Coventry University).

This blog post is part of a collaborative series between the UCL Office for Open Science and Scholarship and the UCL Copyright team exploring important aspects of copyright and its implications for open research and scholarship. 

A grey square from which many colourful, wavy ribbons with segments in shades of white, blue, light green, orange and black radiate outward against a grey background.

An artist’s illustration of AI by Tim West. Photo by Google DeepMind from Pexels.

A previous post outlined copyright-related questions when creating GenAI materials—questions related to ownership, protection/originality, and infringement when using GenAI. The post discussed how answers to these questions are not straightforward, largely depend on what is at stake and for whom, and are constantly shaped by court cases as they develop.

What does this uncertainty mean for students, academics, and researchers who use GenAI and, crucially, for those in roles that support them? To what extent does GenAI create new challenges, and to what extent are these uncertainties inherent in working with copyright? How can we draw on existing expertise to support and educate on using GenAI, and what new skills do we need to develop?

In this post, we summarise a discussion we led as part of our workshop for library and research support professionals at the Research Libraries UK (RLUK) annual conference in March 2024. This year’s conference title was New Frontiers: The Expanding Scope of the Modern Research Library. Unsurprisingly, when considering the expanding scope of libraries in supporting research, GenAI is one of the first things that comes to mind.

Our 16 workshop participants came from various roles, research institutions, and backgrounds. What they had in common was an appetite to understand and support copyright in the new context of AI, and a collective body of expertise that, as we will see, is very useful when tackling copyright questions in a novel context. The workshop consisted of presentations and small group discussions built around the key themes outlined below.

Perceived Challenges and Opportunities
Does the research library community overall welcome GenAI? It is undoubtedly viewed as a way to make scholarship easier and faster, offering practical solutions—for example, supporting literature reviews or facilitating draft writing by non-English speakers. Beyond that, several participants see an opportunity to experiment, perhaps becoming less risk-averse, and welcome new tools that can make research more efficient in new and unpredictable ways.

However, concerns outweigh the perceived benefits. It was repeatedly mentioned that there is a need for more transparent, reliable, sustainable, and equitable tools before adopting them in research. Crucially, users need to ask themselves what exactly they are doing when using GenAI, their intention, what sources are being used, and how reliable the outputs are.

GenAI’s concerns over copyright were seen as an opportunity to place copyright literacy at the forefront. The need for new guidance is evident, particularly around the use of different tools with varying terms and conditions, and it is also perceived as an opportunity to revive and communicate existing copyright principles in a new light.

Suggested Solutions
One of the main aims of the workshop was to address challenges imposed by GenAI. Participants were very active in putting forward ideas but expressed concerns and frustration. For example, they questioned the feasibility of shaping policy and processes when the tools themselves constantly evolve, when there is very little transparency around the sources used, and when it is challenging to reach agreement even on essential concepts. Debates on whether ‘copying’ is taking place, whether an output is a derivative of a copyrighted work, and even whether an output is protected are bound to limit the guidance we develop.

Drawing from Existing Skills and Expertise
At the same time, it was acknowledged that copyright practitioners already have expertise, guidance, and educational resources relevant to questions about GenAI and copyright. While new guidance and training are necessary, the community can draw from a wealth of resources to tackle questions that arise while using GenAI. Information literacy principles should still apply to GenAI. Perhaps the copyright knowledge and support are already available; what is missing is a thorough understanding of the new technologies, their strengths, and limitations to apply existing knowledge to new scenarios. This is where the need for collaboration arises.

Working Together
To ensure that GenAI is used ethically and creatively, the community needs to work collaboratively—with providers, creators, and users of those tools. By sharing everyday practices, decisions, guidance, and processes will be informed and shaped. It is also important to acknowledge that the onus is not just on the copyright practitioners to understand the tools but also on the developers to make them transparent and reliable. Once the models become more transparent, it should be possible to support researchers better. This is even more crucial in supporting text and data mining (TDM) practices—critical in many research areas—to limit further restrictions following the implementation of AI models.

Magic Changes
With so much excitement around AI, we felt we should ask the group to identify the one magic change that would help remove most of the concerns. Interestingly, the consensus was that clarity around the sources and processes used by GenAI models is essential. How do the models come up with their answers and outputs? Is it possible to have clearer information about the sources’ provenance and the way the models are trained, and can this inform how authorship is established? And what criteria should be put in place to ensure the models are controlled and reliable?

This brings the matter back to the need for GenAI models to be regulated—a challenging but necessary magic change that would help us develop our processes and guidance with much more confidence.

Concluding Remarks
While the community of practitioners waits for decisions and regulations that will frame their approach, it is within their power to continue to support copyright literacy, referring to new and exciting GenAI cases. Not only do those add interest, but they also highlight an old truth about copyright, namely, that copyright-related decisions always come with a degree of uncertainty, risk, and awareness of conflicting interests.

About the authors 

Christine Daoutis is the UCL Copyright Support Officer at UCL. Christine provides support, advice and training on copyright as it applies to learning, teaching and research activities, with a focus on open science practices. Resources created by Christine include the UCL Copyright Essentials tutorial and the UCL Copyright and Your Teaching online tutorial.

Alex Fenlon is the Head of Copyright and Licensing within Libraries and Learning Resources at the University of Birmingham. Alex and his team provide advice and guidance on copyright matters, including text, data mining, and AI, to ensure that all law and practice are understood by all.

Erica Levi is the Digital repository and Copyright Lead at Coventry University. Erica has created various resources to increase awareness of copyright law and open access through gamification. Her resources are available on her website.

Get involved!

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Join our mailing list, and follow us on X, formerly Twitter and LinkedIn, to be part of the conversation and stay connected for updates, events, and opportunities.

 

 

 

From Policy to Practice: UCL Open Science Conference 2024

By Kirsty, on 11 July 2024

Last month, we hosted our 4th UCL Open Science Conference! This year, we focused inward to showcase the innovative and collaborative work of our UCL researchers in our first UCL community-centered conference. We were excited to present a strong lineup of speakers, projects, and posters dedicated to advancing open science and scholarship. The conference was a great success, with nearly 80 registrants and an engaged online audience.

If you missed any sessions or want to revisit the presentations, you can find highlights, recordings, and posters from the event below.

Session 1 – Celebrating Our Open Researchers

The conference began with a celebration of the inaugural winners of the Open Science & Scholarship Awards, recognizing researchers who have significantly contributed to open science. This session also opened nominations for next year’s awards.

Access the full recording of the session 1 on MediaCentral.

Session 2: Policies and Practice

Katherine Welch introduced an innovative approach to policy development through collaborative mosaic-making. Ilan Kelman discussed the ethical limits of open science. He reminded us of the challenges and considerations when opening up research and data to the public. David Perez Suarez introduced the concept of an Open Source Programme Office (OSPO) at UCL and, with Sam Ahern, showcased the Centre of Advanced Research Computing’s unique approach to creating and sharing open educational resources.

Access the full recording of the session 2 on MediaCentral.

Session 3: Enabling Open Science and Scholarship at UCL

This session introduced new and updated services and systems at UCL designed to support open science and scholarship. Highlights included UCL Profiles, Open Science Case Studies, the UCL Press Open Textbooks Project, UCL Citizen Science Academy, and the Open@UCL Newsletter.

Access the full recording of the session 3 on MediaCentral.

Session 4: Research Projects and Collaborations

This session featured presentations on cutting-edge research projects and collaborations transforming scholarly communication and advancing scientific integrity. Klaus Abels discussed the journey of flipping a subscription journal to diamond open-access. Banaz Jalil and Michael Heinrich presented the ConPhyMP guidelines for chemical analysis of medicinal plant extracts, improving healthcare research. Francisco Duran explored social and cultural barriers to data sharing and the role of identity and epistemic virtues in creating transparent and equitable research environments.

Access the full recording of the session 4 on Media Central.

Posters and Networking:

We also hosted a Poster Session and Networking event where attendees explored a variety of posters showcasing ongoing research across UCL’s disciplines, accompanied by drinks and snacks. This interactive session provided a platform for researchers to present their work, exchange ideas, and foster collaborations within and beyond the UCL community.

Participants engaged directly with presenters, learning about research findings and discussing potential synergies for future projects. Themes covered by the posters included innovative approaches to public engagement by UCL’s Institute of Global Prosperity and Citizen Science Academy, as well as discussions on the balance between open access and data security in the digital age.

Explore all the posters presented at the UCL Open Science Conference 2024 on the UCL Research Data Repository. This collection is under construction and will continue to grow.

Reminder for Attendees – Feedback

For those who attended, please take a minute to complete our feedback form. Your input is very important to improve future conferences. We would appreciate your thoughts and suggestions.

A Huge Thank You!

Thank you to everyone who joined us for the UCL Open Science Conference 2024. Your participation and enthusiasm made this event a great success. We appreciate your commitment to advancing open science and scholarship across UCL and beyond, and we look forward to seeing the impact of your work in the years to come.

Please watch the sessions and share your feedback with us. Your insights are invaluable in shaping future events and supporting the open science community.

We look forward to seeing you at next year’s conference!

Spotlight on Ben Watson: Champion of Digital Accessibility at UCL 

By Rafael, on 8 July 2024

This is the first instalment of our profile series, and we shine a light on Ben Watson, UCL’s Head of Digital Accessibility. Ben’s journey from teaching to digital accessibility shows his unwavering dedication to inclusivity. He works hard with the Digital Accessibility Team and together, they put in place and advocate for accessible digital practices. Below are the highlights of his conversation with the UCL Office for Open Science and Scholarship. Here, he shares his story and vision to make UCL more accessible.

Black and white photo of Ben Watson. He has  short hair and wears a dark-colored shirt. He is looking to the side with a neutral expression. The background appears to be outdoors with some wooden structures.

Ben Watson

Current Role

Ben Watson describes his role as putting in ‘ramps and lifts’ to information. He underscores the need of making digital resources not only available but accessible to all. My role is about ensuring that all UCL systems and content, and the way we design, deliver, and share information, addresses potential barriers. If digital resources aren’t accessible to everyone, their full potential remains untapped’. 

Journey to Digital Accessibility

Ben’s path began with a background in teaching and librarianship. Early in his career, he worked at a school for blind and partially sighted students, where he discovered the transformative power of digital information. This experience ignited his passion for advocating better e-resources and focusing on accessible learning design. I’ve always been fascinated by how people consume information. As a teacher, I saw how digital information should be the answer to many accessibility issues’.

Teaching Experience and Advocacy 

His teaching background provided him with insights into diverse learning styles and the adverse impact of inaccessible materials. Now at UCL, the Digital Accessibility team assists research and teaching staff in meeting accessibility standards, promoting the idea that highly accessible digital experiences can only benefit students, staff, and institutions. One of the really lovely things about digital accessibility and promoting it is that genuinely it’s one of those things that benefits everyone’.

Addressing Challenges and Organisational Change 

Ben acknowledges the complexities of ensuring digital accessibility at a large institution like UCL. He emphasises the need to influence content creators to prioritise accessibility and integrate inclusive design from the outset. ‘To make UCL more digitally accessible, we must influence the entire information cycle. We need to support content creators to excel at what they are experts in while giving them confidence to deliver this accessibly. UCL can lead by example in many areas, including accessibility’. 

Progress, Not Perfection 

Ben champions the idea of progress over perfection in the evolving field of digital accessibility. ‘I always talk about progress, not perfection. In such a dynamic landscape, expecting 100% perfection is unrealistic, but we can certainly achieve significant, long-lasting positive progress’ 

Simple Steps for Digital Accessibility This is the accessibility icon. It is black stick figure inside a black circle. It's on a white background.

Making digital content accessible can be straightforward with built-in tools in software applications such as Microsoft’s Accessibility Checker.Ben also highlights the importance of structured headings, alternative text descriptions, and other features to make documents navigable for all users. Simple steps like using headings, alternative text and accessibility checkers can greatly enhance accessibility. UCL Accessibility pages offer extensive guidance on this to support UCL staff’

Accessibility and Open Science: Benefits for All

We asked Ben how accessibility principles can advance Open Science and Scholarship. He advocates for making research accessible to everyone. This involves adding accessibility standards to open science publishing. ‘Publishers need to meet these standards for Open Access to reach its full potential. This helps everyone by boosting the reach and impact of research.’ Challenges exist, but the benefits of accessibility are broad. Accessibility improves research impact, openness, and discoverability. ‘At the end of the day, this isn’t just about legal mandates for public sector bodies, although that’s a compelling reason. It’s about fulfilling ethical obligations to ensure that everyone can fully engage. Why should we prevent anyone from being inspired and affected by our work? With some effort, but not a huge amount, we could make our work future-proof and accessible to everyone’

European Accessibility Regulations 

Ben is enthusiastic about the European Accessibility Act, which extends obligations to commercial suppliers beyond existing legislation. He views this as a significant advancement for the education sector, making it easier to ensure accessibility compliance from the design stage. ‘The European Accessibility Act is a powerful addition to existing legislation. It mandates accessibility by default, simplifying our work to ensure compliance’

Proactive Accessibility and Legacy Content  

It’s much more challenging to retrofit accessibility into already published content, that’s why proactive measures are key. ‘It’s much more difficult to make something accessible after it’s been published. UCL collaborates with partners like the Royal National Institute for Blind People to review legacy content. We aim to integrate accessibility from the start and have clear processes for addressing older content’

Envisioning the Future 

Ben hopes for a future where digital accessibility is an integral part of everyday life and education. He highlights UCL’s research initiatives, like the Global Disability Innovation Hub, and the work of the Institute of Education at UCL (with telepresence robot projects like the one led by Jennifer Rode)  as examples of a commitment to global outreach and inclusive design. ‘I’d love the field of digital accessibility to become everyone’s business as usual. It should be included in standard training, teacher qualifications, and even the school curriculum because it’s really that important’

Accessibility is Not Just About Compliance

We should advocate for a holistic approach to accessibility, considering diverse needs, including those of neurodiverse individuals. Ben stresses the importance of inclusive design in education, promoting flexible assessment methods and delivery modes. ‘Accessibility isn’t just about compliance; it’s about creating inclusive experiences that anticipate diverse needs. By embedding accessibility into every aspect of education, we enable everyone to reach their full potential’

Librarians are Awesome 

Ben’s transition from librarianship to digital accessibility was one of the focal points of his career. Working as a librarian provided him with a unique perspective on accessibility and understanding on how people engage with information. ‘I think about information like a librarian—considering how people find and use it. This perspective is essential for ensuring digital content is accessible’. As a librarian, his involvement in e-book accessibility campaigns solidified his dedication to making digital resources accessible to all. ‘Librarians play a crucial, often underappreciated role in advocating for accessible information. It’s no wonder Michael Moore calls them the most important public servants in a democracy

Accessibility Can Make a Difference 

He explains that his commitment to accessibility comes from a deep belief in equitable information access. He is driven by a desire to remove barriers to inclusion and citizenship. ‘I’ve seen firsthand the impact of inaccessible information—it can hinder potential and make people feel excluded. My motivation is to remove barriers and ensure everyone feels included and respected. Information is vital for education and broader citizenship. Accessible design, is good design, and ensures no one is excluded and that information is adaptable for new technologies like AI’.

Inspirations  

Ben draws inspiration from key figures in disability studies, such as British sociologist and activist Mike Oliver, a founding theorist of the social model of disability. ‘Meeting Professor Mike Oliver at the University of Kent profoundly influenced my work. His theories on disability have been foundational.’ He also acknowledges the contributions of colleagues like Kirsty Wallis in raising accessibility awareness in open science. ‘Kirsty’s work in promoting accessibility in open science has been outstanding.’

Passions  

Outside of his professional life, Ben takes greater pride in his daughters’ kindness: ‘I’m incredibly proud of my two daughters. They are kind, caring, and thoughtful young people who genuinely put the feelings of others before their own’. He also has a passion that takes him on quite a ride: ‘I love motorbikes. Riding them, fixing them, and generally thinking about them!’

UCL’s Vision: We Should Lead by Example  

Ben advocates for UCL to be a leader in digital accessibility and inclusive design. ‘UCL can lead by example, ensuring accessibility is integral to everything we do. Accessibility is not extra work; it’s about completing what was left unfinished. Embracing this ethos across UCL would ensure no work is deemed complete until it’s accessible to all’. 

The Digital Accessibility Team stands beside a banner. The banner reads "Digital Accessibility" and has more information. They are in an office environment, with visible smiles.

UCL Digital Accessibility Team.

Ben Watson’s work at UCL shows he is strongly committed to making information and education open and inclusive. He invites everyone to embrace inclusive design at UCL with the Digital Accessibility Team. Through a collaborative effort, we can pave the way for a better future.  

Follow Ben’s lead. Integrate accessibility into your work by following the steps provided on the UCL Accessibility Pages. You could also attend a Digital Skills training course, or join the Accessibility Champions Network. 

 For more information or help, check out the Digital Accessibility Services, or contact the Digital Accessibility Team. 

Get involved!

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Stay connected for updates, events, and opportunities. Follow us on X, formerly Twitter, LinkedIn, and join our mailing list to be part of the conversation!

 

 

UCL Discovery reaches 50 million downloads!

By Rafael, on 27 June 2024

Guest Post by Dominic Allington-Smith (Open Access Publications Manager)

Decorative image displaying fireworks filling the night sky with bursts of red, orange, and blue lights. Sparkling circles of light create a festive and celebratory atmosphere.

Photo by Erwan Hesry on Unsplash

UCL Publications Board and the Open Access Team are delighted to announce that on Monday 24 June, UCL’s institutional repository, UCL Discovery, reached the milestone of 50 million downloads! UCL Discovery is UCL’s open access repository, showcasing and providing access to UCL research outputs from all UCL disciplines. UCL authors currently deposit around 1,675 outputs in the repository every month (average figure for the current academic year).

The 50 millionth download was of the paper ‘Replenishing IRAK-M expression in retinal pigment epithelium attenuates outer retinal degeneration’ originally published in Science Translational Medicine by a team of researchers including UCL co-lead author Professor Andrew Dick.  This paper found that increasing the levels of a key protein in the cells at the back of the eye could help protect against the age-related macular degeneration, the leading cause of vision loss among older adults.

UCL Discovery hosts over 178,500 open access publications at the time of writing, comprising mostly self-archived copies of research outputs published elsewhere to bypass publisher paywalls, but also including doctoral and research master’s theses (contemporary submissions and historic digitisations), and books published by UCL Press.  This variety of resources is displayed when viewing the highest-downloaded publication within the UCL hierarchy:

This amazing milestone shows the scope and reach that sharing research through UCL Discovery has. There are a number of ways you can share your research at UCL, and we encourage you to continue sharing your research publications via UCL RPS and Profiles. Additionally, consider sharing other types of outputs such as data, code and software to further enhance the visibility and reproducibility of your work. The Research Data Management team maintain a guide on best practice for software sustainability, preservation and sharing, and can give further support to UCL researchers as required.

Congratulations to everyone involved in reaching this incredible milestone, and let’s continue to push the boundaries of open access and research sharing at UCL!

Get involved!

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Stay connected for updates, events, and opportunities. Follow us on X, formerly Twitter, LinkedIn, and join our mailing list to be part of the conversation!

 

 

 

 

UCL Open Science & Scholarship Conference 2024: Programme Now Available!

By Rafael, on 13 June 2024

Image of UCL Front Quad and Portico over spring. With less than a week until this year’s UCL Open Science Conference, anticipation is building! We are thrilled to announce that the programme for the UCL Open Science & Scholarship Conference 2024 is now ready. Scheduled for Thursday, June 20, 2024, from 1:00 pm to 5:00 pm BST, both onsite at UCL and online, this year’s conference promises to be an exciting opportunity to explore how the UCL community is leading Open Science and Scholarship initiatives across the university and beyond.

Programme Outline:

1:00-1:05 pm
Welcome and Introductions
Join us as we kick off the conference with a warm welcome and set the stage for the afternoon.

1:05-1:45 pm
Session 1: Celebrating our Open Researchers
Learn about the outstanding contributions of our Open Science champions and their work recognised at the UCL Open Science & Scholarship Awards last year.

1:45-2:45 pm
Session 2: Policies and Practice
Explore discussions on policy development and ethical considerations in Open Science, including talks on collaborative policy-making and the role of Open Source Programme Offices (OSPOs).

2:45-3:15 pm
Coffee Break
Network and engage with our fellow attendees over coffee, tea, and biscuits.

3:15-4:00 pm
Session 3: Enabling Open Science and Scholarship at UCL
Check out services and initiatives that empower UCL researchers to embrace Open Science, including updates on UCL Profiles, UCL Citizen Science Academy, and Open Science Case Studies.

4:00-4:45 pm
Session 4: Research Projects and Collaborations
Discover cutting-edge research projects and collaborations across UCL, including case studies involving the transition to Open Access publishing, reproducible research using medicinal plants, and social and cultural barriers to data sharing.

" "4:45-5:00 pm
Summary and Close of Main Conference
Reflect on key insights from the day’s discussions and wrap up the main conference.

5:00-6:30 pm
Evening Session: Poster Viewing and Networking Event
Engage with our presenters and attendees over drinks and nibbles, while exploring posters showcasing research and discussions in Open Science and Scholarship through diverse perspectives.

For the complete programme details, please access the full document uploaded on the UCL Research Data Repository, or access the QR code.

Join us – Tickets are still available!
Whether you’re attending in person or joining us virtually, we invite you to participate in discussions that shape the future of Open Science and Scholarship at UCL. Sales will close on Monday. Secure your spot now! Register here.

Thank you!
Thank you once more to everyone who submitted their ideas to the Call for Papers and Posters. We received brilliant contributions and are grateful for our packed programme of insightful discussions and projects from our community.

We look forward to welcoming you to the UCL Open Science & Scholarship Conference 2024!

Get involved!

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Stay connected for updates, events, and opportunities. Follow us on X, formerly Twitter, LinkedIn, and join our mailing list to be part of the conversation!

 

Text and Data Mining (TDM) and Your Research: Copyright Implications and New Website Guidance

By Rafael, on 13 May 2024

This the second blog post of our collaborative series between the UCL Office for Open Science and Scholarship and the UCL Copyright team. Here, we continue our exploration of important aspects of copyright and its implications for open research and scholarship. In this instalment, we examine Text and Data Mining (TDM) and its impact on research along with the associated copyright considerations.

Data processing concept illustration

Image by storyset on Freepik.

The development of advanced computational tools and techniques for analysing large amounts of data has opened up new possibilities for researchers. Text and Data Mining (TDM) is a broad term referring to a range of ‘automated analytical techniques to analyse text and data for patterns, trends, and useful information’ (Intellectual Property Office definition). TDM has many applications in academic research across disciplines (Intellectual Property Office definition). TDM has many applications in academic research across disciplines.

In an academic context, the most common sources of data for TDM include journal articles, books, datasets, images, and websites. TDM involves accessing, analysing, and often reusing (parts of) these materials. As these materials are, by default, protected by copyright, there are limitations around what you can do as part of TDM. In the UK, you may rely on section 29A of the Copyright, Designs and Patents Act, a copyright exception for making copies for text and data analysis for non-commercial research. You must have lawful access to the materials (for example via a UCL subscription or via an open license). However, there are often technological barriers imposed by publishers preventing you from copying large amounts of materials for TDM purposes – measures that you must not try to circumvent. Understanding what you can do with copyright materials, what may be more problematic and where to get support if in doubt, should help you manage these barriers when you use TDM in your research.

The copyright support team works with e-resources, the Library Skills librarians, and the Office for Open Science and Scholarship to support the TDM activities of UCL staff and students. New guidance is available on the copyright website. TDM libguide and addresses questions that often arise during TDM, including:

  • Can you copy journal articles, books, images, and other materials? What conditions apply?
  • What do you need to consider when sharing the outcomes of a TDM analysis?
  • What do publishers and other suppliers of the TDM sources expect you to do?

To learn more about copyright (including how it applies to TDM):

Get involved!

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Stay connected for updates, events, and opportunities. Follow us on X, formerly Twitter, LinkedIn, and join our mailing list to be part of the conversation!

 

 

How understanding copyright can help you as a researcher

By Rafael, on 4 April 2024

Guest post by Christine Daoutis, Copyright Support Officer

Welcome to the inaugural blog post of a collaborative series between the UCL Office for Open Science and Scholarship and the UCL Copyright team. In this series, we will explore important aspects of copyright and its implications for open research and scholarship.

Research ideas, projects, and their outcomes often involve using and producing materials that may be protected by copyright. Copyright protects a range of creative works, whether we are talking about a couple of notes in a notebook, a draft thesis chapter, the rough write-up of a data, a full monograph and the content of this very blog. While a basic knowledge of copyright is essential, particularly to stay within the law, there is much more to copyright than compliance. Understanding certain aspects of copyright can help you use copyright materials with more confidence, make use of your own rights and overall, enhance the openness of your research.

Two stick figures are facing each other. A large red copyright symbol is behind the first one. The first person is holding a document and says: ‘Ah, copyright! I have the right to copy!’. The second person is rubbing their chin and saying: ‘Err…’.

Image attribution: Patrick Hochstenbach, 2014. Available under https://creativecommons.org/licenses/by/4.0/

This first post in our series is dedicated to exploring common questions that arise during research projects. In future posts, we will explore some of these questions further, providing guidance, linking to new resources, and signposting relevant workshops. Copyright-related enquiries often arise in the following areas:

Reusing other people’s materials: How do you GET permission to reuse someone else’s images, figures, software, questionnaires, or research data? Do you always need permission? Is use for ‘non-commercial, research’ purposes always permitted, or are there other factors to consider? How do licenses work, and what can you do when a license does not cover your use? It’s easy to be overconfident when using others’ materials, for example, by assuming that images found on the internet can be reused without permission. It’s equally easy to be too cautious, ending up not making use of valuable resources for fear of infringing someone’s rights. Understanding permissions, licenses, and copyright exceptions – what may be within your rights to do as a user – can help you.

Disseminating your research throughout the research cycle: There are open access options for your publications and theses, supporting access to and often, reuse of your work. How do you license your work for reuse? What do the different licenses mean, and which one is most suitable? What about materials produced early on in your research: study preregistrations, research data, preprints? How can you make data FAIR through licensing? What do you need to consider when making software and other materials open source?

Is your work protected in the first place? Documents, images, video and other materials are usually protected by copyright. Facts are not. For a work to be protected it needs to be ‘original’. What does ‘original’ mean in this context? Are data protected by copyright? What other rights may apply to a work?

Who owns your research? We are raising questions about licensing and disseminating your research, but is it yours to license? What does the law say, and what is the default position for staff and students at UCL? How do contracts, including publisher copyright transfer agreements and data sharing agreements, affect how you can share your research?

‘Text and data mining’. Many research projects involve computational analysis of large amounts of data. This involves copying and processing materials protected by copyright, and often publishing the outcomes of this analysis. In which cases is this lawful? How do licences permit you to do, exactly, and what can you do under exceptions to copyright? How are your text and data mining activities limited if you are collaborating with others, across institutions and countries?

The use of AI. Speaking of accessing large amounts of data, what is the current situation on intellectual property and generative AI? What do you need to know about legal implications where use of AI is involved?

These questions are not here to overwhelm you but to highlight areas where we can offer you support, training, and opportunities for discussion. To know more:

Get involved!

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Stay connected for updates, events, and opportunities. Follow us on X, formerly Twitter, LinkedIn, and join our mailing list to be part of the conversation!

 

(Update: Deadline Extended!) Call for Papers & Posters – UCL Open Science Conference 2024 

By Rafael, on 21 March 2024

Theme: Open Science & Scholarship in Practice 
Date: Thursday, June 20th, 2024 1-5pm, followed by Poster display and networking
Location: UCL Institute of Advanced Studies, IAS Common Ground room (G11), South Wing, Wilkins Building 

We are delighted to announce the forthcoming UCL Open Science Conference 2024, scheduled for June 20, 2024. We are inviting submissions for papers and posters showcasing innovative practices, research, and initiatives at UCL that exemplify the application of Open Science and Scholarship principles. This internally focused event aims to showcase the dynamic landscape of Open Science at UCL and explore its practical applications in scholarship and research, including Open Access Publishing, Open Data and Software, Transparency, Reproducibility, Open Educational Resources, Citizen Science, Co-Production, Public Engagement, and other open practices and methodologies. Early career researchers and PhD students from all disciplines are particularly encouraged to participate.

A group of attendees gathered around four rectangular tables engaging in discussions. In the middle of the room, a screen displays the text: "What are the challenges and opportunities that need to be addressed to create equitable conditions in relation to authorship?"

Attendees of the UCL Open Science Conference 2023 participating in a workshop

Conference Format: 

Our conference will adopt a hybrid format, offering both in-person and online participation options, with a preference for in-person attendance. The afternoon will feature approximately four thematic sessions, followed by a poster session and networking opportunities. Session recordings will be available for viewing after the conference. 

Call for papers

Submission Guidelines: 

We invite all colleagues at UCL to submit paper proposals related to Open Science and Scholarship in Practice, some example themes are below. Papers could include original research, case studies, practical implementations, and reflections on Open Science initiatives. Submissions should adhere to the following guidelines: 

  • Abstracts: Maximum 300 words
  • Presentation Length: 15 minutes (including time for questions)
  • Deadline for Abstract Submission: F̶r̶i̶d̶a̶y̶, A̶p̶r̶i̶l̶ 2̶6̶  Friday, May 3. (Deadline Extended!) 

Please submit your abstract proposals using this form.  

Potential Subthemes: 

  • Case Studies and Best Practices in Open Science and Scholarship
  • Open Methodologies, Transparency, and Reproducibility in Research Practices
  • Open Science Supporting Career Development and Progression
  • Innovative Open Data and Software Initiatives
  • Promoting and Advancing Open Access Publishing within UCL
  • Citizen Science, Co-Production, and Public Engagement Case Studies
  • Open Educational Resources to Support Teaching and Learning Experiences

Call for Posters

Session Format: 

The poster session will take place in person during the evening following the afternoon conference. Posters will be displayed for networking and engagement opportunities. Additionally, posters will be published online after the conference, potentially through the Research Data Repository. All attendees are encouraged to participate in the poster session, offering a platform to present their work and engage in interdisciplinary discussions. 

Submission Guidelines: 

All attendees are invited to propose posters showcasing their work related to Open Science and Scholarship in Practice. Posters may include research findings, project summaries, methodological approaches, and initiatives pertaining to Open Science and Scholarship. 

Deadline: Friday, May 24 

Please submit your poster proposals using this form.

Next Steps

A neon colourful sign that says 'Watch this Space'

Photo by Samuel Regan-Asante on Unplash

Notifications of acceptance will be sent in the week ending May 10th for Papers and June 7th for Posters. 

Recordings of the UCL Open Science Conference 2023, are available on this blog post from May 2023.

For additional information about the conference or the calls, feel free to reach out to us at openscience@ucl.ac.uk. 

Watch this space for more news and information about the upcoming UCL Open Science Conference 2024!  

FAIR Data in Practice

By Rafael, on 15 February 2024

Guest post by Victor Olago, Senior Research Data Steward and Shipra Suman, Research Data Steward, in celebration of International Love Data Week 2024.

Image depicting the FAIR guiding principles for data resources: Findable, Accessible, Interoperable, and Reusable. Created by SangyaPundir.

Credit: Sangya Pundir, CC BY-SA 4.0 via Wikimedia Commons

The problem:

We all know sharing is caring, and so data needs to be shared to explore its full potential and usefulness. This makes it possible for researchers to answer questions that were not the primary research objective of the initial study. The shared data also allows other researchers to replicate the findings underpinning the manuscript, which is important in knowledge sharing. It also allows other researchers to integrate these datasets with other existing datasets, either already collected or which will be collected in the future.

There are several factors that can hamper research data sharing. These might include a lack of technical skill, inadequate funding, an absence of data sharing agreements, or ethical barriers. As Data Stewards we support appropriate ways of collecting, standardizing, using, sharing, and archiving research data. We are also responsible for advocating best practices and policies on data. One of such best practices and policies includes the promotion and the implementation of the FAIR data principles.

FAIR is an acronym for Findable, Accessible Interoperable and Reusable [1]. FAIR is about making data discoverable to other researchers, but it does not translate exactly to Open Data. Some data can only be shared with others once security considerations have been addressed. For researchers to use the data, a concept-note or protocol must be in place to help gatekeepers of that data understand what each data request is meant for, how the data will be processed and expected outcomes of the study or sub study. Findability and Accessibility is ensured through metadata and enforcing the use of persistent identifiers for a given dataset. Interoperability relates to applying standards and encoding such as ICD-10, ICDO-3 [2] and, lastly, Reusability means making it possible for the data to be used by other researchers.

What we are doing:

We are currently supporting a data reuse project at the Medical Research Council Clinical Trials Unit (MRC CTU). This project enables the secondary analysis of clinical trial data. We use pseudonymisation techniques and prepare metadata that goes along with each data set.

Pseudonymisation helps process personal data in such a way that the data cannot be attributed to specific data subjects without the use of additional information [3]. This reduces the risks of reidentification of personal data. When data is pseudonymized direct identifiers are dropped while potentially identifiable information is coded. Data may also be aggregated. For example, age is transformed to age groups. There are instances where data is sampled from the original distribution, allowing only sharing of the sample data. Pseudonymised data is still personal data which must be protected with GDPR regulation [4].

The metadata makes it possible for other researchers to locate and request access to reuse clinical trials data at MRC CTU. With the extensive documentation that is attached, when access is approved, reanalysis and or integration with other datasets are made possible.  Pseudonymisation and metadata preparation helps in promoting FAIR data.

We have so far prepared one data-pack for RT01 studies which is ‘A randomized controlled trial of high dose versus standard dose conformal radiotherapy for localized prostate cancer’ which is currently in review phase and almost ready to share with requestors. Over the next few years, we hope to repeat and standardise the process for past, current and future studies of Cancer, HIV, and other trials.

References:    

  1. 8 Pillars of Open Science.
  2. Digital N: National Clinical Coding Standards ICD-10 5th Edition (2022), 5 edn; 2022.
  3. Anonymisation and Pseudonymisation.
  4. Complete guide to GDPR compliance.

Get involved!

alt=""The UCL Office for Open Science and Scholarship invites you to contribute to the open science and scholarship movement. Stay connected for updates, events, and opportunities. Follow us on X, formerly Twitter, and join our mailing list to be part of the conversation!