A 2007 overview of JISC projects in terms of openness

by Ramón Casero Cañas on 7 June 2007 , last updated

Archived This page has been archived. Its content will not be updated. Further details of our archive policy.

Introduction

This report summarises the review of 104 JISC-funded projects that took place from 6 March to 1 April 2007. While its initial goal was to simply identify projects that produce open source software, from an early stage it became evident that it would be interesting to assess “openness” in a wider sense, and identify potential open communities and points of collaboration between projects.

This has provided OSS Watch with an opportunity to evaluate the extent to which open source and open community practices can be found in JISC funded projects. This helps OSS Watch to focus its support efforts.

Methodology

The JISC list of projects sorts all active JISC-funded projects into alphabetical order. Each entry has a short description of the project and a link to a “JISC project page” with a general description of its background, goals, deliverables, team, stakeholders, contact information etc. While the content varies in the level of detail, sectioning, and type of information, all project pages have a similar appearance and structure up to a certain point. Automatic extraction of information would probably be feasible, but the level of detail so obtained would be insufficient for most practical purposes.

Most projects also have an external link to a “project website”. The collection of project websites is completely heterogeneous. It often provides more in-depth information about the project, but this information cannot be compiled in an automated way. Evaluating projects in this way is tedious and time-consuming.

These difficulties are being addressed by the open source project Simal, that aims to provide tools for building project catalogues in an automated way. Simal was started at OSS Watch following open development and open community guidelines, and its requirements are partly informed by this study.

For the present review, each JISC project page and external project website were browsed, and notes were taken regarding the following categories:

Governance and community
Is the project carried out by a single team/institution? Are several institutions collaborating? Is it an open project where anybody can join?
Visibility
Does the project have a website? Is it good? Is it possible to contact the project team?
Deliverables
What is the project going to deliver? Software? Data? Documentation? Web services? Under what conditions or licence?
Standards
What standards are projects using to encode, store, and transmit data and resources?
Relationship between projects
Is the project using outputs from or collaborating with other projects (JISC or not)? In some cases, there is a fine line between partnership and third party. Partnership is more about teams of people working for the same project, and belongs in the “Governance and community section”. Third party is more about taking on the results from a previous project, being collaborators to an open source project led by another group, or using a piece of software that is developed externally.

The categories above may seem too broad, and in fact they are so by design, as this study started without much previous knowledge of the wide range of types of JISC projects, or what issues regarding “openness” they would raise.

But thanks to this study we have been able to learn how to better characterise JISC projects. Combined with the experience gained doing the review of online survey tools, we are producing templates for the review of JISC projects that will allow future studies to be more focused.

Governance and community

Projects were classified into 3 categories according to the type of teams working in them. This is a rough classification, as it is very difficult to learn from project websites who is actually doing what for each project.

Single institution or team
A small closed team of people, usually from a single lab or office in an institution.
Closed community projects
Teams split between different institutions working on the same project; only team members can contribute to the project. Drawing the line between this category and the former has been almost random sometimes. Many projects nominally have a lead institution plus partner institutions, but this says nothing about the degree of involvement of each party. I have the feeling that in some cases, partners are active contributors to the project, and thus belong in this category, but in others they may be institutions where the project deliverables are tested.
Open community projects
While all JISC projects have a team leader and team members, projects in this category allow or even encourage input from outside collaborators, in particular direct involvement of the user.

Of the open community projects, none is truly open in governance, as all project teams are closed. The JISC Guide to Bidding states: “Do not underestimate the problems in recruiting suitable staff to work on the project. Staff with suitable qualifications in areas where the JISC is interested can be in short supply or expensive. If you have appropriate people to be seconded to the work this should be stated in your bid. Alternatively you should provide contingency plans in the event that you experience problems with recruitment.”

The responsibility of outcomes (“value for money” ) is placed on the bidders, who have limited budget and time to achieve the objectives of the project. Therefore, all JISC projects can be expected to have decided on the project team beforehand, and be unwilling to move to an open governance model.

An argument that could support open governance is better sustainability, but this is only tangentially mentioned in the Guide to Bidding “Comment on sustainability issues when project funding ceases;”

and the review of projects shows that it is not a big concern in practice.

In addition, open governance seems counterintuitive for some projects that just aim at producing some data or installing an in-house system, because it is understood that the data or system will be maintained within the institution, so only employees shall be granted administrative access and information.

Therefore, lack of open governance is arguably a result of both JISC’s bidding process and project idiosyncrasy.

But while full open governance can be unrealistic or even inapplicable, many projects could benefit from becoming involved with existing communities or creating new open communities where none currently exist. This is particularly true for projects

  • that produce academic output, e.g. Virtual Learning Objects (VLOs)
  • with software outcomes
  • that produce documentation (HowTo papers, evaluations, requirements…) on the implementation and usage of third-party software, e.g. Virtual Learning Environments (VLEs), Virtual Research Environments (VREs), ePortfolios, etc.

The main benefit of creating open communities for JISC projects would arguably be sustainability, mostly survival of the project once the JISC funding has ended. For a wider discussion on sustainability, see Sustainable Open Source and the JISC sustainability study.

Visibility

Visibility is both a consequence of and a requisite for project openness. Specific issues of visibility, like access to source code and public documentation will be dealt with in the following sections. This section focuses on project websites and contact channels.

There are several types of JISC projects in terms of websites:

  • Very good quality in terms of layout and content, but not specific to the project. For example, projects that link to the Archival Sound Recordings website, copyright of The British Library Board, or to Histpop, The Online Historical Population Reports website, copyright of the University of Essex
  • Good websites rich in information, offering project resources, directly or indirectly. They offer a good overview of the project, contact channels, the tools used and developed, documentation and links to related projects. They sometimes do not look sophisticated (mostly plain HTML), but provide useful content, links and a development version of the system
  • Good looking site, but quite empty of content. They provide no access to learning materials, no information about the evolution of the project, and almost no documentation. Sometimes the project website is a copy of the JISC project page, with no extra content
  • Minimal website. Websites with only a presentation paragraph
  • Half built websites with, for example, blogs that need to be configured
  • No website
  • The link to the website is broken

Websites are the main entry point for any potential user/contributor to the project, and the lack thereof nips the building of communities in the bud because projects are simply invisible to the outside world. The same can be said about contact channels; without the means to get in touch with project members, no community can be built.

Contact channels come in different shapes and sizes. Almost all JISC projects provide at least a contact address for the project manager, and quite often to other team members, but in order to build communities, projects need to provide means for users and collaborators to communicate with each other too (mailing lists, forums, wikis, etc).

Finally, websites and contact channels are not only entry points to the project, but the living memory of the project and the core around which the community is built. Therefore, it is paramount that they are set up right at the beginning and frequently updated.

Deliverables

JISC projects belong to different programmes and cover a wide range of aims and goals, and consequently produce a variety of outputs. This section identifies the main types of deliverables (each type identifies potential communities) and the features that can make them more or less open.

Digitisation of documents and/or creation of digital data libraries

These projects aim at building collections of digital data enriched with metadata from different sources:

  • scanning and OCR of paper pamphlets, books and journals, cartoons, documents of historic interest, newspapers, theses
  • art works
  • sound recordings
  • images
  • theatre resources: playbills, programmes, press cuttings, photographs
  • video tape
  • combinations of images, text, audio and video

Shibboleth

Shibboleth is an architecture for (and open-source implementation of) federated identity-based authentication and authorization infrastructure. There seems to be a trend towards using Shibboleth’s Web Single SignOn (SSO) to manage access to UK academic resources. The “SDSS: Shibboleth Development and Support Services” project aimed to build a federation of institutions, but it has been now absorbed into the UK Access Management Federation. Shibboleth is open source software (Apache 2.0) developed by the Internet2/MACE community. The specific contexts that JISC projects address for the use of Shibboleth are:

  • HE institutions
  • digital repositories
  • geospatial services

eLearning

Given that one of JISC’s focus is ICT in education, it is only natural that a large quantity of projects engage with eLearning systems. There is a whole vocabulary around eLearning:

Virtual Learning Environments (VLEs), Learning Objects (LOs), Units of Learning (UoLs), learning nugget
VLEs are computing systems that allow students to follow an online course. They consist of LOs, UoLs or learning nuggets, i.e. the course materials. Examples of LOs, UoLs or nuggets include papers, flash presentations, multi-choice online tests, slides etc.
Learning Design
The patterns or recipes that describe learning activities, i.e. how students learn (e.g. by reading a paper, holding a discussion over it and sitting a test). Learning Design is a general term, but it often refers in particular to the IMS Learning Design Specification, a standard model originally developed at the Open University of the Netherlands (OUNL)
blended learning
Combining different methodologies for teaching a course, e.g. face to face seminars with online resources
eAssessment
Systems and materials to allow teachers and students themselves to evaluate their progress in a course. ePortfolios, Personal Development Planning (PDP): Computer systems to create a demonstration of the user’s skills and knowledge. They may include notes, blogs, images, pdf files, multimedia etc.
course evaluation
Assessment of the course quality done by the students

Projects that aim to provide a web service in one or more institutions are quite closed in terms of the information they offer. There is a significant use of open source solutions, but they remain mixed with proprietary software.

Examples of software used or considered for use by JISC projects for eLearning are:

Moodle
An open source (GPL) VLE used to create online courses. Development is led by the Moodle company, Australia
LAMS
An open source (GPL) tool for the design of learning activities (http://www.lamsfoundation.org/). Development is led by Macquarie University, Australia
DialogPlus Toolkit
An online environment to create, modify and share learning activities and resources. Everybody can use it, but the source code or binaries are not available
RELOAD
Collection of open source (MIT licence) tools for packaging content (IMS CP), editing IMS Metadata, playing SCORM packages and editing and playing IMS Learning Designs. One of the projects aims to develop a plugin for to connect RELOAD to the UoLs repository OpenDocument.net (not online yet)
WCKER
Open source (MIT licence) plugin to create wizards for creation of online courses in RELOAD
Blackboard/WebCT
A suit of proprietary eLearning applications
Bodington
An open source (Apache License 2.0) VLE developed at the University of Leeds
SPIDER
Proprietary VLE developed by the University of Strathclyde
RLO-CETL Evaluation Toolkit
A questionnaire (CC attribution noncommercial sharealike) to evaluate LOs
Blogsom
an open source (BSD licence) multi-blog system
TikiWiki
Open source (LGPL) CMS
SIMPLE
Gaming technology being developed for teaching courses
WebPA
An online peer moderated marking system in development that will be released under the GPL
Xmarks
Middleware to link the staff and students record database (Oracle 9i) with Moodle
PebblePad
A proprietary ePortfolio system
Progress
A proprietary PDP service developed at Canterbury Christ Church University
‘Generic’ ePortfolio
Developed at Newcastle University to provide a baseline open source ePortfolio that can be customized for specific institutions. I could not find information about the licence, or the source code
ePET
A web service interface for the ‘Generic’ ePortfolio
ePARs
A proprietary PDP system developed at the University of Nottingham, and based on Microsoft products
LUSID
A proprietary PDP system developed at the University of Liverpool
Reflective Learning Diary
A proprietary PDP system developed at the University of Southampton
PETAL
In theory, an open source general e-portfolio tool Open Source Portfolio Initiative (OSPI) and the Certified Member of ALT scheme (CMALT). Development finished in March 2005, but there are no source code or binaries available
L4All
An ePortfolio Portal; I could not find information about the licence, source code or a link to the portal
ioPortal
A web application for PDP. I could not find any link to the source code or the binaries
ioNode
Middleware for exchange of Individual Learner Record data. I could not find any link to the source code or the binaries
EFSCE
e-Framework Services for Course Evaluation. Proprietary system developed by the University of Southampton
Horus
An open source (LGPL) course evaluation system, that is being deprecated

Virtual Research Environments (VREs)

Another focus of the JISC in education is research. VREs are “frameworks that offer tools, services and resources for research”. The JISC has 15 projects in the Virtual Research Environments Programme; at the time of this review, only 3 were active, but a further £2 million have been allocated until March 2009 for this programme [http://www.jisc.ac.uk/whatwedo/programmes/vre1].

There is little information on how the VREs are set up, especially in terms of hardware. Software includes a mix of open source and proprietary software, but there is not a clear use of open source development techniques.

The projects cover:

  • Support for the Virtual Research Environment Programme, in the sense of reviewing tools, “information on interoperability and associated standards and specifications”
  • A VRE for the Integrative Biology (IB) Research Consortium, in particular the user community’s requirements in the areas of graphical user interfaces and collaborative tools
  • A VRE for an MA in the History of Political Discourse 1500-1800, based on the Sakai Collaborative Environment [http://www.sakaiproject.org/]. This MA provided the “excuse” to build the VRE that is now used in the broader context of Early Modern Texts [http://www.earlymoderntexts.org/vre/]

Digital repositories

A digital repository is a collection of data or publications in digital format. The JISC has a specific programme for digital repositories (pdf link).

Projects in this category offer little or no information about the hardware they use, and those that provide a web service remain quite closed. However, there is a significant number of projects that have considered and/or are using open source solutions.

Projects reviewed included:

  • LOs
  • Statistical data of downloads from repositories
  • Policy, infrastructure, legal studies, DRM, hardware procurement
  • Search engines for engineering resources, institutional repositories
  • Development of digital repositories
  • Integration of research data with corresponding publications. Experimental physics, UK Data Archive
  • Implementation of digital repositories software, e.g. FEDORA, GNU ePrints, LionShare
  • Add features to digital repositories, e.g. tagging
  • Automated curation of robot generated data
  • Peer-reviewed papers
  • Shared clinical high stakes assessment

Products used or explored by these projects include:

GNU ePrints
An open source (GPL) digital repository system initially developed by the University of Southampton. It is a very mature open source project closely involved with Open Access (OA) and offers consultation, installation and support services
Curator
A proprietary digital repository system. I could not find much information online. Apparently it was developed by Endeavor Information Systems, part of the Ex Libris group
Digitool
A proprietary digital repository system also developed by the Ex Libris group
Fedora
An open source (Educational Community License) digital repository system
DSpace
An open source (BSD licence) digital repository system initially developed by the MIT Libraries and Hewlett-Packard (HP)
Greenstone
An open source (GPL) digital library system developed by the University of Waikato in collaboration with UNESCO and the Human Info NGO
PerX
A search engine pilot for engineering resources
LionShare
An open source (GPL) digital repository system based on P2P. Only registered users can download the binaries (source code available for everybody), and the system will only be used by registered users at partner universities of the “SPIRE” project
JHOVE
An open source (LGPL) framework for automatic identification and validation of digital content. “RepoMMan: Repository Metadata and Management”
ActiveWebflow Professional
A proprietary plugin for Eclipse to write Business Process Execution Language (BPEL)
The Depot
A UK digital repository for peer-reviewed papers, articles, and book chapters, developed by “The Depot” project and based on ePrints
Jorum
An online digital repository service for learning and teaching materials developed as a JISC project. The software behind the service is proprietary and it does not seem to be possible to obtain it, although access to the service is free
Contextual Learning Activity Repository (CLARe)
A service developed at the University of Southampton. Proprietary software, and there seems to be no public access to the service either

It is also worth mentioning Open Access (OA), an initiative for immediate, free and unrestricted online access to digital scholarly material that is in the focus of several digital repository projects. This is the scientific publication equivalent to open source in software, although there are some differences. For example, in scientific literature peer-review is usually enforced before publication, and the ability to reuse and modify source code is a main characteristic of open source, but not a big concern in scientific papers.

The “IRIScotland” project concluded that populating repositories now presents a greater challenge than setting them up, and that there are 3 factors that hinder making them actually useful in Scotland [http://www.iriscotland.lib.ed.ac.uk/about/index.html]:

  • Lack of policies, procedures and mechanisms for OA in institutions
  • Lack of a wide institutional digital repository infrastructure
  • Lack of understanding of what functions should be undertaken at an institution level, and at a national level

Technical support

Some projects aim at providing technical support for other projects. This review found 2 instances:

  • Pedagogic and technical support to all projects in the Design for Learning Programme, hosting and/or searching services for learning designs
  • Provide a Shibboleth infrastructure for projects supported by the Core Middleware Development programmes

Software

The main reason for this category, that is quite wide in scope and overlaps with others, is to have a starting point to consider projects that are potentially interested in OSS Watch advisory services.

Projects were grouped in 3 subcategories:

Proprietary
When there is evidence of a commercial licence, or lack of evidence of an open source licence, combined with a lack of open development practices:
  • A web service is publicly available, but not the source code, binaries, and there is no indication of what licence they plan to use, if any
  • Binaries are provided, but no source code with an open source licence
  • Development is behind closed doors, and there is no indication of the type of licence they will use if they decide to release the software
Semi-open
When the project aims to produce open source software, but they are lacking a licence, or follow proprietary software development methods:
  • The project has manifested the intention to use open development and an open source licence, but has failed for the moment to do so
  • Source code is available, but there is no licence information
  • Source code is available, and an open source-like licence provided, but it is not an OSI-approved licence
  • There is no stated intention to develop open source software, no available binaries, source code or licence information, but the project builds on top of software with a viral licence (e.g. the GPL); thus the project will be forced to release their software under the same licence
Open
When the project produces open source software following open development methods: InterLoc
An open source (Apache v2.0, LGPL and Mozilla) groupware tool for educational dialogues. The source code is available from SourceForge [http://sourceforge.net/projects/interloc]. Based on jabber
D4LD
A front-end user interface to run Learning Designs. Builds on 2 previous open source projects, SleD (LGPL) and the CopperCore LD Engine (GPL) and improves them, also doing integration with Moodle
LauLima
An open source (LGPL) customised TikiWiki-based system comprising a learning environment and digital library
Integrative Biology VRE
An open source (BSD licence) VRE for the Integrative Biology project. The source code is available from SourceForge [http://sourceforge.net/projects/ibvre/]

Some tools used for software development:

Trac
An advanced wiki and issue tracking system with a modified BSD licence
Mantis
An open source (GPL) bug tracking system
MediaWiki
An open source (GPL) Wiki system
Joomla!
An open source (GPL) CMS
SourceForge
A proprietary collaborative development environment
LISTSERV
A proprietary mailing list server
Jabber
A set of free XML protocols used for instant messaging
WordPress
An open source blogging system
Confluence
A proprietary wiki system

National Centre

One project in the review dealt with building a new “National Centre for Text Mining”. Of the reviewed projects, this is one of the most ambitious in terms of the broad scope of its deliverables (software tools, services, documentation, training) and sustainability. It is important for OSS Watch to acknowledge the complexity of this kind of project when providing support and developing tools like Simal.

Data

As well as software or infrastructure, several projects produce data (documentation will not be considered data in this section). The availability and conditions under which data is released reflect on the openness of the project. Similarly to software, project outputs can be grouped in 3 categories:

  • Proprietary data, typically “All rights reserved”
  • Semi-open data: Data with some access restrictions, for instance only academic institutions
  • Open data: Data publicly available, for example through OA policies

Some limitations found in proprietary data:

  • It is necessary to ask the copyright holder for permission to reproduce, display, store, etc. the material
  • Only thumbnails of pictures are provided; to obtain full size reproductions it is necessary to contact the picture library
  • Materials only available for the college where the project takes place

Characteristics of semi-open data:

  • Materials are publicly available, but they are under “All rights reserved” copyright
  • Materials are publicly available, but I could not find information about the licence

    • icons
    • video
    • video and audio
    • maps and georeferenced data
  • Materials available for private, research and student use

    • Only UK FE and HE licensed institutions can download and play the recordings
    • Free registration access to database is required. Copyright grants private or student use, but not publication
    • Crown copyright, material can be downloaded from DocumentsOnline, and reproduced, but there is a fee for images, and use is only for research, non-commercial or education
    • VLOs only available for students
  • Materials freely available within the UK only
  • Materials available only or mostly in proprietary formats
  • The project shows interest in OA, but there is no data and/or enough details

    • laboratory robot data
    • theses
    • images and documents of everyday life in the UK
    • drawings
  • No available information on how data will be released

Finally, for open data:

  • Public access through JSTOR and Google Scholar search
  • OA content; the website with the project outcomes has publicly available data

Documentation

Projects produce documentation for many reasons: To describe the project as a condition of the bid for funding, as deliverables of projects that perform reviews or produce standards or policies, to document software application or network architectures, etc.

Availability and licensing of this documentation is important in terms of openness. JISC projects typically offer online documentation, but do not release it under copyleft licences.

This section follows a similar classification of projects as the “Data” section.

Proprietary documentation:

  • Username and password required to access documentation, and no public registration
  • No website, or website with no documentation
  • Website provides little documentation, and no licence information

Semi-open documentation:

  • Some documentation is available, but it cannot be redistributed
  • Public documentation with open formats, but there is no licence information. It is interesting to comment on the “RepoMMan: Repository Metadata and Management” project. They investigated the development of a workflow tool for the Fedora open-source repository software. As they found no proper documentation, they wrote their own manual and made it publicly available
  • Public documentation using proprietary formats
  • Public documentation in a mix of open and proprietary formats

    • PDF and MS Office
    • Scientific journals
  • Public wiki but no registration possible
  • Promise of making contents “open” in the future

Open documentation:

  • Public wiki that anyone can edit, although there is no licence information
  • Public documentation that anyone can edit, with a Creative Commons licence

Standards

A measure of openness for projects is the use of standards. Standards avoid vendor lock-ins, improve interoperability and sustainability. The standards most often used by JISC projects are:

Metadata Encoding Transmission Standard (METS)
An XML standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library
eXchanging Course-Related Information (XCRI)
A vocabulary and appropriate technology bindings (e.g. XML, RDF) to describe UK course-related information that encompasses course marketing, course quality assurance, enrolment and reporting requirements
Open Archive Initiative (OAI)
Interoperability standards for dissemination of content
L2O metadata scheme
Standard for metadata in language skills LOs
MathML
A W3C ‘low-level’ specification for describing mathematics as a basis for machine to machine communication’
IMS
‘Open technical specifications for interoperable learning technology’ defined by the IMS Global Learning Consortium, Inc. For example, ‘the IMS Question & Test Interoperability (QTI) specification describes a data model for the representation of question (assessmentItem) and test (assessmentTest) data and their corresponding results reports’ [http://www.imsglobal.org/question/]
SCORM
A collection of standards and specifications for ‘interoperability, accessibility and reusability of Web-based learning content’
Chemical Markup Language (CML)
An SGML-based standard to manage molecular information
Business Process Execution Language (BPEL)
A language for the formal specification of business processes and business interaction protocols
Security Assertion Markup Language (SAML)
An XML standard for exchanging authentication and authorization data between security domains developed by the OASIS Security Services Technical Committee and used by Shibboleth

Relationship between projects

There are strong connections between JISC and non-JISC projects, present and past. These relationships take many different forms, e.g.

  • Providing support
  • Providing sustainability
  • Adding features or functionality
  • Combining projects
  • Using outputs from other projects
  • Creating open source software versions of proprietary software products
  • Continuation of previous projects
  • Massively collaborative projects
  • A JISC service manages and works for a JISC project

This not only points out that JISC projects can naturally build communities, but it is of particular importance to OSS Watch when considering how to describe and make catalogues of projects.

Conclusions

While the only condition for software to be considered open source by OSS Watch is that it is released under an OSI certified licence, the openness of a JISC project needs to be evaluated more widely.

This review has identified several categories of interest for openness: Governance and community, visibility, deliverables, standards and relationship between projects, and found out that many JISC projects are either following proprietary or semi-open practices, but there is noteworthy interest in the usage of open source products, open standards and documentation too.

Moreover, the ability of a project to integrate within and/or build a community is vital in terms of sustainability.

OSS Watch can help existing and new projects identify and work with their communities of interest as well as assisting with the engagement of openness in each category described above.

Regarding the methodology, this study had to be made visiting each project website one by one, a very time consuming process that does not allow for good quantitative analysis. But it is to be expected that Simal, the new project started at OSS Watch to provide catalogue building tools will greatly improve future reviews.