Parasitology Grid
Project Leader: Michael Mee, University of Technology, Sydney
Description
The primary goal of the work is to develop a framework to enable the sharing and remote processing of biological data for the ARC Network for Parasitology. This program commenced July 2005 as part of the creation of a new support unit at UTS. Due to the late start of the project, a higher rate of effort has been put into the project.
This project of work was developed at UTS during the second half of 2005. The particular aims are as follows:
- Set up a grid-accessible storage repository at ac3/UTS to store biological data for the Parasitology Network.
- Develop standardised bioinformatics workflows and interfaces to the APAC grid by adapting tools developed in the European myGrid program (in particular Taverna).
- Assist in the installation of bioinformatics programs and in the integration of biological databases onto the ac3 cluster.
Additionally, in conjunction with the APAC Information Infrastructure Program:
- Assist in the development of infrastructure to support access to public biological data stored on the Bio-mirror server.
- Assist in the development of intelligent update procedures for the publicly available databases.
Achievements
| 2005 |
- Creation of Computational Research Support Unit.
- Preliminary discussion held at Bioinformatics Application Support Project Workshop (at Murdoch University).
- Preliminary discussions with members of IT Subcommittee of ARC Network for Parasitology (Prof. Ross Coppel [Monash] and Dr Sabina Belli [UTS]).
- Development of project proposal.
- Initial requirements analysis for Parasitology Network portal.
- Demonstration of workflow to automate queries against public web portals containing parasitology data.
- Specification and ordering of test server hardware based on APAC gateway machine specifications (UTS funded).
|
| 1H2006 |
- Installed test server at UTS including required software such as SRB, globus, environment modules, bioinformatics software, Trac/subversion for project management.
- Set up of website framework for the Parasitology network and bulletin board to allow for collaboration. The website is based on wiki format so the members can maintain site.
- Assessment of Taverna workbench as platform for workflow development and execution.
- Created parasitology specific Taverna modules to access public data portals and for use in the running of a specific workflow from the desktop. There has been some difficulty in getting feedback and requirements from the Parasitology network.
- Developed and ran course for biologists/bioinformaticians in the use of Taverna.
- Installed GUS/WDK to test its suitability as a framework for holding Parasitology data and creating tools to access the data.
- Developed procedures to install bioinformatics tools on ac3 cluster to satisfy the associated grid framework requirements. The initial application installed was the NCBI toolkit (blast) as this is a prerequisite for later a milestone.
- Initial development of web portal (http://crsu.science.uts.edu.au/reblast) to perform automated regular sequence searches to report new matches in public databases.
|
Milestones for 2006
- Continued development of ‘reblast’ web portal.
- Create standardised database of test parasite data – awaiting data.
- Creation and deployment of portal for data access, searching, and post-processing.
- Ability to run bioinformatic programs at ac3 from desktop.
Participating Organisations
- University of Technology Sydney
- ac3
- ARC Network for Parasitology
Resources for 2006
- Total resources available to the project for 2006 are 1.49 efts (UTS 1.49)
- APAC is providing funds to support 0.67 efts (UTS 0.67).