|
Centre for Bioinformation Science The Centre for Bioinformation Science (CBiS) forms a bridge
between two areas of major strength at ANU, the mathematical and biological
sciences. CBiS brings together researchers with backgrounds in mathematics,
statistics and quantitative biology with the goal of developing a conceptual
architecture for an information-based, integrative approach to complex
biological systems. These cross-campus collaborations provide a nucleus for the development of further links. To encourage further collaborations and improve the quality of bioinformatics practice on Campus, we undertook two initiatives in 2003. The first was to support a half time position to work in collaboration with staff in the Biomedical Resource Facility, mainly concentrating on development of statistically rigorous methods for analysis of microarray data. The second initiative was development by CBiS of Graduate courses “Bioinformatics for Biologists” organised by Gavin Huttley. We have also been heavily involved with outside organizations and with the organization of outside activities. A highlight of 2003 was the extremely successful BioInfoSummer Symposium "Mathematical, Statistical and Computational Challenges in Bioinformatics" organised by Hilary Booth (Chair), Yi Fang, Alex Isaev and Sue Wilson. |
Dr H Booth
Research during the year was undertaken on the following projects:
Dr C Burden
In collaboration with S. Wilson and Y. Pittelkow, research has been carried
out on a statistical analysis of physically based adsorption models of oligonucleotide
microarrays. The hybridization of labelled target cRNA to short oligonucleotide
probes at the surface of microarray chips is believed to be driven by competing
processes of adsorption and desorption. Using data from the Affymetrix HG-U95A
Latin Square experiment, we have assessed the relative merits of various physical
models including equilibrium and non-equilibrium Langmuir models. This treatment
differs from existing analyses in the use of more relevant generalized linear
models. Particular attention is paid to questioning the validity of assuming
universality of parameters across genes, or across probes within a probeset.
We have also improved the accuracy of existing methods of inferring absolute
mRNA concentrations from probe intensity values using adsorption models.
Professor S Easteal
Research direction has continued to be aimed at understanding the basis of
a number of genetic disorders, and of the patterns of normal genetic variation
in human populations. A particular focus is the genetic basis of personality
variation, which is associated with a predisposition to common forms of mental
illness involving depression and anxiety. This work is done in collaboration
with the NH&MRC Centre for Mental Health Research.
Dr Y Fang
My work during 2003 may be considered under three main headings:
Dr G Huttley
A software toolkit aimed at comparative genomics has now largely been completed.
The toolkit is the basis for papers currently submitted, and ongoing research
projects. The outstanding questions currently being assessed are an examination
of sex specific mutation processes in humans; the influence of the DNA modification
5-methylcytosine on the pattern of divergence between coding and non-coding
DNA sequences; characterising the factors influencing sequence variation at
the breast and ovarian cancer susceptibility locus BRCA1 in the mammals; a
maximum-likelihood based phylogenetic footprinting approach; and, statistical
techniques for estimating functional dependence between sequence residues
using comparative sequence data.
Dr A Isaev
Bioinformatics research continued on models of dependent evolution of DNA
and protein sequences. Implementation is now in progress, and I helped resolve
many issues arising in the implementation process.
Dr J Maindonald
During 2003 I was involved in the following projects:
Estimation of between slide variance, for use in the denominator of t-statistics and F-statistics, is a major issue for the statistical analysis of cDNA microarray data. On the one hand, estimates that are based on individual gene variation are, when the number of slides per treatment is small, highly affected by sampling variation. On the other hand, systematic changes in variance with average intensity, with print tip and with print tip order complicate the attempt to combine variance information from multiple genes.
Dr R Turakulov
Analysis of genome scale SNP genotyping data.
Genetic strategies for understanding genotype-phenotype relationships in human
disease depend on assumptions about the architecture of variation in the human
genome. We have analysed patterns of SNP variation at >5,000 SNPs for samples
of African-American, Asian and Caucasian populations available from The Single
Nucleotide Consortia Ltd (TSCL) [http://snp.cshl.org/].
From this analysis two major conclusions were found: 1. Evaluation of the number
of SNPs required to reliably assign individuals to the population from which
they are derived. Our results show that less than 100 loci are required to identify,
with high confidence, the population source of individuals in these samples.
2. Identification of loci with unusually high levels of variation among populations.
A list of these loci has been generated. They are candidates for the action
of diversifying natural selection and they are highly informative indicators
of the population affinity of individuals.
Dr M Wakefield
The major research focus continued to be Comparative Genomics, with particular
emphasis on using divergent mammals to understand genes of medical importance.
The adaptive evolution of the breast cancer gene BRCA1 and the evolutionary forces that have driven rapid change in this gene throughout the mammalian radiation are the main focus of current research efforts. This year saw the initiation of experiments to determine if differences in marsupial and monotreme lactation and X chromosome inactivation have been the driving force for waves of adaptive changes in BRCA1.
Through involvement in the ARC Centre for Kangaroo Genomics initial work on the curration of genome mapping data and development of analysis methods has proceeded in 2003, resulting in the development of a new maximum likelihood based phylogenetic footprinting program for the identification of conserved segments of the genome that are indicative of control regions.
Professor S Wilson
Research continued in statistical genomics and bioinformatics with emphasis on the following projects:
BioInfoSummer 1-5 December, 2003
"Mathematical, Statistical and Computational Challenges in Bioinformatics"
was the theme of the extremely successful Australian Mathematical Sciences
Institute (AMSI) Summer Symposium in Bioinformatics, hosted by the Australian
National University's Centre for Bioinformation Science, December 1-5. Sponsors
also included Cray, Ceanet, the Australian Partnership for Advanced Computing
and the Australian National University's National Institute for Bioscience
(NIB). The days were organised into Themes, allowing attendees who were unable
to attend the whole week to select those areas of particular interest. The
Themes were: Introduction to Molecular Biology; Sequences & Data; Evolutionary
Models & Genetics; Protein Structure & Function and Microarrays &
Experimental Design, the last theme being held jointly with the IBS Australasian
Region's biennial meeting. Each day started with Educational Lectures followed
by a mix of Keynote and Specialist talks. Most days had parallel sessions
in the afternoon, allowing those students who had registered for the Graduate
Course Award in Bioinformatics to do the Educational Computer Lab session
while others attended Specialist talks.
The keynote speakers were
John Mattick (Institute for Molecular Bioscience, Brisbane)
Terry Speed (Walter & Eliza Hall Institute, Melbourne)
Simon Easteal (ANU)
Warren Kaplan (Garvan Institute, Sydney)
Mark Ragan (IMB, Brisbane)
Gordon Smyth (WEHI, Melbourne)
Sue Wilson (ANU)
A poster session was held on Tuesday evening, and student scholarships were awarded to 25 successful applicants. The ceremony was followed by a well-attended public lecture by Jenny Marshall-Graves (ANU) on "Unravelling the Kangaroo Genome". NIB prizes were awarded to Ann Kwan (best poster), Penny Bennett and Natalie Thorne (best student talk, shared) and Jim Stankovich and Antonio Reverter (best talk by a researcher within ten years of receipt of PhD, shared).