Microarray Data Analysis Assignment
(25% of CA578/579 As Appropriate)
NEW! Due Monday 21st April 2008 at 5pm
Please submit to the School Secretary in the School of Computing
Instructions:
- Take the raw data:
  - DataSet_1
  - DataSet_2
  - DataSet_3
  - DataSet_4
  - DataSet_5
- Implement the Hierarchical Clustering algorithm with UPGMA (average distance between clusters) and various distance metrics (Pearson, Manhattan).
- Results required: a plot of the data in experiment space showing the clustering according to each distance metric, and a text file with the gene clustering using the notation (A, ((B,C),D)), meaning B & C are closest, then D, etc.
- Write a concise (2 page max) report on the project summarising your results.
- Individual effort is required. Please write and sign a declaration on your submission of the form: "Except where otherwise stated, the following is all my own work. I have read and am aware of the University's rules concerning plagiarism." These rules are shown here.
- Due 5pm on Monday 21st April. Penalty of 5% per day late.
- Here are some hints (and the weblink from the tutorial)
- Notes from the tutorial
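
As a rough illustration of the clustering step and the output notation asked for above, here is a minimal Python sketch. It is not a model solution, and it rests on several assumptions that are not part of the assignment text: the Pearson distance is taken to be 1 minus the correlation coefficient, the function names (pearson_distance, manhattan_distance, upgma, to_notation) are made up for this example, and the four gene profiles are toy values rather than anything from DataSet_1 to DataSet_5.

```python
import math


def pearson_distance(x, y):
    """Assumed definition: 1 - Pearson correlation (0 = identical shape, 2 = opposite)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 1.0  # assumption: a flat profile is treated as uncorrelated
    return 1.0 - cov / (sx * sy)


def manhattan_distance(x, y):
    """Sum of absolute differences across experiments."""
    return sum(abs(a - b) for a, b in zip(x, y))


def upgma(profiles, distance):
    """Cluster a dict {gene name: expression values}; return a nested-tuple tree."""
    names = list(profiles)
    # Active clusters: id -> (subtree, number of genes in the subtree).
    clusters = {i: (name, 1) for i, name in enumerate(names)}
    # Pairwise distances, always keyed as (smaller id, larger id).
    dist = {}
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            dist[(i, j)] = distance(profiles[names[i]], profiles[names[j]])
    next_id = len(names)
    while len(clusters) > 1:
        a, b = min(dist, key=dist.get)  # closest pair of active clusters
        tree_a, size_a = clusters.pop(a)
        tree_b, size_b = clusters.pop(b)
        # UPGMA update: size-weighted average of the two old distances.
        new_dist = {}
        for c in clusters:
            d_ac = dist[(min(a, c), max(a, c))]
            d_bc = dist[(min(b, c), max(b, c))]
            new_dist[(c, next_id)] = (size_a * d_ac + size_b * d_bc) / (size_a + size_b)
        # Drop every distance that involves one of the merged clusters.
        dist = {k: v for k, v in dist.items() if a not in k and b not in k}
        dist.update(new_dist)
        clusters[next_id] = ((tree_a, tree_b), size_a + size_b)
        next_id += 1
    (tree, _size), = clusters.values()
    return tree


def to_notation(tree):
    """Render a nested-tuple tree in the bracket notation, e.g. (A,(D,(B,C)))."""
    if isinstance(tree, tuple):
        return "(" + ",".join(to_notation(t) for t in tree) + ")"
    return str(tree)


# Toy profiles (hypothetical values, two experiments per gene).
toy = {"A": [0.0, 0.0], "B": [10.0, 10.0], "C": [10.0, 11.0], "D": [12.0, 12.0]}
result = to_notation(upgma(toy, manhattan_distance))
print(result)
```

With the toy profiles and the Manhattan metric this prints (A,(D,(B,C))): B and C merge first, then D joins them, then A. The order of the two children inside a pair carries no meaning, so this is the same tree as (A, ((B,C),D)). Passing pearson_distance instead of manhattan_distance reuses the same clustering code with the other metric.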