Friday, December 15, 2017
1.4. How microarray platforms are annotated
1. General

All the microarray platforms in the database go through the following annotation procedure. The probe sequences are mapped to physical coordinates of the human genome with a MegaBlast analysis.

Because cDNA clones do not contain introns, a cDNA probe is likely to yield multiple hits on the genome, separated by the intronic regions. Also, cDNA or BAC clones can be too long to be sequenced in full; in the best case both of their ends have been sequenced, which likewise leads to multiple hits for a given probe.

For this reason, all of the MegaBlast hits for a probe are joined together if they meet the following two criteria:
- they are on the same chromosome
- they are within 2.5 megabases of each other

If the hits cannot be joined, the probe is excluded from further analysis.
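
The joining rule above can be sketched in a few lines of Python. This is only an illustration, not the actual annotation pipeline: the `Hit` type and the `join_hits` function are hypothetical names, and "within 2.5 megabases of each other" is assumed here to mean that the whole joined region, from the lowest start to the highest end, spans at most 2.5 Mb. A probe with a single hit is assumed to be kept as it is.

```python
from typing import NamedTuple, Optional, Sequence

MAX_SPAN_BP = 2_500_000  # 2.5 megabases, the threshold given in the text


class Hit(NamedTuple):
    """One MegaBlast hit of a probe (hypothetical structure for illustration)."""
    chrom: str
    start: int
    end: int


def join_hits(hits: Sequence[Hit], max_span: int = MAX_SPAN_BP) -> Optional[Hit]:
    """Join all hits of one probe into a single region.

    Returns None when the hits cannot be joined, meaning the probe
    would be excluded from further analysis.
    """
    if not hits:
        return None  # no hits at all: nothing to annotate

    # Criterion 1: every hit must be on the same chromosome.
    if len({h.chrom for h in hits}) != 1:
        return None

    # Criterion 2 (assumed interpretation): the joined region must not
    # span more than max_span base pairs.
    start = min(h.start for h in hits)
    end = max(h.end for h in hits)
    if end - start > max_span:
        return None

    return Hit(hits[0].chrom, start, end)
```

For example, two hits on chr1 at 1000-1200 and 5000-5300 would be joined into a single region 1000-5300, whereas hits on two different chromosomes, or hits 3 Mb apart, would return `None` and the probe would be dropped.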

