Volume 53 (3): 413-422, 2005 Copyright ©The Histochemical Society, Inc. Molecular Karyotyping : Array CGH Quality Criteria for Constitutional Genetic Diagnosis
Center for Human Genetics, University Hospital Gasthuisberg, Leuven, Belgium (JRV,CM,NM,J-PF,KD); Center for Human Genetics, Flanders Interuniversity Institute for Biotechnology (VIB4), Department of Human Genetics (GF,PM), MicroArray Facility, Flanders Interuniversity Institute for Biotechnology (VIB) (BD,PVH), Leuven, Belgium; ESAT-SISTA K.U. Leuven (SVV,BDM), Leuven, Belgium; and Department of Medical Genetics, Ghent University, Ghent, Belgium (SV,BM,FS) Correspondence to: J.R. Vermeesch, Center for Human Genetics, Herestraat 49, 3000 Leuven, Belgium. E-mail: Joris.Vermeesch{at}uz.kuleuven.ac.be
Array CGH (comparative genomic hybridization) enables the identification of chromosomal copy number changes. The availability of clone sets covering the human genome opens the possibility for the widespread use of array CGH for both research and diagnostic purposes. In this manuscript we report on the parameters that were critical for successful implementation of the technology, assess quality criteria, and discuss the potential benefits and pitfalls of the technology for improved pre- and postnatal constitutional genetic diagnosis. We propose to name the genome-wide array CGH "molecular karyotyping," in analogy with conventional karyotyping that uses staining methods to visualize chromosomes. (J Histochem Cytochem 53:413422, 2005)
Key Words: array CGH molecular karyotyping constitutional cytogenetics prenatal diagnosis postnatal diagnosis
PRE- AND POSTNATAL cytogenetic testing is aimed at the genome-wide analysis for the detection of chromosome aneuploidies and segmental aneusomies. Traditionally, karyotyping has performed this role. In addition to its clinical importance, the identification of a chromosomal aberration in specific patients has proven to be a successful way to identify the implicated genes and to gain insight in the pathogenesis of different genetic conditions.
Over the last decades, improved resolution of cytogenetic techniques has lead to a significant increase in the detection rate of chromosomal aberrations in patients with mental retardation (MR) and/or congenital anomalies. However, resolution of traditional cytogenetic techniques is limited to at best Although fluorescence in situ hybridization (FISH) has dramatically increased the sensitivity of detection of genomic imbalances, this approach requires prior knowledge of the chromosomal region(s) of interest and therefore is not applicable for whole genome screening approaches required in a diagnostic setting.
More recently, so-called array or matrix CGH (comparative genomic hybridization) utilizes mapped DNA sequences in a microarray format as a platform for the detection of chromosomal deletions/duplications. In this technique, genomic DNA from the patient is labeled with one fluorescent dye while a normal reference sample is labeled with a different dye, and these samples are co-hybridized to the array containing the genomic DNA targets. Chromosomal imbalances across the genome can thus be quantified and positionally defined by analyzing the ratio of the fluorescence of the two dyes along the targets. The resolution of array CGH depends on the size of the genomic fragments as well as on their density. Proof of principle was established in 1997 (Solinas-Toldo et al. 1997
Array CGH Arrays were constructed using a 1 Mb Clone Set, which contains 3527 BAC and PAC clones, 3275 with known and fixed genomic position (Fiegler et al. 2003 In a second round of amplification, an aminolinked primer was used. A 100-µl reaction was performed by combining 2 µl of the first reaction product with 1.5 µM primer (5'-NH2GGAAACAGCCCGACTCGAG-3'), 200 µM dNTPs (Amersham Biosciences), PCR buffer, 5 mM MgCl2 and 2.5 U Platinum Taq DNA polymerase (Invitrogen). Thermal cycles were performed as follows: 10 min denaturation at 95C; 35 cycles of 1 min at 95C, 1.5 min at 60C and 7 min at 72C, and a final elongation at 72C for 10 min. Following amplification, the PCR products were purified by Qiaquick 96-well PCR purification kit following the manufacturers instruction (QIAGEN Inc.; Valencia, CA). The purified DOP PCR products were EtOH precipitated.
DOP PCR products were spotted on either CodeLink Bioarray System slides (Amersham Biosciences), type VII star silane-coated mirror slides (Amersham Biosciences) or UltraGAPS amino-silane-coated slides (Corning, Corning, NY). For spotting on amino-silane-coated slides, regular DOP PCR primers (without aminolinker) were used to amplify the BAC DNA (Van Buggenhout et al. 2004 To test printing quality, a pretreated slide was hybridized with 70 pmol of Cy3-labeled oligo primer (5'-Cy3-GGAAACAGCCCGACTCGAG-3') for 1 hr. Following hybridization, the slide was washed with H2O and scanned to check printing quality.
Genomic DNA (gDNA) from an anonymous cell line with karyotype 46,XX or 46,XY was used as a reference. DNA was extracted from the blood of a trisomy-13 carrier following standard procedures. Test and reference gDNA were labeled by a random prime labeling system (Bioprime DNA Labeling System; Invitrogen) using Cy3- and Cy5-labeled dCTPs (Amersham Biosciences) as described (Van Buggenhout et al. 2004
The slide was blocked with 50 µg Cot-1 DNA and 300 µg salmon sperm DNA dissolved in 60 µl hybridization buffer. Blocking solution and probe mixture were denaturated for 10 min in a water bath at 75C. Blocking solution was then placed on the slide covered with a coverslip (24 x 60 mm) and placed in a humid chamber. Meanwhile, the probe was placed at 37C for preannealing. After 1 hr of blocking, the coverslip was removed and probe was placed on the slide. After placing a coverslip (24 x 60 mm), the slide was placed in a humid chamber saturated with 20% formamide and 2x SSC. Hybridization was allowed to take place for two nights at 37C. While optimizing the protocol, it was noticed that the targets at the outer ends of the slide often showed reduced signal intensities when sealing the slide with rubber cement. Eliminating the sealing of the coverslips by hybridizing the slides in small humid chambers yielded equal intensity ratios over the entire (full) slide. Post-hybridization washes were performed as described by Fiegler et al. (2003) Three types of slide were compared: amino-silane-coated slides on which the target DNA is cross-linked by UV light, CodeLink Bioarray System slides on which the DNA is covalently linked via the amino group of the DOP primer, and type VII star silane-coated mirror slides. When hybridized with 250 pmol probe each, the signal-to-noise ratios on amino-silane-coated and CodeLink Bioarray System slides were 610 and 2030 for the Cy3 and Cy5, respectively. The mirror slides were hybridized with 200 pmol probe resulting in signal-to-noise ratios of, respectively, 5.3 and 2.7 for the Cy3 and Cy5. These values are derived from at least 10 hybridizations.
Image and Data Analysis
Parameters Influencing Array CGH Quality The two main technical challenges when performing array CGH are obtaining adequate signal-to-noise ratios and low standard deviations (SD) of the intensity ratios. In contrast to expression arrays, genomic array CGH deals with a vastly more complex probe mixture, and the copy number variations are much lower. In pre- and postnatal diagnostic samples, copy-number ratios vary mainly between 1/2 for deletions and 3/2 for duplications. To obtain optimal signal-to-noise ratios, controlling the fluorescent dye incorporation during labeling, assaying probe quantity, type of slides used, and adequate hybridization skills proved essential. Table 1 provides an overview of these parameters and lists the measures that were taken to optimize them.
Because usually only a small subset of the genome is expected to be aberrant in pre- and postnatal constitutional abnormalities, the intensity ratios of patient vs control DNA will be equal for most spots on the array. This offers a unique possibility to measure the quality of an array experiment by determining the SD of all intensity ratios. The log2 values of these intensity ratios behave in a linear fashion because the relation of the intensities of both dyes inversely influences the value of the intensity ratio at a spot. The normalized log2 of the intensity ratio values will typically fluctuate around 0. The lower this fluctuation, the better the array CGH experiment. To obtain the lowest SD of the intensity ratios, adequate signal-to-noise ratios are essential as well as an optimized Cot-1/probe ratio (Figure 1). During these optimization experiments we noted that not only insufficient Cot-1 (Figure 1C) but also an excess of Cot-1 (Figure 1A) leads to an increase in the SD. The SD of all pooled clones can be further reduced by averaging several independent hybridization experiments (Figure 2).
Quality Criteria and Threshold Values When karyotyping, the resolution of an experiment is defined by the number of bands that can be discerned. When performing a genome-wide array CGH experiment, in theory all clones will provide a value, and the resolution of the experiment is defined by the number of genomic fragments on the array as well as by their size. However, in practice, a number of spots fail due to bad quality, e.g., bad spotting or hybridization efficiency. Therefore, it is essential to report the percentage of spots that were amenable to analysis. In our present study, typically more than 97.5% of the spots provided adequate intensity ratio values, i.e., signal-to-noise over two and the SD among the duplicates less than 10%.
When performing array CGH, the main challenge is to define a threshold value above which no false positives are retained without eliminating true positives (i.e., avoiding false negatives). Since the log2 transformed normalized intensity ratios fluctuate in a Gaussian fashion around 0, the SD can be used to define thresholds. A threshold level is described as mean plus or minus three or four times the SD (Schwaenen et al. 2004 In addition, using four times the SD as a threshold value defines an important array quality value: the value of the SD of the log2 transformed normalized ratios. To be applicable in constitutional diagnosis, the value of four times the SD needs to be below the detection limit of an autosomal deletion or duplication. Because the ratio of a duplication (3/2) is closer to a normal ratio than the ratio of a deletion (1/2), four times the SD needs to be below the detection limit of a duplication. This threshold can be defined as the log2 transformed mean intensity ratio of the duplicated loci minus two times the SD or
4*SD
Polymorphisms Polymorphisms can be identified on arrays as those clones for which the intensity ratios of independent experiments repeatedly fall outside the above-defined cut-off level. Identifying and reporting these polymorphic clones is thus an essential first step toward proper interpretation of array CGH data. When abnormal values are obtained in a number of separate experiments either using DNA from individuals without an obvious clinical phenotypic or in different experiments with DNA from individuals with very different clinical phenotypes, it can be assumed that these clones are polymorphic. Based on the DNA analysis of 30 individuals, aberrant signal intensity ratios were obtained in at least three different experiments in 18 loci (Table 2). However, their status as polymorphic loci awaits further experimental ascertainment.
Polymorphic loci will not only be detected by intensity ratios repeatedly surpassing the four times SD threshold value, but when the polymorphisms are embedded within a single clone on the array, they will be evidenced as smaller deviations from the average intensity ratio. Indirect evidence for such polymorphisms comes from experiments with a low SD. A low SD results in a low threshold level which, in turn, results contradictory in many more DNA fragments that are (false) positive for a deletion or duplication than in a less optimal experiment with a higher SD (data not shown).
Therefore, to eliminate further analysis of these "false positive" values, we suggest using not only the log2(1) ±4*SD as a threshold but, in addition, use the log2 of the mean of the duplicated loci minus two times the SD as a second higher threshold97.72% of duplicated loci and To empirically test these theoretical figures, DNA from a normal cell line was hybridized vs DNA from a cell line trisomic for chromosome 13 (Figure 3A). The observed log2 transformed mean of the intensity ratios of duplicated loci was 0.53 rather than the theoretical 0.58. The SD of the log2 transformed intensity ratios at all normal spots was 0.08 while the SD of the duplicated loci was 0.1, rather than the theoretical SD of 0.08. From a total of 102 chromosome 13-derived loci, 88 were above the higher threshold, 10 between the higher and lower threshold, and 4 loci below the lower threshold. Hence, the intensity ratios of 13% rather than the expected 2.3% of the duplicated loci are below the higher threshold value. That more than expected values are below the threshold values is in part caused by the lower empiric mean value of the duplicated loci. This lower mean value could be caused by incomplete blocking or repetitive sequences and/or by the occurrence of some clones that contain low copy repeats which will render the theoretical value for a deletion lower than 3/2.
Mosaicisms When using DNA from patients carrying mosaic segmental or chromosomal aneuploidies using array CGH, none of the datapoints derived from the aneuploid loci may reach the above-defined threshold levels. However, a series of intensity ratios will deviate from the mean value in a similar direction. Visual inspection of array CGH data can rapidly recognize the presence of non-random spreading of a series of flanking datapoints. DNA from a 100% trisomy-13 cell line DNA and a mixture containing only 20% of the trisomy-13 cell line DNA was hybridized vs DNA from a normal cell line. In Figure 3B, the presence of a trisomy 13 in 20% can be identified by viewing the intensity ratio values of all the chromosome 13 loci, which are all unidirectionally above the mean intensity ratio value. Molecular karyotyping can thus, in analogy with conventional karyotyping, readily be used to identify low-grade mosaicisms of at least 20%. However, smaller-sized mosaic segmental duplications or deletions that span only a few clones on an array will inevitably not be detected.
Because both genome-wide array CGH analysis and conventional karyotyping, based on staining chromosomes, aim to identify chromosomal aberrations by screening the genome, we propose to call this technology molecular karyotyping. Molecular karyotyping is likely to become part of the routine genetic diagnosis and to replace in part current karyotyping technologies. Advantages over conventional karyotyping include a higher resolution, direct mapping of aberrations to the genome sequence, amenability to automatization and quality control procedures and, probably, higher throughput and shorter reporting times. Essential for the successful introduction of the technology in current cytogenetic laboratories are (a) robust protocols and a good understanding of the critical technical factors, (b) quality criteria which define a successful array experiment, and (c) reporting guidelines to enable correct interpretation of the results obtained by different laboratories that includes a good knowledge of polymorphic loci in the human population.
Parameters Influencing Array CGH Quality
Quality Criteria
Current Pitfalls for Pre- and Postnatal Diagnosis: Polymorphic Loci Polymorphic clone information is likely to become integrated in the genome annotations. However, in the absence of a large-scale concerted effort, the question can be raised how such polymorphic sites in the genome will be ascertained. It is likely that continued feedback from a series of dedicated laboratories may lead to a validated database of candidate or proven genomic polymorphisms.
Future Prospects Molecular karyotyping is likely to replace, in part, current karyotyping technologies based on staining chromosomes both in pre- and postnatal diagnosis. In addition to the many advantages, molecular karyotyping also has some caveats as compared with conventional karyotyping, e.g., it fails to identify balanced translocations and ploidy variations. Because arrays rely on the occurrence of genomic copy number differences between patient and control samples, balanced translocations, which by definition do not have genomic losses or gains, cannot be detected. Also, ploidy variations are likely to escape detection by array CGH as the technique relies on normalization of the intensity ratios. Double-dye intensity derived from a triploid DNA sample would subsequently be normalized and thus not be detected. Because of these limitations, it seems likely that we will continue to enjoy the view of banded chromosomes in the foreseeable future, and that banded chromosomes will remain an invaluable tool in the genetic diagnostic laboratory.
This work was made possible by grants G.0200.03 from the FWO (Fund for Scientific ResearchFlanders) and OT/O2/40 from the University of Leuven. We would like to thank the Mapping Core and Map Finishing groups of the Wellcome Trust Sanger Institute for initial clone supply and verification.
Presented in part at the 14th Workshop on Fetal Cells and Fetal DNA: Recent Progress in Molecular Genetic and Cytogenetic Investigations for Early Prenatal and Postnatal Diagnosis, Friedrich-Schiller-University, Jena, Germany, April 1718, 2004. Received for publication June 9, 2004; accepted November 12, 2004
Albertson DG, Ylstra B, Segraves R, Collins C, Dairkee SH, Kowbel D, Kuo WL, et al. (2000) Quantitative mapping of amplicon structure by array CGH identifies CYP24 as a candidate oncogene. Nat Genet 25:144146[CrossRef][Medline] Albertson DG, Pinkel D (2003) Genomic microarrays in human genetic disease and cancer. Hum Mol Genet 12(Spec No 2):R145152 Bruder CE, Hirvela C, Tapia-Paez I, Fransson I, Segraves R, Hamilton G, Zhang XX, et al. (2001) High resolution deletion analysis of constitutional DNA from neurofibromatosis type 2 (NF2) patients using microarray-CGH. Hum Mol Genet 10:271282 Carter NP, Fiegler H, Piper J (2002) Comparative analysis of comparative genomic hybridization microarray technologies: report of a workshop sponsored by the Wellcome Trust. Cytometry 49:4348[CrossRef][Medline] Fiegler H, Carr P, Douglas EJ, Burford DC, Hunt S, Scott CE, Smith J, et al. (2003) DNA microarrays for comparative genomic hybridization based on DOP-PCR amplification of BAC and PAC clones. Genes Chromosomes Cancer 36:361374[CrossRef][Medline] Flint J, Wilkie AO, Buckle VJ, Winter RM, Holland AJ, McDermid HE (1995) The detection of subtelomeric chromosomal rearrangements in idiopathic mental retardation. Nat Genet 9:132140[CrossRef][Medline] Hand JL, Michels VV, Marinello MJ, Ketterling RP, Jalal SM (2000) Inherited interstitial deletion of chromosomes 5p and 16q without apparent phenotypic effect: further confirmation. Prenat Diagn 20:144148[CrossRef][Medline] Hodgson G, Hager JH, Volik S, Hariono S, Wernick M, Moore D, Albertson DG, et al. (2001) Genome scanning with array CGH delineates regional alterations in mouse islet carcinomas. Nat Genet 29:459464[CrossRef][Medline] Ishkanian AS, Malloff CA, Watson SK, DeLeeuw RJ, Chi B, Coe BP, Snijders A, et al. (2004) A tiling resolution DNA microarray with complete coverage of the human genome. Nat Genet 36:299303[CrossRef][Medline] Knight SJ, Regan R, Nicod A, Horsley SW, Kearney L, Homfray T, Winter RM, et al. (1999) Subtle chromosomal rearrangements in children with unexplained mental retardation. Lancet 354:16761681[CrossRef][Medline] Lucito R, Healy J, Alexander J, Reiner A, Esposito D, Chi M, Rodgers L, et al. (2003) Representational oligonucleotide microarray analysis: a high-resolution method to detect genome copy number variation. Genome Res 13:22912305 Mantripragada KK, Tapia-Paez I, Blennow E, Nilsson P, Wedell A, Dumanski JP (2004) DNA copy-number analysis of the 22q11 deletion-syndrome region using array-CGH with genomic and PCR-based targets. Int J Mol Med 13:273279[Medline] Martin CL, Waggoner DJ, Wong A, Uhrig S, Roseberry JA, Hedrick JF, Pack SD, et al. (2002) Molecular rulers" for calibrating phenotypic effects of telomere imbalance. J Med Genet 39:734740 Pinkel D, Segraves R, Sudar D, Clark S, Poole I, Kowbel D, Collins C, et al. (1998) High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nat Genet 20:207211[CrossRef][Medline] Pollack JR, Perou CM, Alizadeh AA, Eisen MB, Pergamenschikov A, Williams CF, Jeffrey SS, et al. (1999) Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nat Genet 23:4146[Medline] Riegel M, Baumer A, Jamar M, Delbecque K, Herens C, Verloes A, Schinzel A (2001) Submicroscopic terminal deletions and duplications in retarded patients with unclassified malformation syndromes. Hum Genet 109:286294[CrossRef][Medline] Rosenberg MJ, Killoran C, Dziadzio L, Chang S, Stone DL, Meck J, Aughton D, et al. (2001) Scanning for telomeric deletions and duplications and uniparental disomy using genetic markers in 120 children with malformations. Hum Genet 109:311318[CrossRef][Medline] Schaeffer AJ, Chung J, Heretis K, Wong A, Ledbetter DH, Lese MC (2004) Comparative genomic hybridization-array analysis enhances the detection of aneuploidies and submicroscopic imbalances in spontaneous miscarriages. Am J Hum Genet 74:11681174[CrossRef][Medline] Schoumans J, Anderlid BM, Blennow E, Teh BT, Nordenskjold M (2004) The performance of CGH array for the detection of cryptic constitutional chromosome imbalances. J Med Genet 41:198202 Sebat J, Lakshmi B, Troge J, Alexander J, Young J, Lundin P, Maner S, et al. (2004) Large-scale copy number polymorphism in the human genome. Science 305:525528 Shaw-Smith C, Redon R, Rickman L, Rio M, Willatt L, Fiegler H, Firth H, et al. (2004) Microarray based comparative genomic hybridisation (array-CGH) detects submicroscopic chromosomal deletions and duplications in patients with learning disability/mental retardation and dysmorphic features. J Med Genet 41:241248 Slavotinek A, Rosenberg M, Knight S, Gaunt L, Fergusson W, Killoran C, Clayton-Smith J, et al. (1999) Screening for submicroscopic chromosome rearrangements in children with idiopathic mental retardation using microsatellite markers for the chromosome telomeres. J Med Genet 36:405411 Snijders AM, Nowak N, Segraves R, Blackwood S, Brown N, Conroy J, Hamilton G, et al. (2001) Assembly of microarrays for genome-wide measurement of DNA copy number. Nat Genet 29:263264[CrossRef][Medline] Solinas-Toldo S, Lampel S, Stilgenbauer S, Nickolenko J, Benner A, Dohner H, Cremer T, et al. (1997) Matrix-based comparative genomic hybridization: biochips to screen for genomic imbalances. Genes Chromosomes Cancer 20:399407[CrossRef][Medline] Starke H, Nietzel A, Weise A, Heller A, Mrasek K, Belitz B, Kelbova C, et al. (2003) Small supernumerary marker chromosomes (SMCs): genotype-phenotype correlation and classification. Hum Genet 114:5167[CrossRef][Medline] Schwaenen C, Nessling M, Wessendorf S, Salvi T, Wrobel G, Radlwimmer B, Kestler HA, et al. (2004) Automated array-based genomic profiling in chronic lymphocytic leukemia: development of a clinical tool and discovery of recurrent genomic alterations. Proc Natl Acad Sci USA 101:10391044 Van Buggenhout G, Melotte C, Dutta B, Froyen G, Van Hummelen P, Marynen P, Matthijs G, et al. (2004) Mild Wolf-Hirschhorn syndrome: microarray CGH analysis of atypical 4p16.3 deletions enables refinement of the genotype-phenotype map. J Med Genet 41:691698 Vissers LE, de Vries BB, Osoegawa K, Janssen IM, Feuth T, Choy CO, Straatman H, et al. (2003) Array-based comparative genomic hybridization for the genomewide detection of submicroscopic chromosomal abnormalities. Am J Hum Genet 73:12611270[CrossRef][Medline]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||