Emerging Infectious Diseases(3881 bytes)
Current Issue (847 bytes)Current Issue | Upcoming Issue | Past Issue | Search | Home (1779 bytes)
Volume 4 No. 1 / January - March 1998 (1625 bytes)


Download Article - | ASCII Text | PDF | Postscript (1016 bytes)

Perspectives

Accommodating Error Analysis in Comparison and Clustering of Molecular Fingerprints

Hugh Salamon,* Mark R. Segal,* Alfredo Ponce de Leon,† and Peter M. Small‡
*University of California, San Francisco, California, USA; †Instituto Nacional Nutrición, Zubriran, Mexico City, Mexico; and ‡Stanford University Medical Center, Stanford, California, USA


sala1t.gif (2018 bytes)
Back to article
Figure 1. The align-and-count method finds the maximum number of mutually closest bands within a threshold deviation value Delta.gif (855 bytes), for a search across a range S of scaling values. The two lanes are scaled incrementally, thus searching for the best alignment.

sala2t.gif (4253 bytes)
Back to article
Figure 2. Means and two standard errors of the mean error bars for pairwise comparisons among 116 12-banded H37Rv lanes show that error is consistently larger when comparing lanes between gels than when comparing lanes from the same gel. The x-axis corresponds to w(b), and the y-axis to d(b), as presented in the text. It is evident that error is proportional to fragment length in the range of fragment lengths found in H37Rv. The data exhibit 2% to 3% error for between gel comparisons, but only approximately 1% error on average for within gel comparisons.

sala3.gif (4408 bytes)
Back to article
Figure 3. Additional alignment of very similar patterns can identify clearly distinct patterns. Measurement noise obscures the detailed relationships between 26 patterns that were identified from 1,335 as being very similar. However, after alignment to a consensus pattern, a clearly distinct pattern (an outlier from the other members of this autocluster) can be readily identified. Fragment lengths are given in kilobasepairs (kb).

sala4a.gif (22101 bytes)
Back to article
Figure 4. Histograms of the fragment lengths for 84 two-banded patterns connected by identity (autoclustered with in-house software) exhibit enough spread in values to make detecting outliers and band shifts difficult (a,b). Aligning the 84 lanes to the mean-value lane for this collection reveals that the lanes do not align well, but instead shows bimodal distributions for the fragment lengths (c,d). Dividing the 84 fingerprints into two sets and separating the distinct distributions detected when aligning all 84 fingerprints show that 26 fingerprints align well to their mean-value lane (e,f), and the remaining 58 also align well to their respective mean value lane (g,h). The smaller fragment length fragment does not appear shifted between the two sets of 2-banders (comparing e to g), but the larger fragment is clearly shifted (comparing f to h).

sala5t.gif (1621 bytes)
Back to article
Figure 5. Prior to alignment of two sets of 2-banders, lanes are difficult to cluster (lanes a-d are from the distributions in Figures 4a and 4b). Subsequent to alignment, lanes are much easier to cluster (lanes a' and b' are specific examples from the distributions in Figures 4e and 4f; lanes c' and d' likewise correspond to Figures 4g and 4h). Fragment lengths are given in kilobasepairs (kb).

bottmbar.gif (72 bytes)
Top of Page  | Current Issue  |  Upcoming Issue  |  Past Issue  |  Search   |  Home

NCID Home
CDC Home

Emerging Infectious Diseases
National Center for Infectious Diseases
Centers for Disease Control and Prevention
Atlanta, GA

URL: http://www.cdc.gov/ncidod/EID/vol4no2/salamong.htm