Dendrogram

A dendrogram is a diagram that represents a tree graph together with a similarity metric, based on numerical taxonomy methods. It is frequently used in several contexts:
- in hierarchical clustering, it illustrates the arrangement of the clusters produced by the corresponding analyses.[2]
- in computational biology, it shows the clustering of genes or samples, sometimes in the margins of heatmaps.[3]
- in phylogenetics, it displays the evolutionary relationships among various biological taxa. In this case, the dendrogram is also called a phylogenetic tree.[4]
The name dendrogram derives from the two Ancient Greek words δένδρον (déndron), meaning "tree", and γράμμα (grámma), meaning "drawing, mathematical figure".[5][6] A typical example is shown below:

The figure shows the dendrogram produced by hierarchical clustering of marine provinces based on the presence or absence of sponge species.[7]
History and methods
The diagrams were popularized, and their methods refined, by Robert R. Sokal and Peter H. A. Sneath in the 1960s as part of the tools of numerical taxonomy. Although the dendrogram arose from the needs of biological taxonomy, it was adopted in statistics as a method of data visualization for cluster analysis.
Today dendrograms are key tools in hierarchical clustering, and the methods used to build them are known as cluster linkage.
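To make the idea of linkage concrete, the sketch below computes the distance between two clusters under three commonly used criteria: single, complete, and average linkage. The function name `cluster_distance`, the helper `abs_diff`, and the one-dimensional sample points are illustrative assumptions, not taken from any particular library or from the sources cited here.

```python
from typing import Callable, Sequence


def cluster_distance(
    left: Sequence[float],
    right: Sequence[float],
    dist: Callable[[float, float], float],
    linkage: str = "average",
) -> float:
    """Distance between two clusters under a given linkage criterion.

    Hypothetical helper for illustration only.
    """
    # All distances between a member of `left` and a member of `right`.
    pairwise = [dist(x, y) for x in left for y in right]
    if linkage == "single":      # closest pair of members
        return min(pairwise)
    if linkage == "complete":    # farthest pair of members
        return max(pairwise)
    if linkage == "average":     # mean over all cross-cluster pairs
        return sum(pairwise) / len(pairwise)
    raise ValueError(f"unknown linkage: {linkage!r}")


def abs_diff(x: float, y: float) -> float:
    """Distance between two points on a line."""
    return abs(x - y)


# Two small clusters of made-up one-dimensional observations.
left, right = [0.0, 1.0], [4.0, 6.0]
results = {
    name: cluster_distance(left, right, abs_diff, name)
    for name in ("single", "complete", "average")
}
print(results)  # {'single': 3.0, 'complete': 6.0, 'average': 4.5}
```

The same pair of clusters is 3.0, 6.0, or 4.5 apart depending on the criterion, which is why the choice of linkage can change both the order of the merges and the node heights drawn in the dendrogram.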
Other examples
Diagram variations.
Phylogenetic
A dendrogram of the Tree of Life. The figure has no metric axis; only the approximate distance along the tree is used to show evolutionary order.

This phylogenetic tree is adapted from the rRNA analysis of Woese et al.[8] The vertical line at the bottom represents the last universal common ancestor (LUCA).
Clustering
For a clustering example, suppose that five taxa have been clustered by UPGMA based on a matrix of genetic distances. The matrix is shown as a heatmap of RNA-Seq data, with two dendrograms in the left and top margins.

The hierarchical clustering dendrogram would show a column of five nodes representing the initial data (here, individual taxa), while the remaining nodes represent the clusters to which the data belong, with the arrows representing the distance (dissimilarity). The distance between merged clusters is monotone, increasing with the level of the merger: the height of each node in the plot is proportional to the value of the intergroup dissimilarity between its two daughters (the nodes on the right, which represent individual observations, are all plotted at zero height).
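The merge order and node heights described above can be reproduced with a small pure-Python sketch of UPGMA (average linkage). The taxon labels `a` to `e` and the distance values are illustrative assumptions, not data from the figure, and the function `upgma` is a hypothetical helper written for this example rather than a library call.

```python
from itertools import combinations

# Hypothetical pairwise genetic distances between five taxa (assumed values).
labels = ["a", "b", "c", "d", "e"]
raw = {
    ("a", "b"): 17, ("a", "c"): 21, ("a", "d"): 31, ("a", "e"): 23,
    ("b", "c"): 30, ("b", "d"): 34, ("b", "e"): 21,
    ("c", "d"): 28, ("c", "e"): 39,
    ("d", "e"): 43,
}
dist = {frozenset(pair): d for pair, d in raw.items()}


def upgma(labels, dist):
    """Return the merges as (left_members, right_members, dissimilarity)."""
    # Every observation starts as its own cluster (a leaf at height zero).
    clusters = [frozenset([x]) for x in labels]

    def average(c1, c2):
        # Unweighted mean of all distances between members of c1 and c2.
        total = sum(dist[frozenset((x, y))] for x in c1 for y in c2)
        return total / (len(c1) * len(c2))

    merges = []
    while len(clusters) > 1:
        # Join the two clusters with the smallest average dissimilarity.
        left, right = min(combinations(clusters, 2), key=lambda p: average(*p))
        merges.append((sorted(left), sorted(right), average(left, right)))
        clusters = [c for c in clusters if c not in (left, right)]
        clusters.append(left | right)
    return merges


merges = upgma(labels, dist)
for left, right, d in merges:
    print(left, "+", right, "at dissimilarity", d)

# Dissimilarity at which each internal node is formed, in merge order.
heights = [d for _, _, d in merges]
```

With these assumed distances the four internal nodes are formed at dissimilarities 17, 22, 28, and 33, so the heights increase with each merger, as the monotonicity property requires.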
See also
- Cladogram
- Distance matrices in phylogeny
- Hierarchical clustering
- MEGA, freeware for drawing dendrograms
- yEd, freeware for drawing and automatically arranging dendrograms
- Taxonomy
- Numerical taxonomy
References
Citations
- Swofford DL, Olsen GJ, Waddell PJ, Hillis DM (1996). "Phylogenetic inference". In Hillis DM, Moritz C, Mable BK (eds.). Molecular Systematics, 2nd edition. Sunderland, MA: Sinauer. pp. 407–514. ISBN 9780878932825.
- Everitt, Brian (1998). Dictionary of Statistics. Cambridge, UK: Cambridge University Press. p. 96. ISBN 0-521-59346-8.
- Wilkinson, Leland; Friendly, Michael (May 2009). "The History of the Cluster Heat Map". The American Statistician. 63 (2): 179–184. CiteSeerX 10.1.1.165.7924. doi:10.1198/tas.2009.0033. S2CID 122792460.
- "Phylogenetic tree (biology)". Encyclopedia Britannica. Retrieved 2018-10-22.
- Bailly, Anatole (1981-01-01). Abrégé du dictionnaire grec français. Paris: Hachette. ISBN 2010035283. OCLC 461974285.
- Bailly, Anatole. "Greek-french dictionary online". www.tabularium.be. Retrieved October 20, 2018.
- Van Soest R, Boury-Esnault N, Vacelet J, Dohrmann M, Erpenbeck D, De Voogd N, Santodomingo N, Vanhoorne B, Kelly M, Hooper J (2012). "Global Diversity of Sponges (Porifera)". PLOS ONE. 7 (4): e35105. Bibcode:2012PLoSO...735105V. doi:10.1371/journal.pone.0035105. PMC 3338747. PMID 22558119.
- Woese, Carl R.; Kandler, O; Wheelis, M (1990). "Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya" (PDF). Proc Natl Acad Sci USA. 87 (12): 4576–4579. Bibcode:1990PNAS...87.4576W. doi:10.1073/pnas.87.12.4576. PMC 54159. PMID 2112744.
Sources
- Galili, T. (2015). "dendextend: an R package for visualizing, adjusting and comparing trees of hierarchical clustering". Bioinformatics. 31 (22): 3718–3720. doi:10.1093/bioinformatics/btv428. PMC 4817050. PMID 26209431.
External links
- Iris dendrogram: an example of using a dendrogram to visualize the three clusters obtained from hierarchical clustering with the "complete" method, compared with the real species categories (using R).