What a phylogenetic tree is. How to read phylogenetic trees and determine which species are most related.
- A phylogenetic tree is a diagram that represents evolutionary relationships among organisms. Phylogenetic trees are hypotheses, not definitive facts.
- The pattern of branching in a phylogenetic tree reflects how species or other groups evolved from a series of common ancestors.
- In trees, two species are more related if they have a more recent common ancestor and less related if they have a less recent common ancestor.
- Phylogenetic trees can be drawn in various equivalent styles. Rotating a tree about its branch points doesn't change the information it carries.
Humans as a group are big on organizing things. Not necessarily things like closets or rooms; I personally score low on the organization front for both of those things. Instead, people often like to group and order the things they see in the world around them. Starting with the Greek philosopher Aristotle, this desire to classify has extended to the many and diverse living things of Earth.
Most modern systems of classification are based on evolutionary relationships among organisms – that is, on the organisms’ phylogeny. Classification systems based on phylogeny organize species or other groups in ways that reflect our understanding of how they evolved from their common ancestors.
In this article, we'll take a look at phylogenetic trees, diagrams that represent evolutionary relationships among organisms. We'll see exactly what we can (and can't!) infer from a phylogenetic tree, as well as what it means for organisms to be more or less related in the context of these trees.
Anatomy of a phylogenetic tree
When we draw a phylogenetic tree, we are representing our best hypothesis about how a set of species (or other groups) evolved from a common ancestor
. As we'll explore further in the article on building trees, this hypothesis is based on information we’ve collected about our set of species – things like their physical features and the DNA sequences of their genes.
In a phylogenetic tree, the species or groups of interest are found at the tips of lines referred to as the tree's branches. For example, the phylogenetic tree below represents relationships between five species, A, B, C, D, and E, which are positioned at the ends of the branches:
The pattern in which the branches connect represents our understanding of how the species in the tree evolved from a series of common ancestors. Each branch point (also called an internal node) represents a divergence event, or splitting apart of a single group into two descendant groups.
At each branch point lies the most recent common ancestor of all the groups descended from that branch point. For instance, at the branch point giving rise to species A and B, we would find the most recent common ancestor of those two species. At the branch point right above the root of the tree, we would find the most recent common ancestor of all the species in the tree (A, B, C, D, E).
Each horizontal line in our tree represents a series of ancestors, leading up to the species at its end. For instance, the line leading up to species E represents the species' ancestors since it diverged from the other species in the tree. Similarly, the root represents a series of ancestors leading up to the most recent common ancestor of all the species in the tree.
Which species are more related?
In a phylogenetic tree, the relatedness of two species has a very specific meaning. Two species are more related if they have a more recent common ancestor, and less related if they have a less recent common ancestor.
We can use a pretty straightforward method to find the most recent common ancestor of any pair or group of species. In this method, we start at the branch ends carrying the two species of interest and “walk backwards” in the tree until we find the point where the species’ lines converge.
For instance, suppose that we wanted to say whether A and B or B and C are more closely related. To do so, we would follow the lines of both pairs of species backward in the tree. Since A and B converge at a common ancestor first as we move backwards, and B only converges with C after its junction point with A, we can say that A and B are more related than B and C.
Importantly, there are some species whose relatedness we can't compare using this method. For instance, we can't say whether A and B are more closely related than C and D. That’s because, by default, the horizontal axis of the tree doesn't represent time in a direct way. So, we can only compare the timing of branching events that occur on the same lineage (same direct line from the root of the tree), and not those that occur on different lineages.
Some tips for reading phylogenetic trees
You may see phylogenetic trees drawn in many different formats. Some are blocky, like the tree at left below. Others use diagonal lines, like the tree at right below. You may also see trees of either kind oriented vertically or flipped on their sides, as shown for the blocky tree.
The three trees above represent identical relationships among species A, B, C, D, and E. You may want to take a moment to convince yourself that this is really the case – that is, that no branching patterns or recent-ness of common ancestors are different between the two trees. The identical information in these different-looking trees reminds us that it's the branching pattern (and not the lengths of branches) that's meaningful in a typical tree.
Another critical point about these trees is that if you rotate the structures, using one of the branch points as a pivot, you don’t change the relationships. So just like the two trees above, which show the same relationships even though they are formatted differently, all of the trees below show the same relationships among four species:
If you don’t see right away how that is true (and I didn’t, on first read!), just concentrate on the relationships and the branch points rather than on the ordering of species (W, X, Y, and Z) across the tops of the diagrams. That ordering actually doesn’t give us useful information. Instead, it’s the branch structure of each diagram that tells us what we need to understand the tree.
So far, all the trees we've looked at have had nice, clean branching patterns, with just two lineages (lines of descent) emerging from each branch point. However, you may see trees with a polytomy (poly, many; tomy, cuts), meaning a branch point that has three or more different species coming off of it
. In general, a polytomy shows where we don't have enough information to determine branching order.
If we later get more information about the species in a tree, we may be able to resolve a polytomy using the new information.
Where do these trees come from?
To generate a phylogenetic tree, scientists often compare and analyze many characteristics of the species or other groups involved. These characteristics can include external morphology (shape/appearance), internal anatomy, behaviors, biochemical pathways, DNA and protein sequences, and even the characteristics of fossils.
To build accurate, meaningful trees, biologists will often use many different characteristics (reducing the chances of any one imperfect piece of data leading to a wrong tree). Still, phylogenetic trees are hypotheses, not definitive answers, and they can only be as good as the data available when they're made. Trees are revised and updated over time as new data becomes available and can be added to the analysis. This is particularly true today, as DNA sequencing increases our ability to compare genes between species.
In the next article on building a tree, we’ll see concrete examples of how different types of data are used to organize species into phylogenetic trees.
Want to join the conversation?
- Can a phylogenetic tree show which organism is more evolved, if they evolved at the same nod?(15 votes)
- Look at (or make) a tree showing your family going back at least to your grandparents.
First question: What does this tell you about people in your family?
Phylogenetic trees are really very similar, but for species rather than individuals within a family.
Second question: What do you mean by "more evolved"?
Does this help?(18 votes)
- What phylogenetic trees can and can’t tell us(5 votes)
- One example that comes to mind is that a phylogenetic tree determines where two organisms diverged from their common ancestors but not specifically when. These diagrams are not chronological in a direct way, more so a before and after situation. (Hope I helped, correct me if I am wrong)(14 votes)
- how to represent an extinct specie in a tree diagram(6 votes)
- In the phylogenetic tree containing A,B,C,D,E, what is the closest relative to E?(1 vote)
- E's closest relative is whatever species is at the first node (the first 2 branches that extend from the root, or trunk)(3 votes)
- how did a common whale evolve from a common ancestor?(4 votes)
- I'm not sure what you mean by a "common whale".
The following has information on whale evolution:
If that doesn't help, can you please clarify your question?(6 votes)
- how does phylogenetic classification related to phenetic classification?(2 votes)
- Phenetic speciation means classification of species by appearance alone. For example if two frogs look similar they are called a species even if they cannot mate.
Phylogenetic classification does consider appearance and phenotype, but it also goes much further in terms of looking at functionality, and comparison of genomes. Phylogentics explains an organisms evolutionary history. This is a more rigorous system than phenetics.(6 votes)
- How do you know if the species on Phylogenetic Trees are extinct or not?(3 votes)
- On a phylogenetic tree, some extinct species will have a mark on it that shows how it's extinct. From what I know, that mark is just usually a line marked on the branch.(4 votes)
- How would you draw a phylogenetic tree given simple DNA sequences between species?(1 vote)
- It is a difficult task. What you are asking is phylogenetic reconstruction from genomic sequence analysis. There are some ways to do this. One obvious way is to consider two species closest if they match at more base pairs. But what is difficult is to decide if one arose from the other or if they are at the same level, arising from a different common ancestor.
All of this is very difficult and many algorithms are available, especially since genomic data itself is large, complex (different kinds of genomic data is available - RNA, DNA, Methylated DNA etc) and also based on what we know of the function of the genes (difference in functional gene is a more significant difference than the difference in non-functional genes).
If you are interested, look up maximum-parsimony methods of phylogenetic tree reconstructions.(8 votes)
- What are the characters used to determine the most accurate evolutionary trees?(2 votes)
- The most accurate phylogenetic tree will have the fewest nodes. It's something called parsimony which means that the best tree is the simplest.(5 votes)
- Can someone explain to me the process when a new species will emerge on the tree? Thank you(2 votes)
- Speciation is a huge topic and still being researched — I recommend starting with the following material on KhanAcademy:
You can also browse through the KhanAcademy material on evolution to learn more: