Mutations

A mutation is a change in genetic information. Since genetic information is encoded by the order of the nucleotide bases of DNA, adenine (A), thymine (T), guanine (G), and cytosine (C), a mutation represents some sort of change in that order. Mutations may occur in both somatic and sex cells. Only mutations that occur in sex cells can be passed from parent to offspring.

Point Mutations or SNPs

A point mutation, or SNP (single nucleotide polymorphism), involves the substitution of a wrong base during replication. As the enzyme DNA polymerase moves along one strand of a DNA molecule, forming base pairs to build a new complementary strand, it occasionally adds the wrong base. Below is a small portion of the DNA sequence that codes for the protein hemoglobin. The top row is the normal sequence, which produces normal hemoglobin. The bottom row shows a single nucleotide polymorphism, a change from A to T, that produces the mutated form of hemoglobin responsible for sickle cell anemia.

GTGCATCTGACTCCTGAGGAGAAGTCT
GTGCATCTGACTCCTGTGGAGAAGTCT
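
The comparison of the two rows above can be sketched in a few lines of Python. This is only an illustration, not a real variant-detection method: the function name `find_substitutions` is ours, and the code assumes the fragment shown starts on a codon boundary.

```python
NORMAL = "GTGCATCTGACTCCTGAGGAGAAGTCT"
SICKLE = "GTGCATCTGACTCCTGTGGAGAAGTCT"


def find_substitutions(reference, variant):
    """Return (position, reference_base, variant_base) for each mismatch.

    Positions are 0-based. Both sequences must have the same length,
    so this detects substitutions only, not insertions or deletions.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    return [
        (i, r, v)
        for i, (r, v) in enumerate(zip(reference, variant))
        if r != v
    ]


differences = find_substitutions(NORMAL, SICKLE)

# Locate the three-base codon that contains the changed base
# (assumes the fragment begins at the start of a codon).
position, ref_base, var_base = differences[0]
codon_start = position - position % 3
normal_codon = NORMAL[codon_start:codon_start + 3]
sickle_codon = SICKLE[codon_start:codon_start + 3]

print(differences, normal_codon, sickle_codon)
```

Under these assumptions the two rows differ at exactly one position, and the affected codon changes from GAG to GTG.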

Fortunately, DNA polymerase makes very few errors and it corrects most of those quickly. In addition, there are other enzymes that follow along and "proofread" the nucleotides to be certain that the new nucleotides are actually complementary to the template strand. Any misfits are booted out and replaced with the proper base. Thanks to this magnificent system, DNA is consistently replicated with less than one mistake per billion nucleotides.

Nonetheless, this type of mutation does occur. It is rare but it is responsible for many subtle and not so subtle variations found within and among species.

There are three possible outcomes when a single nucleotide is changed by mistake during DNA replication in the formation of a sex cell. Surprisingly, the "mistake" may have no effect on the organism. For one thing, the genetic code is such that many amino acids are represented by more than one codon (see also Gene Expression).

Each individual codon is a three-nucleotide sequence. In many instances, there are several codons that code for the same amino acid. These codons frequently differ by only one nucleotide.

23andme (18 April 2010) Genetics 101: What are SNPs? [Video file] retrieved from https://youtu.be/tJjXpiWKMyA

For example, the triplet codons for the amino acid isoleucine are AUU, AUC, and AUA. (AUG, which differs from them only in its last base, codes for methionine.) Swapping the last nucleotide of the codon among U, C, and A would result in no change in the resulting protein, because isoleucine would be inserted in the protein chain in each case. If the substitution affects a less critical region of the protein or occurs in a noncoding region of DNA, there may also be no discernible effect.

On the other hand, an error which changes the first base of the codon to either a U, C, or G would cause the wrong amino acid to be inserted in place of isoleucine. For example, a substitution of a G for the first A in the codon would result in insertion of the amino acid valine instead of isoleucine.
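
The isoleucine example can be checked against the standard genetic code. The sketch below builds the usual 64-codon RNA table and labels a single-base substitution by its effect on the amino acid. The function name `classify_substitution` and the labels "silent", "missense", and "stop" are our own choices for illustration; amino acids are given as one-letter codes (I = isoleucine, V = valine, M = methionine).

```python
# Standard genetic code, written in the conventional U, C, A, G order.
# "*" marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def classify_substitution(codon, position, new_base):
    """Replace one base (0-based position) and describe the outcome.

    Returns (mutated_codon, new_amino_acid, kind).
    """
    mutated = codon[:position] + new_base + codon[position + 1:]
    before = CODON_TABLE[codon]
    after = CODON_TABLE[mutated]
    if after == before:
        kind = "silent"      # same amino acid, no change in the protein
    elif after == "*":
        kind = "stop"        # protein is cut short
    else:
        kind = "missense"    # a different amino acid is inserted
    return mutated, after, kind


print(classify_substitution("AUU", 2, "C"))  # last base changed
print(classify_substitution("AUU", 0, "G"))  # first base changed
```

Changing the last base of AUU to C leaves isoleucine in place, while changing the first base to G gives GUU, which codes for valine.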

The insertion of the wrong amino acid in a functional region of a protein may cause the protein to be so severely misshapen that it cannot function, even to the point of causing the death of the organism. The function of normal human red blood cells, which are disk-shaped, is to transport oxygen from the lungs to the other organs of the body. Each red blood cell contains millions of molecules of hemoglobin, which carry the oxygen. A slight change in the order of the amino acids in the beta chain of hemoglobin (valine substituted for glutamic acid), which has only 146 amino acids, causes sickle-cell disease. Abnormal hemoglobin molecules stick together and crystallize, deforming the red blood cells. The deformed blood cells then clog tiny blood vessels, impeding the flow of blood. Sickle-cell disease affects about 100,000 people in the US.

And finally, there are the rare substitutions that are actually beneficial, causing the protein to function in such a way as to give the organism a survival advantage.

SNPs are the type of mutation reported in autosomal and some kinds of Y chromosome DNA testing for genealogy.

SNPs are so infrequent that it is reasonable to assume they have occurred at any particular position in the genome only once in the course of human evolution.

Insertions or Deletions (Indels)

Another type of mutation involves the insertion or deletion of one or more nucleotides in a DNA sequence. When the number of nucleotides inserted or deleted is not a multiple of three, the result is known as a frameshift mutation, because every codon after the change is read in the wrong frame. For an illustration of how devastating this type of mutation can be if it occurs in the coding region of a gene, delete the w from the sentence below while keeping the word breaks where they are.

The cow jumped over the moon.
becomes
The coj umpedo vert hem oon.
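
The same effect can be shown on DNA itself. The sketch below reuses the hemoglobin fragment from earlier on this page, splits it into three-base codons, and compares the result of deleting one base with the result of deleting three. The helper names `codons` and `delete_bases` and the choice of deletion position are ours, and the fragment is assumed to start on a codon boundary.

```python
def codons(sequence):
    """Split a sequence into complete three-base codons, dropping leftovers."""
    usable = len(sequence) - len(sequence) % 3
    return [sequence[i:i + 3] for i in range(0, usable, 3)]


def delete_bases(sequence, start, count):
    """Remove `count` bases beginning at 0-based index `start`."""
    return sequence[:start] + sequence[start + count:]


normal = "GTGCATCTGACTCCTGAGGAGAAGTCT"

one_deleted = codons(delete_bases(normal, 6, 1))    # frameshift
three_deleted = codons(delete_bases(normal, 6, 3))  # reading frame preserved

print(codons(normal))
print(one_deleted)
print(three_deleted)
```

Deleting a single base changes every codon from that point on, while deleting three bases removes exactly one codon and leaves the rest of the reading frame intact.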

The insertion of nucleotides in multiples of three, if not corrected by proofreading and repair of the DNA, will cause the insertion of an extra amino acid for each three additional nucleotides. Trinucleotide repeats are sequences of three nucleotides that repeat in tandem and vary in the number of repeats. Trinucleotide repeat mutations are known to cause at least eight genetic disorders affecting the nervous or neuromuscular system.

You can read more about repeating nucleotide patterns in the section about Y chromosome DNA testing for genealogy.

YAP, an Alu insertion

One insertion particularly useful in population studies is the YAP, which stands for "Y chromosome Alu polymorphism." Alu is a sequence of approximately 300 base pairs which has inserted itself into a particular region of the DNA. There have been more than a million Alu insertions in human DNA; YAP is one of the more recent.

Unstable indels and SNPs are relatively rare.

Short Tandem Repeats (STRs)

Only about five percent of human DNA is actually thought to code for traits. Most of the rest is made of long stretches of nucleotide base pairs whose function is not known. Within these stretches are short, moderately repetitive base pair sequences. The number of repeats is inherited and is easily detectable, making these sequences ideal identifying markers. The number of repeating units can occasionally change during evolution and descent. Though these changes are rare, they happen frequently enough to make them useful markers for familial relationships.

There are two types of these repetitive sequences. VNTRs (variable number tandem repeats) are repeated sequences that typically range from 10 to 80 base pairs (bp) in length. These occur fairly frequently in the human genome, but there are relatively few different types among the human population.

Short Tandem Repeat (STR) sequences (sometimes called microsatellites) are much shorter (2-10 bp) and may be repeated as many as 100 times at a given location on a chromosome. The human genome contains hundreds of thousands of these STRs evenly distributed on all the chromosomes. STRs represent ideal markers for genetic typing for genealogy because of their rich diversity (polymorphism) and wide distribution. As a further advantage, they are technically somewhat easier to characterize than VNTRs.

Here is a simplified example. Humans have two sets of 23 chromosomes, one set from their mother and one set from their father. So, for example, an individual, Thelma, might inherit a chromosome #17 marker with a short sequence of four base pairs repeated eight times from her mother, Ethel, and the same sequence repeated three times from her father, Art.

To illustrate:
Maternal chromosome #17 GATCGATCGATCGATCGATCGATCGATCGATC
Paternal chromosome #17 GATCGATCGATC
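
Reading off the repeat numbers for Thelma's marker can be sketched as follows. The function name `count_tandem_repeats` is ours, and the code assumes the sequence begins exactly at the first repeat; real STR typing is done in the laboratory from fragment lengths rather than by reading the bases like this.

```python
def count_tandem_repeats(sequence, unit):
    """Count how many times `unit` repeats back to back from the start."""
    if not unit:
        raise ValueError("repeat unit must not be empty")
    count = 0
    # Check each successive slot of len(unit) bases for another copy.
    while sequence.startswith(unit, count * len(unit)):
        count += 1
    return count


maternal = "GATCGATCGATCGATCGATCGATCGATCGATC"
paternal = "GATCGATCGATC"

# The pair of repeat counts is what a test would report for this marker.
thelma_marker = (
    count_tandem_repeats(maternal, "GATC"),
    count_tandem_repeats(paternal, "GATC"),
)
print(thelma_marker)
```

For the two sequences above, the result is the pair of repeat counts described in the text: eight from Ethel and three from Art.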

In real life, more than one STR must be analyzed to establish a person's identity. An STR marker from DNA found at the scene of a crime may match one marker of a suspect. However, there will most likely be thousands of unrelated people with the same pattern for that one marker. Increasing the number of markers examined increases the chances of an accurate identification. Matches on three selected STRs give odds of more than 2000 to 1 that the DNA samples are from the same person. Using nine STRs gives odds of more than 1 billion to 1.
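
The reason the odds climb so quickly is that the chance of a coincidental match is multiplied across markers. The sketch below illustrates that arithmetic only. It assumes the markers are inherited independently, and the per-marker frequency of 0.075 is a made-up round number chosen for illustration, not a measured value for any real marker.

```python
from math import prod

# Hypothetical: fraction of unrelated people who share a given pattern
# at one marker. Real frequencies differ from marker to marker.
HYPOTHETICAL_PATTERN_FREQUENCY = 0.075


def chance_of_coincidental_match(pattern_frequencies):
    """Multiply per-marker frequencies (assumes independent markers)."""
    return prod(pattern_frequencies)


def odds_against_coincidence(marker_count):
    """Return N such that the odds are roughly 'N to 1'."""
    frequencies = [HYPOTHETICAL_PATTERN_FREQUENCY] * marker_count
    return 1 / chance_of_coincidental_match(frequencies)


for marker_count in (1, 3, 9):
    print(marker_count, round(odds_against_coincidence(marker_count)))
```

With this illustrative frequency, one marker gives odds of only about 13 to 1, three markers give more than 2000 to 1, and nine markers give more than 1 billion to 1.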

In 1997, the FBI announced the selection of 13 STR markers to be used in forensic investigations. If any two samples of DNA obtained from different sources (say a crime scene and a suspect) have matching numbers of repeats at all 13 markers, it is virtually certain they are from the same person. Conversely, and as important, if the markers do not match, it can be said with complete confidence that the samples are from two different individuals.
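
The matching rule described above can be sketched as a simple comparison of repeat counts. TH01, TPOX, and FGA are three of the 13 markers; only three are shown to keep the example short, and all of the repeat counts and the function name `profiles_match` are invented for illustration.

```python
def profiles_match(profile_a, profile_b):
    """Return True only if both repeat counts agree at every marker.

    Each profile maps a marker name to a pair of repeat counts, one per
    chromosome. The order within a pair does not matter.
    """
    if profile_a.keys() != profile_b.keys():
        raise ValueError("profiles must be typed at the same markers")
    return all(
        sorted(profile_a[marker]) == sorted(profile_b[marker])
        for marker in profile_a
    )


# Hypothetical repeat counts, not real case data.
crime_scene = {"TH01": (6, 9), "TPOX": (8, 8), "FGA": (21, 24)}
suspect_one = {"TH01": (9, 6), "TPOX": (8, 8), "FGA": (24, 21)}
suspect_two = {"TH01": (6, 9), "TPOX": (8, 8), "FGA": (21, 22)}

print(profiles_match(crime_scene, suspect_one))
print(profiles_match(crime_scene, suspect_two))
```

A single marker that differs, as with the second suspect here, is enough to rule out a match.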

The markers in this set are the CODIS markers (CODIS is an acronym for Combined DNA Index System).

The Different Types of Mutations

Khan Academy khanacademymedicine (2 Apr 2014) The Different Types of Mutations | Biomolecules | MCAT | Khan Academy [Video file] retrieved from https://youtu.be/xYOK-yzUWSI
