Kinnu

Gene Theory

DNA

As we continue exploring the fundamental concepts of biology, our focus now shifts to the mechanisms of heredity and how traits are passed down, building on our earlier exploration of cell structure and the role of DNA and RNA in encoding life’s information.

In this tile on Gene Theory, we'll first examine DNA: its structure, replication, and how mutations lead to genetic variation.

Deoxyribonucleic acid (DNA) is the molecule that carries the genetic instructions used in the growth, development, functioning, and reproduction of all known living organisms and many viruses.

DNA strands (CC0) <http://creativecommons.org/publicdomain/zero/1.0/deed.en>, via Wikimedia Commons

To put it simply, DNA is like a set of instructions or a blueprint that tells an organism how to grow, develop, and function.

DNA’s role in biology cannot be overstated; it is the chemical basis of heredity, guiding the biological processes that maintain life from one generation to the next.

The discovery of DNA as the hereditary material dates back to the mid-20th century, but its significance wasn’t fully understood right away.

Initially, many scientists believed that proteins, not DNA, were responsible for heredity because proteins were more complex. However, as research progressed, it became clear that DNA played a crucial role in passing genetic information from one generation to the next.

The first step in understanding this came from the use of X-ray crystallography. In the early 1950s, Rosalind Franklin, working at King’s College London, captured critical images of DNA using this technique. Her most famous image, known as "Photo 51," was taken in 1952 and provided crucial evidence that DNA had a helical structure.

Rosalind Franklin by MRC Laboratory of Molecular Biology (CC BY-SA 4.0) <https://creativecommons.org/licenses/by-sa/4.0>, via Wikimedia Commons

The story took a controversial turn when Maurice Wilkins, a colleague of Franklin’s at King’s College, shared her X-ray data with James Watson and Francis Crick without her knowledge or permission. Watson and Crick, who were working at the University of Cambridge, used Franklin’s data—combined with their own research—to build the first accurate model of the DNA double helix in 1953.

While Watson, Crick, and Wilkins were awarded the Nobel Prize in Physiology or Medicine in 1962 for their discovery, Franklin’s contributions were largely overlooked. By the time the prize was awarded, Franklin had tragically died of ovarian cancer in 1958, at the age of 37.

The double helix refers to the twisted ladder-like shape of the DNA molecule, where two long strands wind around each other.

DNA is composed of two long strands forming a double helix, with each strand consisting of a sugar-phosphate backbone and nitrogenous bases (adenine, thymine, cytosine, and guanine) paired together through hydrogen bonds.

DNA-structure-and-bases (Public domain), via Wikimedia Commons

Let’s break that down:

The sugar-phosphate backbone is like the sides of the ladder, made up of alternating sugar (deoxyribose) and phosphate groups.

The nitrogenous bases (adenine, thymine, cytosine, and guanine) are like the rungs of the ladder. These bases pair up in a very specific way: adenine (A) pairs with thymine (T), and cytosine (C) pairs with guanine (G).

The specificity of these base pairings—adenine with thymine (A-T) and cytosine with guanine (C-G)— is crucial because it ensures that the genetic code is accurately copied during cell division.

The discovery of the double helix model explained not only the structure of DNA and how it is stored but also how it could replicate itself, ensuring the faithful transmission of genetic information during cell division.

When any cell prepares to divide (as it must for an organism to grow and develop), the DNA must make a copy.

During replication, the DNA strands separate, and each strand serves as a template for the formation of a new complementary strand.

Nucleic acids - Transcription by Laboratoires Servier (CC BY-SA 3.0) <https://creativecommons.org/licenses/by-sa/3.0>, via Wikimedia Commons

This semi-conservative replication ensures that each daughter cell receives an identical copy of the DNA, preserving the genetic information across generations.

"Semi-conservative" means that each new DNA molecule has one old strand and one new strand, which helps maintain accuracy.

Errors during the replication process, though rare, can lead to mutations—changes in the DNA sequence that may result in variations in the organism.

Mutations can be thought of as "mistakes" in the DNA code. Some mutations are neutral, some are harmful, and occasionally, a mutation may confer an advantage, contributing to evolution over time — a process we’ll come back to in the final tile.

Genes

Building upon our understanding of DNA, we now turn our attention to genes, the specific sequences of DNA that carry the instructions for building and maintaining an organism.

A gene is a segment of DNA that contains the necessary information to produce a functional product, typically a protein.

Proteins, as we know, are important molecules that perform a wide range of functions in the body, from building tissues to regulating processes in cells.

The most common range for the length of a human gene is typically between 3,000 to 100,000 base pairs, though genes can vary greatly in size.

Some genes are as short as a few hundred base pairs, while others can span millions of base pairs.

The structure of a gene is complex, encompassing more than just a straightforward sequence of nucleotides.

A gene is partially made up of coding regions called exons, which contain the instructions for making proteins.

These exons are transcribed into RNA and ultimately translated into proteins, playing a direct role in determining the traits of an organism.

Haploid human genome sequence by NHS National Genetics and Genomics Education Centre (CC BY 2.0) <https://creativecommons.org/licenses/by/2.0>, via Wikimedia Commons

It's also important to note that despite the vast size of the human genome, only about 2% of our DNA actually codes for proteins. The remaining 98% includes non-coding regions, and some elements of these regions are still not fully understood.

Non-coding regions of genes are known as ‘introns’, and they appear interspersed between exons.

Although introns are transcribed into RNA, they are removed before the RNA is translated into a protein, and the exons are spliced together.

This splicing actually serves an important purpose, because this splicing of exons doesn’t always have to occur in the same way.

Different combinations of exons can be spliced together in different ways, leading to the production of different proteins from the same gene.

This ability to create multiple proteins from a single gene significantly increases the variety of proteins your body can produce, which is important for the complexity and adaptability of living organisms​

Non-coding regions of genes are not just "junk DNA": they include regulatory sequences that control the expression of these coding regions.

By ‘gene expression’, we mean the process by which the information in a gene is used to create a functional product, like a protein. We’ll come back to this in the following orb.

These regulatory sequences act like switches, determining when, where, and how much of a particular protein is produced.

Promoters, enhancers, and silencers are specific DNA regulatory sequences that act as binding sites for proteins known as transcription factors. These transcription factors either promote or inhibit the transcription of the gene into RNA, depending on the needs of the cell.

Promoters are like starting points, signaling where transcription should begin.

Enhancers are sequences that boost the activity of promoters, increasing gene expression.

Silencers do the opposite—they decrease gene expression.

This regulation ensures that genes are expressed at the right time and in the right cells, contributing to the proper development and function of an organism.

Genes are not isolated entities; they interact with each other and with various molecular pathways to influence an organism’s phenotype, the set of observable traits.

Phenotype includes everything from eye color to blood type—traits that you can see or measure.

This interaction between genes and the environment also plays a crucial role in how traits are expressed.

Mutations in genes, whether spontaneous or induced by external factors, can lead to variations in these traits.

Darwin Hybrid Tulip Mutation 2014-05-01 by LepoRello (CC BY-SA 3.0) <https://creativecommons.org/licenses/by-sa/3.0>, via Wikimedia Commons

Some mutations may have no effect, others can lead to genetic disorders, and some might even confer advantages that contribute to evolutionary change. Genetic disorders are diseases or conditions caused by alterations in the DNA.

Chromosomes

Having explored the structure and function of genes, the next step in understanding how genetic information is organized and transmitted is to look at chromosomes.

Genes, which carry the instructions for building and maintaining an organism, are not scattered randomly within the cell but are instead meticulously organized into structures called chromosomes. This organization ensures that the vast amount of genetic information in our cells is both compactly stored and efficiently accessible.

Pairs of human chromosomes. (Public domain), via Wikimedia Commons

Chromosomes are thread-like structures located in the nucleus of each cell, composed of tightly wound DNA.

If you think of genes as individual recipes in a vast cookbook, then chromosomes are like the chapters that organize these recipes into manageable sections.

In humans, the DNA contained within chromosomes is organized into 23 pairs, totaling 46 chromosomes. Each chromosome within a pair is inherited from one parent, so you receive 23 chromosomes from your mother and 23 from your father.

This pairing ensures that offspring have a combination of genetic material from both parents, which is the foundation of biological diversity.

Each chromosome contains a long, continuous molecule of DNA, which is tightly coiled and condensed with the help of histones.

This compact structure allows the long DNA molecules to fit within the confined space of the cell nucleus. Think of it as packing a long, delicate piece of thread into a small spool to keep it organized and protected.

The DNA is wrapped around histones to form nucleosomes, which further coil and fold to create the compact structure of a chromosome.

Chromatin and histones (Public domain), via Wikimedia Commons

The genes on each chromosome are arranged in a specific order, and their position on a chromosome is referred to as a locus.

This order is consistent across individuals of the same species, which is why scientists can map genes to specific locations on specific chromosomes.

For example, if a gene responsible for a particular trait is located near the tip of chromosome 4, it will be found in the same place in every human.

This consistency allows scientists to precisely identify and study the genetic basis of various traits.

Disease Gene Mapping with Multiple Chromosomes. Image by Esherma1 (CC BY-SA 3.0) <https://creativecommons.org/licenses/by-sa/3.0>, via Wikimedia Commons

Chromosomes also contain regions that do not code for proteins but are essential for maintaining chromosome integrity and regulating gene expression.

Telomeres by AJC1 (CC BY-SA 4.0) <https://creativecommons.org/licenses/by-sa/4.0>, via Wikimedia Commons

Telomeres, for instance (which you can see above), are repetitive sequences at the ends of chromosomes that protect them from degradation, much like the plastic tips on shoelaces prevent them from fraying.

Centromeres, located near the center of chromosomes, are essential for the proper segregation of chromosomes during cell division, ensuring that each daughter cell receives the correct number of chromosomes.

During cell division, chromosomes play a crucial role in ensuring that genetic information is accurately passed on to daughter cells.

When a cell divides, its chromosomes are duplicated, and each new cell receives an identical set of chromosomes.

During mitosis, this ensures that every cell in your body has the same genetic information.

However, as you may remember from an earlier orb, in reproductive cells, a different type of cell division called meiosis occurs, reducing the chromosome number by half so that when fertilization happens, the resulting zygote has the correct number of chromosomes—restoring the full set of 46 in humans.

Gene Expression

Gene expression is the process by which the information encoded in DNA is used to produce the observable traits of an organism.

This concept is central to understanding how the instructions in our genes are translated into the proteins that perform almost every function in the body.

To grasp gene expression, we need to start with the central dogma of molecular biology, which outlines the flow of genetic information from DNA to RNA to protein.

The process begins with transcription, where the DNA sequence of a gene is copied into messenger RNA (mRNA).

TranscriptionGraphic PublicDomain (CC0) <http://creativecommons.org/publicdomain/zero/1.0/deed.en>, via Wikimedia Commons

Imagine DNA as a large cookbook, and each gene as a specific recipe within it.

Transcription is like copying one of these recipes onto a separate sheet of paper—this sheet is the mRNA, which will carry the instructions out of the DNA “cookbook” and into the cell’s kitchen.

Transcription is initiated when an enzyme called RNA polymerase binds to a specific region of the DNA, known as the promoter, which signals the start of the gene.

RNA polymerase unwinds the DNA helix and creates a complementary strand of mRNA by matching RNA nucleotides to the DNA template.

Image: Genomics Education Programme, CC BY 2.0 <https://creativecommons.org/licenses/by/2.0>, via Wikimedia Commons

Unlike DNA, where adenine (A) pairs with thymine (T), in RNA, adenine pairs with uracil (U).

This newly formed mRNA strand is like a portable recipe, ready to be used outside the nucleus of the cell.

After transcription, once the mRNA is synthesized, it undergoes processing. This involves removing introns—non-coding sections of RNA—and splicing together exons, the coding regions that will be used to build the protein.

DNA exons and introns splicing process. Image (Public domain), via Wikimedia Commons

The processed mRNA then exits the nucleus and enters the cytoplasm, where it meets the cell's ribosomes.

Think of ribosomes as chefs in the cell’s kitchen, responsible for reading the mRNA "recipe" and assembling the protein.

In the translation stage, the ribosome reads the mRNA sequence and translates it into a chain of amino acids, the building blocks of proteins.

During translation, transfer RNA (tRNA) molecules bring the appropriate amino acids to the ribosome, where they are linked together in a particular order.

The Transfer RNA (tRNA) molecules know the correct order in which to bring amino acids to the ribosome because they follow the sequence of codons in the mRNA.

Transcription and translation. Image: NHS National Genetics and Genomics Education Centre, CC BY 2.0 <https://creativecommons.org/licenses/by/2.0>, via Wikimedia Commons

Codons are groups of three nucleotides in the mRNA that specify which amino acid should be added next.

The growing chain of amino acids eventually folds into a functional protein, ready to carry out its role in the cell.

Gene expression is one-way directed process, though not a simple one; it is carefully regulated at multiple stages to ensure that proteins are produced at the right time, in the right place, and in the right amounts.

Additionally, other mechanisms like regulatory RNAs and epigenetic modifications can fine-tune the expression of genes in response to the cell’s needs or environmental changes.

Epigenetic modifications involve changes that affect gene expression without altering the DNA sequence itself. A common example is the addition of a chemical group called a methyl group to DNA, which can turn a gene off or reduce its activity.

Epigenetic mechanisms (Public domain), via Wikimedia Commons

For example, in response to chronic stress, methyl groups may be added to genes involved in stress regulation, lowering their activity. This could be the body's way of adapting to prolonged stress by lessening the impact of stress hormones.

This tightly regulated process of gene expression is also the foundation of many modern biotechnologies, such as genetic engineering and gene therapy, which aim to manipulate gene expression to treat genetic disorders by restoring the proper levels of key proteins.

Heredity

With a solid understanding of how genes are organized into chromosomes and how they function within cells, we can now explore how these genes are passed from one generation to the next—a process known as heredity.

Heredity is the mechanism by which genetic information is transmitted from parents to offspring, ensuring the continuity of life and the preservation of traits within a species.

The foundational principles of heredity were first uncovered by Gregor Mendel, a 19th-century scientist often referred to as the "father of modern genetics."

Gregor Mendel (Public domain), via Wikimedia Commons

Mendel discovered these patterns through meticulous experiments with pea plants, Mendel discovered that traits are inherited in specific, predictable patterns, governed by what we now understand to be genes.

Doperwt rijserwt peulen Pisum sativum by Rasbak (CC BY-SA 3.0) <http://creativecommons.org/licenses/by-sa/3.0/>, via Wikimedia Commons

These laws of heredity laid the groundwork for modern genetics by establishing the idea that genes are the fundamental units of inheritance, passed from parents to offspring in a predictable manner.

The discovery of chromosomes as the carriers of genes later confirmed and expanded on Mendel’s principles, integrating them into the broader framework of molecular biology.

One of Mendel’s key discoveries was the concept that genes exist in pairs, with each parent contributing one 'version' of that gene to their offspring. These pairs of genes are known as alleles, and they can be either dominant or recessive.

This principle is known as Mendel’s Third Law of Dominance.

A dominant allele is like the louder voice in a pair—it tends to express itself in the organism’s traits, even if the other allele (from the other parent) is different.

For instance, if the allele for purple flower color is dominant, a pea plant with one purple flower allele and one white flower allele will still have purple flowers because the dominant purple allele "masks" the presence of the recessive white allele.

Pisum sativum biflorum1 (Public domain), via Wikimedia Commons

A recessive allele is quieter and will only express itself in the organism’s traits if both alleles for that trait are recessive.

In our example, the white flower color would only appear if the pea plant inherited the white allele from both parents. But if it appeared alongside a dominant purple the white allele would remain hidden, but could still be passed on to the next generation.

So, if a pea plant carries one allele for purple flowers (dominant) and one for white flowers (recessive), the dominant purple allele will determine the flower color, while the white allele remains hidden but can still be passed on to the next generation.

Mendel’s first law, the Law of Segregation, explains how these alleles are separated during the formation of gametes—sperm and egg cells—so that each gamete carries only one allele for each trait.

When fertilization occurs, the offspring inherits one allele from each parent, restoring the pair.

Sperm-egg (Public domain), via Wikimedia Commons

Another key principle Mendel discovered is the Law of Independent Assortment, which explains how different traits are inherited independently of one another.

This law states that the alleles for different traits segregate independently during the formation of gametes.

As a result, the inheritance of one trait (such as flower color) does not influence the inheritance of another (such as seed shape).

This principle applies primarily to genes located on different chromosomes or far apart on the same chromosome.

Mendel’s experiments showed that when crossing plants with two different traits, the offspring exhibited combinations of traits in ratios that could be mathematically predicted, demonstrating the independent nature of allele assortment.

As well as introducing the concept of dominant and recessive traits, Mendel’s work laid the foundation for the concepts of genotype and phenotype.

The genotype refers to the genetic makeup of an organism—the specific combination of alleles it possesses.

The phenotype is the observable expression of these traits—what we can actually see, such as eye color or height.

Coquina variation in phenotype. Image by Debivort (CC BY-SA 3.0) <http://creativecommons.org/licenses/by-sa/3.0/>, via Wikimedia Commons

While the genotype determines the potential for a trait, the phenotype can be influenced by interactions between different alleles, as well as environmental factors.

For example, even if someone has the genetic potential for tall stature (genotype), factors like nutrition during growth years can influence the actual height (phenotype).

The principles of Mendel’s laws apply universally to all sexually reproducing organisms, from plants to animals to humans.

They form the basis for understanding not only simple inheritance patterns but also more complex phenomena such as genetic disorders, polygenic traits (traits controlled by multiple genes), and the role of genetic variation in evolution.

Heredity is the engine of evolution—and this will be the topic of our final orb.