I studied the C programming language, and one of the things that was stressed was the danger of null pointers. A pointer is a type of variable that points directly to a location in memory, so it is a fast and efficient way to access data. However, when you create one, if you forget to set it to an initial value, such as zero, it will have a random hexadecimal number representing whatever combination of ones and zeros happened to be at that memory location when the computer was turned on. If that random number points to a key area of the computer’s operating system or video memory, and you start writing to it, VERY BAD THINGS can happen.
Now think about DNA. It has over 3 billion base pairs, made up of only four different ‘letters,’ C, T, A, and G, which create amino acids which then create proteins. Now imagine that an error has crept into either the sperm or egg cell which have combined to create an embryo. Every one of the daughter cells resulting from cell division will have the same change. Errors involving a single base pair could cause serious consequences by changing which amino acids are created, thereby altering the start or end sequence of a protein string. If the defect causes the RNA to add base pairs from before the actual start of the string, or after its end, or if it causes the string to terminate early, the resulting protein would be non-functional. This means that a single mutation in one of the 3 billion base pairs could result in that baby not being able to make a specific protein, and VERY BAD THINGS can happen.
Scientists learn how it all works by studying what happens when things go wrong, and this book describes a number of horrifying genetic conditions, from Fragile X syndrome, to Lou Gehrig’s Disease, Burkitt’s lymphoma, Ohio Amish Dwarfism, and many, many more. It’s enough to make you lie in bed at night staring up at the ceiling wondering how any of us turn out more or less normal.
Want something else to worry about? Consider telomeres. We get one set of 23 chromosomes from our mother and one from our father, 46 total, each with a beginning and an end, for 92 in all. Telomeres are used to mark these ends, an important function because otherwise the ends of different chromosomes could get joined up, resulting in the creation of too much or two little of essential proteins, such as the ones that regulate cell division, which would greatly increase our chances of cancer. “The telomeric DNA is formed from repeats of the same six base pairs, TTAGGG, repeated over and over again. These stretch for an average of about 10,000 base pairs in total on each end of every chromosome in the umbilical cord blood of a newborn human baby.” (p. 49) However, the number of telomeres decreases every time the cell divides, so the longer we live, the greater the chance we will get cancer or something equally awful.
The key segments of the human genome were sequenced in 2001, although it was not one hundred percent sequenced until 2022. Many researchers had predicted they would find 50,000 genes, and some estimates were three times that, so it was a shock when the actual number was only about 24,000. However, at least 70 percent of our genes can fold into different configurations, producing multiple different proteins. Another surprise was that over 98 percent of the DNA in a human cell does not code for proteins at all, and thus was labeled ‘junk.’
Some of it truly is junk, consisting of sequences that found their way into our DNA far, far back in time. It must have been evolutionarily useful at some point, or it would have been weeded out. Over time its usefulness was lost, and since it was not harmful there was no pressure to remove it, and without that it could mutate freely, so by now those stretches of DNA are just random junk. There are some mechanisms in the DNA replication sequences that ensure this extraneous DNA remains suppressed, but long stretched of it are harmless anyway, just excess baggage passed down from very distant ancestors.
After the genome sequencing was completed, further research yielded interesting findings. Human proteins themselves are not especially large, and about the same size as those of flies or worms. “The only genomic feature that increased in number as animals became more complicated were the regions of junk DNA. The more sophisticated an organism, the higher the percentage of junk DNA it contains.” (p. 12)
Clearly, there must be method in this madness, and it is unfortunate that the name junk has stuck. Junk DNA performs a number of critical functions, which this book describes chapter by chapter. These include things like retrogenes, assisting protein sequence creation, non-protein coding RNAs, telomeres, centromeres, enhancers, promoters, epigenetics, 3D interactions, splicing, and insulators. I admit that I had some trouble following a few of the explanations, but the book has a number of good illustrations to show what is going on.
There are also has some surprising facts. In discussing stem cells, which can form any type of cell as needed, I did not know how energetic the process of red blood cell creation is. “The human body produces about 2 million red blood cells every second. That requires an awfully active stem cell population, in a pretty much constant state of cell division. This is one of the reasons why cancer rates rise with age. Our immune system usually does a good job of destroying abnormal cells, but the effectiveness of this surveillance declines as stem cells die off.” (p. 52)
And here’s a fun fact: “If you stretched out the DNA from one human cell it would reach for two metres, assuming you joined up the material from all the chromosomes. But this DNA has to fit into the nucleus of a cell, and the nucleus has a diameter of just one hundredth of a millimetre.” (p. 64)
Finally, something I had never heard before, “the two strands of DNA run in opposite directions.” (p. 79) So, the first chromosome from one parent is joined to chromosome twenty-three of the other.
I enjoyed this book, and learned a lot from it. To help illustrate her points the author uses lots of homey metaphors, such as toast and butter in explaining how one X chromosome is preferentially, but randomly, selected over the other. She has another book, on epigenetics, which I have added to my reading list.