
The structure of the genome


As previously touched on, the genome is the entire set of genetic material carried by an individual or a species, and it varies from one individual or species to the next. The collection of sequenced genomes from different species keeps growing and includes the human genome (sequenced by the Human Genome Project). For example, the human genome, by chromosome, is viewable here:


Only a minority of DNA actually encodes the amino acid sequences used to make proteins. Wherever a DNA sequence in a genome does this, it is called a gene. The vast majority of DNA in humans, for example, does not consist of genes.



Some of the remaining DNA is transcribed into RNA that is not used as mRNA to make proteins via translation, but has other functions. Examples are tRNA (transfer RNA), which carries amino acids to the ribosome during translation, and rRNA (ribosomal RNA), which makes up the ribosomes themselves.



There’s also a large amount of other non-coding DNA made up of sequences that are repeated many times over, and even sequences that can move around within the genome (transposable elements, TEs). Some non-coding DNA is involved in the regulation of transcription, while other non-coding DNA takes no active part in any obvious way, or perhaps has a role waiting to be discovered sometime soon!


Surprisingly, as little as 1.5% of the human genome is used in the production of proteins. This fraction is made up of the exons, which are joined together when mRNA splicing removes the introns, before the mature mRNA heads out to the ribosome for translation. Some of the repetitive sequences are very short, while others are quite long. For perspective, the human insulin gene is almost 5,000 DNA base pairs (5 kbp) long, while the whole genome is 3,234,830,000 base pairs (3.23 Gbp) long.
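To put these figures side by side, here is a minimal Python sketch of the arithmetic. It uses only the numbers quoted above (the 1.5% figure, the approximate 5 kbp insulin gene and the 3.23 Gbp genome); the function and variable names are made up for illustration.

```python
# Figures quoted in the text above; the names are illustrative only.
GENOME_BP = 3_234_830_000   # whole human genome, in base pairs
INSULIN_GENE_BP = 5_000     # "almost 5,000" bp, rounded for simplicity
CODING_FRACTION = 0.015     # about 1.5% of the genome ends up in proteins


def coding_base_pairs(genome_bp, coding_fraction):
    """Number of base pairs that fall within the protein-coding fraction."""
    return round(genome_bp * coding_fraction)


def fraction_of_genome(region_bp, genome_bp):
    """Share of the genome taken up by a region of the given length."""
    return region_bp / genome_bp


coding_bp = coding_base_pairs(GENOME_BP, CODING_FRACTION)
insulin_share = fraction_of_genome(INSULIN_GENE_BP, GENOME_BP)

print(f"Protein-coding DNA: about {coding_bp / 1e6:.1f} Mbp")
print(f"Insulin gene share of the genome: about {insulin_share:.7%}")
```

In other words, the protein-coding part comes to roughly 48.5 million base pairs, and a single gene the size of the insulin gene is only around 1.5 millionths of the whole genome.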


Within the repetitive sequences, including TEs, there are subtypes of elements. Alu elements are short DNA sequences, so called because an enzyme from the bacterium Arthrobacter luteus recognises and cleaves them (it is a restriction endonuclease, a type of enzyme commonly used in molecular biology work with DNA). Other sequences with specific characteristics are termed L1, L2 and L3, and are a subset of LINEs (long interspersed nuclear elements).
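How a restriction endonuclease recognises and cleaves DNA can be sketched in a few lines of Python. The sketch assumes the recognition site of the Arthrobacter luteus enzyme AluI, which is AGCT, cut between the G and the C to leave blunt ends (this detail is not given in the text above). The example sequence and the function names are made up for illustration, and only one strand is modelled.

```python
ALUI_SITE = "AGCT"   # assumed AluI recognition sequence (not stated in the text)
CUT_OFFSET = 2       # the cut falls after the G, i.e. AG|CT


def find_sites(sequence, site=ALUI_SITE):
    """Return the start position of every recognition site in the sequence."""
    sequence = sequence.upper()
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)
    return positions


def digest(sequence, site=ALUI_SITE, cut_offset=CUT_OFFSET):
    """Cut the sequence at every recognition site and return the fragments."""
    sequence = sequence.upper()
    fragments = []
    previous_cut = 0
    for position in find_sites(sequence, site):
        cut = position + cut_offset
        fragments.append(sequence[previous_cut:cut])
        previous_cut = cut
    fragments.append(sequence[previous_cut:])  # whatever follows the last cut
    return fragments


# Made-up 20-base sequence containing two AGCT sites.
example = "GGATAGCTTTACGAGCTCCA"
print(find_sites(example))
print(digest(example))
```

Joining the fragments back together gives the original sequence, which is a quick check that the cuts only split the DNA and do not remove any bases.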


L1 elements are the main LINEs, and the only ones still active in humans at present. Only remnants of L2 and L3 exist in the genome.




