1. Coding Sequence (CDS): This is the region of the gene that contains the instructions for building a protein. It's made up of codons, which are three-nucleotide sequences that code for specific amino acids.
2. Regulatory Sequences: These are regions that control when and where a gene is expressed. They include:
* Promoter: This is the binding site for RNA polymerase, the enzyme that transcribes the gene into RNA.
* Enhancer: These are regions that can enhance transcription, often located far away from the promoter.
* Silencer: These are regions that repress transcription.
3. Introns and Exons: Eukaryotic genes are often interrupted by non-coding regions called introns, which are removed during RNA processing. The coding regions are called exons and are spliced together to form the mature mRNA.
4. 5' and 3' Untranslated Regions (UTRs): These are regions that are not translated into protein but are important for regulating gene expression.
* 5'UTR: Contains sequences that influence the initiation of translation.
* 3'UTR: Contains sequences that affect mRNA stability, localization, and translation.
5. Other Sequences:
* Polyadenylation signal: A sequence in the 3'UTR that signals for the addition of a poly-A tail to the mRNA, important for stability.
* Splice sites: Sequences at the ends of introns that guide the splicing machinery to remove them.
It's important to remember that the organization and structure of eukaryotic genes can be quite diverse. Some genes may have multiple exons and introns, while others may be entirely uninterrupted. Similarly, the length and complexity of regulatory sequences can vary greatly.