Table of contents

Prepare for your exams

Upload your syllabus and get recommendations on what to study and when. No syllabus? Sharing your exam schedule works too.

Skip topic navigation

1. Introduction to Biochemistry4h 34m
2. Water3h 23m
3. Amino Acids8h 10m
4. Protein Structure10h 4m
5. Protein Techniques14h 5m
6. Enzymes and Enzyme Kinetics13h 38m
7. Enzyme Inhibition and Regulation 8h 42m
8. Protein Function 9h 41m
9. Carbohydrates7h 49m
10. Lipids5h 49m
11. Biological Membranes and Transport 6h 37m
12. Biosignaling9h 45m
Review 1: Nucleic Acids, Lipids, & Membranes2h 47m
Review 2: Biosignaling, Glycolysis, Gluconeogenesis, & PP-Pathway3h 12m
Review 3: Pyruvate & Fatty Acid Oxidation, Citric Acid Cycle, & Glycogen Metabolism2h 26m
Review 4: Amino Acid Oxidation, Oxidative Phosphorylation, & Photophosphorylation1h 48m

5. Protein Techniques

Indirect Protein Sequencing Via Geneomic Analyses

5. Protein Techniques

Indirect Protein Sequencing Via Geneomic Analyses: Videos & Practice Problems

Video Lessons Practice Worksheet

Topic summary

Indirect protein sequencing through genomic analyses is more efficient than direct methods like tandem mass spectrometry. Working with DNA is easier due to its stability, allowing faster and cheaper sequencing. While genomic analyses can derive amino acid sequences from nucleotide sequences, they cannot identify unknown proteins or detect chemically modified residues, which direct sequencing can. Understanding the genetic code is crucial for translating mRNA codons into amino acids, revealing peptide sequences. This synthesis of genomic and direct methods enhances our ability to analyze proteins effectively.

concept

Indirect Protein Sequencing Via Genomic Analyses

Video duration:

Play a video:

Was this helpful?

Indirect Protein Sequencing Via Genomic Analyses Video Summary

Indirect protein sequencing through genomic analyses is a crucial method in biochemistry that allows researchers to derive protein sequences from nucleotide sequences of genes. While direct protein sequencing techniques, such as tandem mass spectrometry and Edman degradation, focus on already isolated proteins, they do not reflect the primary source of protein sequencing data, which is often obtained indirectly from genomic analyses.

The primary advantage of using genomic analyses lies in its efficiency. Working with DNA is generally easier and more stable than working with proteins, which can be sensitive to environmental conditions such as temperature and pH. This stability allows for faster, cheaper, and more informative sequencing processes. While direct protein sequencing provides the amino acid sequence, genomic analyses enable the retrieval of nucleotide sequences, which can then be translated into amino acid sequences using the genetic code.

However, direct protein sequencing remains essential for several reasons. It can identify unknown protein samples without requiring a corresponding DNA sample, which is a limitation of genomic analyses. Additionally, direct protein sequencing techniques can detect chemically modified amino acid residues, providing insights into post-translational modifications that genomic analyses cannot reveal.

Understanding the genetic code is fundamental to performing genomic analyses. The genetic code consists of codons, which are sequences of three nucleotides that correspond to specific amino acids. For example, the codon AUG codes for methionine, while GCU codes for alanine. The process begins with the transcription of a DNA coding sequence into mRNA, where thymine (T) is replaced by uracil (U). The mRNA is then read in sets of three nucleotides (codons) to determine the corresponding amino acids, ultimately revealing the peptide sequence.

For instance, if a DNA coding sequence is transcribed into mRNA, the first codon AUG would be identified as methionine, followed by GCU for alanine, GGC for glycine, CGG for arginine, AGC for serine, and AAA for lysine. This systematic approach illustrates how genomic analyses can effectively yield the amino acid sequence of a peptide through indirect sequencing methods.

In summary, while genomic analyses provide a rapid and efficient means of obtaining protein sequencing data, direct protein sequencing techniques are indispensable for identifying unknown proteins and detecting post-translational modifications. Understanding the interplay between these methods enhances our ability to study proteins and their functions in biological systems.

Problem

Use the genetic code above & the coding DNA sequence below to determine the protein sequence.

Video duration:

Play a video:

Was this helpful?

Indirect Protein Sequencing Via Genomic Analyses Practice 1 Video Summary

To determine the protein sequence from a coding DNA sequence, we first need to convert the DNA into mRNA. The key difference between DNA and RNA is that RNA contains uracil (U) instead of thymine (T). Therefore, when converting the coding DNA sequence to mRNA, every thymine in the DNA is replaced with uracil.

For example, if the coding DNA sequence contains five thymine bases, we will replace each of these with uracil in the corresponding mRNA sequence. The resulting mRNA sequence is read from the 5' to 3' direction, and it consists of codons, which are groups of three nucleotides.

Once we have the mRNA sequence, we can identify the codons. Each codon corresponds to a specific amino acid based on the genetic code. For instance, the first codon, AUG, is recognized as methionine (M), which is often the start codon in protein synthesis. The subsequent codons are translated as follows:

GCC → Alanine (A)
UGC → Cysteine (C)
GUU → Valine (V)
CUC → Leucine (L)
AAG → Lysine (K)

Thus, the complete protein sequence derived from the original coding DNA sequence is methionine-alanine-cysteine-valine-leucine-lysine, often abbreviated as M-A-C-V-L-K. This sequence represents the order of amino acids that will form the protein, highlighting the importance of accurate transcription and translation in gene expression.

Problem

Suppose the sequence below is a template DNA sequence. What is the corresponding protein sequence?

Video duration:

Play a video:

Was this helpful?

Indirect Protein Sequencing Via Genomic Analyses Practice 2 Video Summary

In this practice problem, we explore the process of deriving a protein sequence from a template DNA sequence, which is distinct from a coding DNA sequence. The template DNA serves as a guide for creating a complementary coding DNA sequence through specific base pairing rules: adenine (A) pairs with thymine (T), and cytosine (C) pairs with guanine (G). To convert the template DNA sequence into the coding DNA sequence, we apply these rules systematically, ensuring that the resulting coding sequence is oriented in the 5' to 3' direction.

Once we have the coding DNA sequence, we recognize that it is antiparallel to the template strand, meaning that while one strand runs from 5' to 3', the complementary strand runs in the opposite direction. To obtain the mRNA sequence, we transcribe the coding DNA sequence by replacing all thymine (T) nucleotides with uracil (U). This mRNA sequence is also read from the 5' to 3' direction, which is crucial for the next step.

After constructing the mRNA sequence, we divide it into codons, which are groups of three nucleotides. Each codon corresponds to a specific amino acid, as defined by the genetic code. For instance, the codon CUU codes for leucine (L), GAG for glutamic acid (E), AAC for asparagine (N), GCA for alanine (A), GGC for glycine (G), and CAU for histidine (H). This sequence of amino acids forms the protein, which is represented from the N-terminal to the C-terminal end of the peptide.

In summary, the derived protein sequence from the given template DNA is leucine, glutamic acid, asparagine, alanine, glycine, and histidine. Understanding the differences between template and coding DNA sequences, as well as the transcription process to mRNA and translation to amino acids, is essential for grasping the fundamentals of molecular biology.

Problem

Even when the sequence of nucleotides for a gene is available and genomic analyses can be performed, direct chemical techniques on the physical protein are still required to determine:

The molecular weight of a simple protein.

The N-terminal amino acid residue.

The total number of amino acid residues in the protein.

The location of disulfide bonds.

Do you want more practice?

We have more practice problems on Indirect Protein Sequencing Via Geneomic Analyses

Here’s what students ask on this topic:

What is indirect protein sequencing via genomic analyses?

Indirect protein sequencing via genomic analyses involves determining the amino acid sequence of a protein by first sequencing the DNA that encodes it. This method leverages the stability and ease of working with DNA compared to proteins. By sequencing the DNA, we can obtain the nucleotide sequence, which can then be translated into the corresponding amino acid sequence using the genetic code. This approach is faster, cheaper, and more efficient than direct protein sequencing methods like tandem mass spectrometry or Edman degradation.

Created using AI

Why is DNA sequencing preferred over direct protein sequencing?

DNA sequencing is preferred over direct protein sequencing because it is faster, cheaper, and more efficient. DNA is more stable and easier to work with in the lab compared to proteins, which are sensitive to conditions like temperature and pH. Additionally, DNA sequencing provides more comprehensive data, allowing researchers to derive the amino acid sequence from the nucleotide sequence. This makes genomic analyses a more practical and informative approach for obtaining protein sequencing data.

Created using AI

What are the limitations of genomic analyses in protein sequencing?

Genomic analyses have several limitations in protein sequencing. Firstly, they cannot identify unknown protein samples without a corresponding DNA sequence. Secondly, genomic analyses cannot detect chemically modified amino acid residues, which are important for understanding protein function and regulation. These limitations necessitate the use of direct protein sequencing methods, such as tandem mass spectrometry, which can identify unknown proteins and reveal post-translational modifications.

Created using AI

How does the genetic code facilitate indirect protein sequencing?

The genetic code facilitates indirect protein sequencing by providing a set of rules for translating nucleotide sequences (mRNA codons) into amino acid sequences. Each codon, a sequence of three nucleotides, corresponds to a specific amino acid. By using the genetic code, researchers can convert the mRNA sequence derived from DNA into the corresponding peptide sequence. This process involves reading the codons in the mRNA and matching them to their respective amino acids, thus revealing the protein's sequence.

Created using AI

What are the advantages of direct protein sequencing methods?

Direct protein sequencing methods, such as tandem mass spectrometry and Edman degradation, have several advantages. They can identify unknown protein samples without needing a corresponding DNA sequence. Additionally, direct methods can detect chemically modified amino acid residues, which are crucial for understanding protein function and regulation. These capabilities make direct protein sequencing essential for comprehensive protein analysis, complementing the data obtained from genomic analyses.

Created using AI

Previous Topic: Strategy for Ordering Cleaved Fragments Next Topic: Enzymes

Your Biochemistry tutor

Jason Amores Sumpter

Biology, Biochemistry and Microbiology lead instructor

Download the Mobile app

Privacy

Do not sell my personal information

Accessibility

Patent notice

Sitemap