Menu

Secuenciación Genómica (Bioinformática II) - Coursera - Especialización

Acerca del Curso

In "Finding Hidden Messages in DNA", we discussed how to find short regulatory motifs in genomes.  But how do we know what the DNA sequence making up a genome is in the first place?  After all, biologists still do not possess technology that would read all the nucleotides of your genome from beginning to end.


In this course, you will learn how entire genomes are assembled from millions of short overlapping pieces of DNA.  The scale of this problem (the human genome is 3 billion nucleotides long!) implies that computers must be involved. Yet the problem is even more complex than it may appear ... to solve it, we will need to travel back in time to meet three famous mathematicians, and learn about algorithms based on graph theory.

Later in the course, we will see that sequencing genomes is not the only task related to decoding biological macromolecules.  Another difficult problem is sequencing antibiotics, short mini-proteins engineered by bacteria to fight each other.  Even though antibiotics often contain fewer than 10 amino acids, sequencing them is a formidable challenge. Decoding the sequence of amino acids making up an antibiotic is an important biomedical problem, but the practical barriers to sequencing short antibiotics are often more substantial than barriers to assembling a genome with millions of  nucleotides! To address this computational challenge, we will learn aboutbrute force algorithms that often succeed in various bioinformatics applications.

Finally in this course, you will learn how to apply popular bioinformatics software tools to assemble a deadly Staphylococcus bacterium. You will also be introduced to the popular cloud service BaseSpace offered by Illumina, the leading DNA sequencing company, thus joining the thousands of biologists and bioinformaticians who use BaseSpace every day.


Programa del Curso
  • Exploding Newspapers
  • The String Reconstruction Problem
  • String reconstruction as a walk in the overlap graph
  • Another graph for string reconstruction
  • Walking in the de Bruijn graph
  • The seven bridges of Konigsberg
  • Euler's Theorem
  • From Euler's Theorem to an Algorithm for Finding Eulerian Cycles
  • Assembling genomes from read-pairs
  • Epilogue: Genome assembly faces real sequencing data
How Do We Sequence Antibiotics? (Brute Force Algorithms)
  • The Discovery of Antibiotics
  • How Do Bacteria Make Antibiotics?
  • Dodging the Central Dogma
  • Sequencing Antibiotics by Shattering them into Pieces
  • A Brute Force Algorithm for Cyclopeptide Sequencing
  • A Branch-and-Bound Algorithm for Cyclopeptide Sequencing
  • Just How Fast Is This Algorithm?
  • Adapting Cyclopeptide Sequencing for Spectra with Errors
  • From 20 to More than 100 Amino Acids
  • The Spectral Convolution Saves the Day
  • Epilogue: From Simulated to Real Spectra
Bioinformatics Application Challenge: Sequencing a Staphylococcus aureusgenome

Preparación Previa Recomendada
If you are aiming at earning a standard certificate in this class, then you do not need to have any experience in biology or programming.  The only prerequisite is the enthusiasm to learn about how computational approaches are used in modern biology :)
If you are aiming at earning a certificate with distinction, then you should either know the basics of programming in the language of your choice (there is no required language for this course) or be willing to learn about programming before the course begins. In this case, in addition to the first course in this series,  we have the following suggestions for resources that will help you learn programming:
  • The language tracks on Codecademy, particularly the Python track.
  • Introductory problems on Rosalind, a resource for learning bioinformatics created by the course instructors.

"Finding Hidden Messages in DNA" is the suggested prerequisite for taking this course, but it is not a strict prerequisite.


Lecturas Sugeridas

The printed course companion is Bioinformatics Algorithms: An Active-Learning Approach (2nd edition), by Compeau & Pevzner.

Formato del Curso

This course covers two chapters from Bioinformatics Algorithms: An Active Learning Approach, by Compeau & Pevzner.  The course also contains summary quizzes and lecture videos.



To earn a standard certificate in the class, you must complete weekly quizzes in addition to a Bioinformatics Application Challenge in which you apply popular bioinformatics software tools to a real experimental dataset. If you are eager to learn about bioinformatics, you should be able to complete the Application Challenge and earn the course certificate even if you do not know how to program.
To earn a certificate with distinction, rather than complete the Software Challenge, you must complete some programming assignments found in the course's interactive text. The distinction is a "hacker track" that is aimed at learners who know how to program and would like to explore the nuts and bolts of bioinformatics algorithms.




Con tecnología de Blogger.