
Advanced Methods in Bioinformatics

Week 4 - Genome and transcriptome assembly: state of the art and best practices

Instructor: Mgr. Monika Čechová, Ph.D.

LECTURE

Best practices in the genome and transcriptome assembly

EXERCISES

Genome Assembly as Shortest Superstring

Assessing Assembly Quality with N50 and N75
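
As a quick illustration of the N50 and N75 statistics named in the exercise above, here is a minimal sketch. It assumes the common definition (the contig length at which the running sum of contigs, sorted from longest to shortest, first reaches 50% or 75% of the total assembly length). The function name `nx` and the contig lengths are hypothetical and are not taken from the exercise itself.

```python
from typing import Iterable


def nx(lengths: Iterable[int], fraction: float) -> int:
    """Return the Nx statistic for a set of contig lengths.

    Contigs are sorted from longest to shortest; the result is the length
    of the contig at which the running total first reaches `fraction`
    of the summed length of all contigs.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no contig lengths given")
    threshold = fraction * sum(ordered)
    running = 0
    for length in ordered:
        running += length
        if running >= threshold:
            return length
    return ordered[-1]


# Hypothetical contig lengths (total = 300), for illustration only.
contigs = [80, 70, 50, 40, 30, 20, 10]
n50 = nx(contigs, 0.50)
n75 = nx(contigs, 0.75)
print(f"N50 = {n50}, N75 = {n75}")  # prints "N50 = 70, N75 = 40"
```

Because N75 requires covering a larger share of the assembly, it can never exceed N50 for the same set of contigs.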

HOMEWORK

Remember there is no “one right way” to do an analysis. Choose parameters that you think are the most suitable for your goal.

Create an account at https://usegalaxy.eu/
Load the following FASTQ files as a Collection (List of Pairs):
- https://zenodo.org/record/3541678/files/A1_left.fq.gz
- https://zenodo.org/record/3541678/files/A1_right.fq.gz
- https://zenodo.org/record/3541678/files/A2_left.fq.gz
- https://zenodo.org/record/3541678/files/A2_right.fq.gz
- https://zenodo.org/record/3541678/files/A3_left.fq.gz
- https://zenodo.org/record/3541678/files/A3_right.fq.gz
- https://zenodo.org/record/3541678/files/B1_left.fq.gz
- https://zenodo.org/record/3541678/files/B1_right.fq.gz
- https://zenodo.org/record/3541678/files/B2_left.fq.gz
- https://zenodo.org/record/3541678/files/B2_right.fq.gz
- https://zenodo.org/record/3541678/files/B3_left.fq.gz
- https://zenodo.org/record/3541678/files/B3_right.fq.gz
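
To make the "List of Pairs" structure concrete, the following sketch groups the twelve URLs above into six samples with two reads each. It is only an illustration of the pairing logic, not a Galaxy API call. The function name `pair_reads` is hypothetical, and treating `left` as the forward read and `right` as the reverse read is an assumption.

```python
import re
from collections import defaultdict

BASE = "https://zenodo.org/record/3541678/files/"
SAMPLES = ["A1", "A2", "A3", "B1", "B2", "B3"]

# Rebuild the twelve URLs listed in the assignment.
urls = [f"{BASE}{sample}_{side}.fq.gz"
        for sample in SAMPLES
        for side in ("left", "right")]


def pair_reads(urls):
    """Group read files into {sample: {"forward": url, "reverse": url}}.

    Assumption: "*_left" holds the forward reads and "*_right" the reverse reads.
    """
    pattern = re.compile(r"/([^/]+)_(left|right)\.fq\.gz$")
    pairs = defaultdict(dict)
    for url in urls:
        match = pattern.search(url)
        if match is None:
            raise ValueError(f"unexpected file name: {url}")
        sample, side = match.groups()
        pairs[sample]["forward" if side == "left" else "reverse"] = url
    # Every sample must have exactly one forward and one reverse file.
    incomplete = [s for s, p in pairs.items() if set(p) != {"forward", "reverse"}]
    if incomplete:
        raise ValueError(f"samples missing a mate: {incomplete}")
    return dict(pairs)


pairs = pair_reads(urls)
for sample, files in pairs.items():
    print(sample, files["forward"], files["reverse"])
```

Checking that every sample has both mates before uploading helps catch a missing or misnamed file early, since paired-end tools expect the two files of a pair to stay together.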

Run FastQC before and after trimming the reads with Trimmomatic. Trim for quality and consider whether adapter removal should be performed.
Assemble the trimmed reads with Trinity. Trinity outputs both gene and isoform files; focus on the isoforms.
Align the trimmed reads to this de novo reference assembly and estimate read abundance per isoform (Align reads and estimate abundance on a de novo assembly of RNA-Seq data). Use Salmon as the abundance estimation method.
Rename the datasets to A1_raw, A2_raw, A3_raw, B1_raw, B2_raw, B3_raw
Build an expression matrix for your de novo assembly of RNA-Seq data by Trinity (this is the first step in the differential gene expression pipeline)
Share your history with the user cechova.biomonika@gmail.com
Export your history to a file and upload the .tar.gz to the Odevzdávarna (the course homework vault) by April 13th, 2021
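
The expression matrix step above combines the per-sample abundance tables into one table with a row per isoform and a column per sample. The sketch below illustrates that idea outside Galaxy and is not what the Galaxy tool runs internally. It assumes Salmon's tab-separated `quant.sf` layout (columns `Name`, `Length`, `EffectiveLength`, `TPM`, `NumReads`). The function names and all numeric values are made up for illustration.

```python
import csv
import io


def read_quant(handle, column="TPM"):
    """Map transcript name -> value of `column` from one quant.sf-style table."""
    reader = csv.DictReader(handle, delimiter="\t")
    return {row["Name"]: float(row[column]) for row in reader}


def build_matrix(quants):
    """Combine {sample: {transcript: value}} into a header and sorted rows.

    Transcripts absent from a sample are filled with 0.0.
    """
    samples = list(quants)
    transcripts = sorted({name for table in quants.values() for name in table})
    header = ["transcript"] + samples
    rows = [[name] + [quants[s].get(name, 0.0) for s in samples]
            for name in transcripts]
    return header, rows


# Toy tables with made-up values; real input would be one quant.sf per sample.
QUANT_HEADER = "Name\tLength\tEffectiveLength\tTPM\tNumReads\n"
a1 = (QUANT_HEADER
      + "TRINITY_DN0_c0_g1_i1\t500\t350.0\t10.0\t40.0\n"
      + "TRINITY_DN0_c0_g1_i2\t450\t300.0\t5.0\t18.0\n")
b1 = (QUANT_HEADER
      + "TRINITY_DN0_c0_g1_i1\t500\t350.0\t2.5\t9.0\n")

quants = {
    "A1_raw": read_quant(io.StringIO(a1)),
    "B1_raw": read_quant(io.StringIO(b1)),
}
header, rows = build_matrix(quants)
for row in [header] + rows:
    print("\t".join(str(cell) for cell in row))
```

Naming the columns after the renamed datasets (A1_raw, B1_raw, and so on) keeps the replicate and condition of each column readable in later differential expression steps.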

This exercise is inspired by the following draft tutorial:

De novo transcriptome assembly, annotation, and differential expression analysis

SUPPLEMENTARY MATERIALS

De novo assembly and haplotype phasing of diploid human genomes using long High-fidelity reads and non-trio phasing approaches

Advanced Methods in Bioinformatics
- Week 1 - course organization, genome structure, NGS sequencing methods - 2 March 2021
- Week 2 - NGS data processing - 9 March 2021
- Week 3 - The repetitive genome, NGS methods using long reads - 16 March 2021
- Week 4 - Genome and transcriptome assembly: state of the art and best practices
- Week 5 - Variant calling and structural variation detection with long reads
- Week 6 - genome annotation from NGS data (variants, ChIP-seq, ENCODE) - 6 April 2021
- Week 7 - genome annotation, creating a custom annotation track and hub - 13 April 2021
- Week 8 - protein-protein/protein-DNA binding, transcription factors - 20 April 2021
- Week 9 - transcription factors, genome annotation - 27 April 2021
- Week 10 - Hidden Markov models (HMM) - 4 May 2021
- Week 11 - Profile HMMs, HMMER - 11 May 2021
- Week 12 - Analysis of the SARS-CoV-2 Spike (S) protein family - 18 May 2021
- Week 13 - mapping the Spike protein analysis onto the protein structure - 25 May 2021