HMM 17. 4. 2023
1. Refresh your knowledge of pHMMs by reproducing the R package 'aphid' vignetteexercise (link from previous week).
A nice formal write-up for general HMMs and the terminology and algorithms involved (WILL ADD LINK)
2. Can you create a HMM (either profile or general) that would generate sequences likely to form G-quadruplexes?
G-quadruplex is a sequence containing 4 runs of Gs, each of ideally 3Gs and a short loop section with no special sequence preferences (may not be entirely true in real G4s). Sometimes a regular expression G3+N+G3+N+G3+N+G3+ is used to identify PQS sequences (PQS = potential quadruplex sequence), or R package pqsfinder can be used.
3. Profile HMM exercise using HMMER
http://www.math.chalmers.se/Stat/Bioinfo/Master/Courses/BioinformaticsII/HMM_lab.html