Please enable JavaScript.
Coggle requires JavaScript to display documents.
Variants, Variants - Coggle Diagram
-
Variants
-
Types
Small
Indels
Biological Effects
If in coding region
Frameshift mutations
If indel is of 1 or 2 bases - affects a large stretch of protein coding sequence, can also lead to premature STOP codon.
-
-
-
-
-
-
Germline variants - part of genetic code, inherited and common, in DNA of all body cells. Can cause disease or increase risk of disease - 1 in 1000 bp
-
Somatic variants - occurs during lifetime, differs between cells, can cause cancer. - 1 in 1,000,000 bp
Variant detection
-
-
Variant Calling
-
-
-
Variant calling tools
Snippy - a straightforward way to run multiple alignment and variant calling (SNPs) on the command line (SNVs, indels and rearrangements)
FreeBayes - when given an input of BAM files and a reference, it produces an output describing
SNPs, indels and complex variants in the alignments
-
-
Phasing
To know in which of the 2 chromosomes (for a diploid organism), variants are located with respect to each other
For example, phased indels can tell us about the biological effects of mutations more accurately, than not knowing which chromosome these indels are in.
-
If the variants happen to be close together, we can interpret the phase by using information from overlapping reads and read pairs from paired-end reads.
Haplotypes
Haploid genotype - set of phased genotypes. We know which of the variants occur on the same chromosome.
Some SNPs are inherited together due to linkage disequilibrium - SNPs are co-inherited either due to being physically close together or in recombination hotspots
Use
Genotype imputation - use haplotype information (international hapmap project) to deduce likely genotypes of other unmeasured SNPs, that are linked to the genotypes of measured SNPs --- not useful if the whole genome is being sequenced
Variant annotation - using UCSC, VEP, pfam, etc.