Please enable JavaScript.

Coggle requires JavaScript to display documents.

Tools for understanding of complex disease (Quantitative Genetic Traits…

- - - - Genetic Variance split by:
        
        Additive genetic effects (VarA)
        
        Non-additive genetic/epistatic effects (VarD)
        
        Environmental Variance split by:
        
        Individual Environment (Var E)
        
        Common Environment (Var C)
        
        VarA/VarP = NARROW-SENSE HERITABILITY
        THEREFORE
        VarP = VarA + VarD + VarC + VarE
        
        Behavioural genetics does not include non-additive effects (epigenetics) -> VarP = a2 + c2 + e2
        BUT HOW CAN YOU RULE OUT ONE EFFECT - SURELY THIS WOULD NOT BE ACCURATE?
        
        Two independent people can have very similar environment... socioeconomic status, city/rural, healthcare etc
        
        Estimate effect of individual environment through calculating correlation of relative with additive genetics and common environment
        e2 (0) = c2 + a2 + VarP
        
        Heritability = MZ correlation - DZ correlation x 2
        
        If the trait is purely genetic, in theory, MZ correlation should be twice DZ.
        
        If it's not twice, common environment must play a role.
        
        VarE = 1 - MZ correlation
        
        e.g. HEIGHT
        rMZ = a2 + c2 = 0.80
        rDZ = .5a2 + c2 = 0.50
        
        rDZ-rMZ = 0.30
        
        a2 = 2(0.3) = 0.6 = genes
        
        c2 = 0.8 - 0.6 = 0.2 = common e
        
        1 - 0.6 - 0.2 = 0.2 = unique e
        
        ASSUMPTIONS:
        
        No interaction or correlation between genetics and environment
        
        Equal Environment
        
        Random Mating
        
        Generalisability to the general population
        
        CRITIQUES OF ABOVE
        
        rGE - Can genetic factors affect the choice of environment?
        GxE - Genetic control of the sensitivity to the environment - e.g. some are more likely to get depressed than others. (Bigger family input can give more info)
        .
        
        No environment is equal anyway, but twins have an even greater shared environment as they are treated more similarly - therefore heritability is overestimated. Labelling twin studies can show this criticism.
        .
        
        Not in all populations - partners who choose each other other on specific traits can decrease heritiability - e.g. delinquency.
        .
        
        Are twins generalisable to the population?
        
        SOLUTIONS TO CRITICISMS
        
        Including parents can:
        
        enable estimation of dominant genetic variance
        
        split common environment by sibling effects
        
        SIBLINGS:
        
        VarC = 1, VarA = 0.5
        -RSIB = 0.5A2 + 1C2
        
        MONOZYGOTIC TWINS (REARED APART)
        
        VarC = 0, VarA = 1
        
        ADOPTED SIBLINGS
        
        VarC = 1, VarA = 0
        
        DIZYGOTIC TWINS
        
        VarC = 1, VarA = 0.5
        
        MONOZYGOTIC TWINS
        
        VarC = 1, VariA = 1
        
        rMZ-rDZ = 0.5a2
        a2 = 2(rMZ-rDZ)
        c2 = rMZ - a2
        e2 = 1 - (a2 + c2)
  - - - Compares 12k MZ and DZ twins - stays with the critique of shared environment.
        
        Healthy, 80% female, aged 18-102, highly engaged and recallable
        .
        
        Determine the contributory factor of the genes associated with common complex diseases of aging and associated traits
        
        Integrated biomedical resource for research community
        • Data is available to bona fide researchers upon application
        • Many projects are collaborator driven
        • Healthy, unselected population allows use as controls in many studies
        .
        
        Receive questionnaires once a year
        
        Linked with medical history
        
        Clinical visit 6 hours, every 4 years (continuous)
        .
        They take Metabolomic (whole genome, microbiome, epigenetics etc), Biochemical, Physiological, Lifestyle, Family history, and hospital/death records.
        .
        
        GWASs with over 100 traits and 500 loci - prioritised understanding the mechanism
    - - Gene expression is an essential cellular function whose regulation determines a significant amount of phenotypic variance
        
        Gene regulation is under genetic, epigenetic
        and environmental control
        
        Tissue-specificity of gene expression and
        regulation points to the importance of
        studying appropriate cell types (preferably in
        vivo primary tissue)
        .
        
        Took adipose tissue, whole blood, lyphoblasoid cell lines, and whole skin.
        CAN FIND:
        
        Expression
        
        Methylation
        
        Genetics
        
        Epidemiology
        
        There is single tissue expression but most expression overlaps in all genes.
        .
        Regulation of genes can be local (cis) or distal (trans)
        .
        High heritibility in most twins but not always a unique environment effect (but could be prone to batch bias)
      - eQTL studies have a large effect size, have a proximal endophenotype, and a reduced search space (multiple testing)
    - - Intergenic variants in the FTO gene are robustly associated with obesity but JUST IN BRAIN NOT IN ADIPOSE - regulated by IRX3 (touching in loop)
        .
        
        GWAS signals mediated by cis-eQTLs are highly tissue specific - therefore sample needs to be done in the appropriate tissue.
        .
        
        Maternally expressed transcription factor KLF14 but only expressed in adipose tissue BUT causes transcription factors to bind in various places in adipose tissue across the genome and change regulation. PROTECTS FORM T2D BY SHIFTING FAT DISTRIBUTION.
      - Metabolomics can help measure things to avoid questionnaires e.g. nicotine / caffeine.
        .
        There are many metabolite - disease associations
        .
        One SNP can explain 30% variance
        . Different things are metabolised at different rates - this is linked to eQTLs
        .
        Can use mendialian randomization to look at causal SNPs
      - Microbiome sequencing to find heritibility and associations
        
        Found one heritable microbe assoicated with lean twin - christensenellaceae - gave it to mice and they lost weight (Goodrich et al 2014)
  - - - Try to link genetic variation to disease, either through GWAS or QTL.
        
        OR link common genetic variation with the environment to disease. Through transcriptome (eQT) -> Proteome (pQTL) -> Metabolome (mQTL) -> Intermediate phenotypes.
      - CARTaGENE - Quebec
        COLLECTS from the ENTIRE COHORT:
        
        Blood and urine -> for DNA and PMC
        
        Questionnaires -> gender/age, Demographics, location (pollution), life habits (nutrition), mental state, psychosocial environemt (stress/life events), disease history (individual/family/medication)
        
        Blood data - cell counts/ cell measures / serum measures
        
        Physical tests / cognitive tests / anthropometric tests
        .
        
        Gene expression - RNA Sequencing - transcriptome (1k ppts)
        
        Deep Sequencing - DNA Seq - exome sequencing (1k ppts)
        
        Genotyping - DNA - whole genome (96 ppts) - does 2.5 mil SNPs.
      - Rationale:
        
        1% urban population of Quebec (60k)
        
        Complex diseases require deep phenotyping
        
        PROSPECTIVE cohort of healthy ppts (there is self-reported disease/risk traits)
        
        Can access medical, phrama, and geneology
        
        APPLICATION:
        
        use variation in gene expression to explain phenotype effect
        
        Big differences between city ppts and rural ppts - a lot of involved genes are involved in o2 transport - pollution?.
        
        Integration between DNA and RNA - differences in alleles are associated with expression of the gene
    - - RNA sequencing data gives more than just expression:
        
        GWAS with SNPs from RNA Seq can find mutually significant results.
        
        Allele Specific Expression - different alleles have different expression - SNPs or bias in this expression can be associated with disease.
        
        Ongen 2014 sequenced tumour cells and normal cells form the same individual - there was significant ASE in the cancer cells - gene dysregulation.
      - RNA Methylation - can change base pairs post-transcription - changes to proteins.
        .
        
        Mitochondria patterns - only 16.5k bps long (more DNA has moved over to nucleus) - codes for 13 proteins for ETC - associated with 580 diseases (really bad diseases as well).
        
        Each gene is separated by a tRNA - important for cleavage.
        
        rs 11156878 significantly correlates with mRNA expression - explains 22% variation - is also associated with BMR
- - - - DRESS: Drug rash with eosinophilia and systematic symptoms
        
        Fever / Facial oedema / Rash
        
        Lymphadenopathy
        
        Hepatic / pulmonary / cardiac issues
        
        10% people with HIV get very severe reaction
        
        AGEP: acute generalised exanthematous pustulosis
        TEN: Toxic Epidermal Necrolysis
        
        Blistering / inflammation / can't swallow / death
      - Screening for genetic sensitivity for HLA-B*5701 can help reduce DRESS in abacavir
  - - - Warfarrin - Blood thinner
        
        Therapeutic dose varies from patients to patient (20-fold
        difference in effective dose among Caucasian patients)
        
        Can trigger fatal haemorrhaging if dose too high, or stroke
        if dose too low
        
        Warfarin is carefully monitored via regular INR
        (International Normalised Ratio) measurements (of blood
        clotting time)
        
        Narrow therapeutic window - INR needs to be between 2.0-3.9 Hylek 2003 - dose can vary between 0.5 - 7mg.
        
        There is interindividual variability in dose requirement for one population
        
        Many drug-food interactions e.g. mango/fish oil
        
        40% of dose-variability can be explained with VK0RC1 or CYP2C9 genes
        
        Splitting ppts into genotype by a point-of-care test significantly improved their outcomes (by 10%) (Pinmohamed 2013) now in 2017 guidelines.
        
        2013 Kimmel trial - didn't work, but then didn't start until 3-5 days after drug taking.
      - Azathioprine - Anti-inflammatory
        
        Induces T-cell apoptosis
        
        Used as an immunosuppressant agent (transplantation / inflammatory disease)
        .
        
        Is either broken down to inactive metabolites or active metabolites (homo/heterozygous for breaking down TPMT) (drug interactions can do the same thing)
        
        Test is available but isn't used for half of ppts - and only account for 27% of myelosuppression but doesn't predict adverse events.
        
        There are so many other factors involved.
      - Psoriasis
        
        Treated by Methotrexate
        
        5% toxicity in withdrawal
        
        IL36RN mutations is different psoriasis
      - Melanoma
        
        Half of melanomas have an active mutation in the BRAF gene.
        
        Drug pathway took 2 years through speciftiy
        
        There is a specific change in melanoma tumours which can be used to identify specific drug targets and have a greater effect.
      - Statins
        
        High risk of muscular myopathy (20%)
        
        2008 GWAS found significant SNP with only 85 cases.
        
        SLCO1B1 - regulates hepatic uptake of statins
        
        Odds ratio of myopathy is 16.9
      - Carbamazepine in Hong Kong
        
        Can induce TEN in 15% Asians
        
        2011 HK introduced mandatory screening for HLA0B*15:02
        
        BUT prescriptions just dropped and gave different drugs which gave lower ADRs but then increased ADRs for other drugs.
- - - - MAGMA Software:
        
        SNP analysis - Take GWAS for single SNPs
        
        Gene-based analysis - SNP-set analysis with gene as unit
        
        Gene-set analysis - SNP-set analysis with sets of genes as unit of analysis
        
        Targeted gene-set pathways
        
        All known gene-set pathways
        
        INDIRECT: map drug features on to network of known genes.
        DIRECT: set of genes with proteins with known effect of drugs.
        HOW MUCH MORE EFFECTIVE IS ONE GENE SET THAN ANOTHER? Rank them.
- - - - ENCODE:
        
        Identify everything involved in:
        
        transcription
        
        transcription factor association
        
        where they bind
        
        chromatin structure
        
        histone modification
        
        how they've been modified
        
        80% of human genome is associated with at least one biochem function
        
        COMPUTATIONAL METHODS:
        
        Protein coding genes start with a methionine codon
        
        Continue in frame (3 bp codons)
        
        End with a stop codon
        
        Interupted by splice sites (which have a conserved sequence)
        
        In theory it is relatively straightforward to computationally identify regions of the genome that are consistent with gene models
        
        BUT Low accuracy, since complexity from splicing leads to many false positives
        
        In practice vertebrate gene annotation relies on computational alignment of transcriptome data to genome sequence to define genes.
        
        High accuracy, but false negatives since transcriptomics data incomplete and can be limiting. - could be missing certain transcripts in certain cell types at certain times e.g. foetal growth.
        
        Gene models can be checked manually and through experimental validation to improve accuracy
        
        FOUND 20K coding genes, 16K non-coding genes.
        Functional annotation of the human genome
        
        .
        
        Using databases like encode to find a suggested functional SNP from the lead SNP
        
        Testable hypothesis of the biological mechanism underlying the observed association (knock-out)
        
        OTHER FUNCTIONAL ANNOTATIONS
        
        DNAasel hypersensitivity sites:
        
        Open chromatin is transcriptionally active and sensitive to DNAsel
        
        Such regions can be assess through enzyme digestion with DNAasel to sequence the resulting fragments
        
        ATAC-sequencing is starting to replace this though.
        
        DNA Methylation
        
        Addition of methyl group to the 5' of cytosine nucleotides
        
        Broadly asssociated with transcriptional silencing at promotors and transcriptional activity within genes
        
        Assessed by reduced representation bisulfite sequencing (RRBS) in the encode project
        
        Chromosome interactions
        
        Long range interactions between distant chromosomal regions
        
        ChIP Sequencing
        
        Uses chromatin immunoprecipitation and parallel sequencing to locate genome-wide protein-DNA binding events.
        
        Proteins touching DNA are fixed in place with a cross-linking agent.
        
        DNA is fragmented and complexes are harvested with targetted antibodies.
        
        Cross-links are broken and only DNA fragments from binding sites remain which are sent for sequencing.
        
        Mapping of the sequence reads back to the genome and defines loci where the antibody targeted protein is bound.
        
        Explains the functional role.
        
        RNA-Sequencing - Transcriptomics
        
        2nd generation sequencing to catalog RNA in cell
        
        The reads are re-alligned
        
        Number of reads for each exon is proportional to the number of copies in each cell
      - DIRECT MEASUREMENT
        
        Identify RNA transcripts
        
        Identify regions of the genome that bind proteins
        
        Identify regions of the genome that do not bind proteins
        
        INTERSPECIES SEQUENCE COMPARISON
        
        Many functional regions of the genome are likely to be conserved across evolution
        
        Comparisons of genomic sequences will identify such regions
        
        COMPUTATIONAL APPROACHES
        
        Functional regions may contain sequence motifs that define their function
        
        Identification of such motifs can inform this
        
        EXPERIMENTAL -> TRANSCRITION APPROACHES
- - - - Talmud 2010 - using genetics with non-genetic risk factors to predict diabetes
        
        Used the Framingham risk scores (odd that the risk score doesn't include physical activity)
        
        Used 20 genetic risk factors (0-40 risk alleles)
        
        COHORT STUDY
        
        5.5K people, 303 developed T2D in 10 years
        
        use baseline info and genetics
        
        Without adjusting for weighting of SNP - FOUND NO SIGNIFICANT RISK
        
        BUT:
        
        SNPs only adjust account for 10% BUT MZ IS 70%
        
        SNPs might not be the causal variants
        
        Need larger sample sizes
        
        Rare variants not yet tested
        
        Resilience not tested
        
        Physical activity not accounted for
        
        GENE-ENVIRONMENT NOT INCLUDED
    - - 77 Variants for polygenic breast cancer (NOT ABOUT THE MONOGENIC FORM BRAC1)
        
        Bottom quintile have lifetime risk of 5.2%
        
        Top quintile have lifetime16.6% risk
        
        Top 1% have 30% risk
        
        Screening:
        
        Women aged 47-73 every three years
        
        Aimed for when risk reaches over 2.5%
        
        There are often follow-up tests for benign biopsies
        
        Women at high risk from family history have a separate programme
        BUT
        
        Using genetic screening could help find the people at higher risk earlier and reduce wasted time and cost for those at lower risk
    - - Khera et al 2016
        
        52 Variants for CHD
        AND
        
        Healthy lifestyle score (smoking / PA / diet)
        .
        Those with low genetic risk + low environmental risk = 1, those with high for both had 3.5 BUT NO INTERECTION EFFECT - CUMMULATIVE