Chapter 58: genetics
58.1 term
- chromosome
- locus (sl.) loci (pl.)
- marker
- allele
- haplotype
- genotype
- phenotype / trait
- endophenotype
58.2 marker
- large marker: STRP = short tandem repeat polymorphism, STRs = short tandem repeats
- linkage analysis
- paternity testing
- taxonomy
- small marker: SNP = single-nucleotide polymorphism
- association analysis
- disease diagnosis
- pharmacodynamics? drug design / pharmacogenomics = custom drug
- RFLP = restriction fragment length polymorphism
- platform
- customized: MALDI-TOF MS(= mass spectrometry)
- whole-genome gene chip
- Affymetrix
- Taiwan BioBank: TWB2.0
- Illumina
- Affymetrix
- NGS = next-generation sequencing
58.3 genome project
- HGP = Human Genome Project
- 20~25K genes
- 3 billion bps(base pairs)
- ELSI = ethical, legal, and social issues
- 99.9% bps are exactly the same in all people
- germline mutation
- male : female = 2 : 1
- HapMap = International HapMap Project
- SNP, haplotype, tag SNP
- haplotypes are combination of SNPs
- tag SNPs can identify unique haplotypes
- HapMap 1 & 2
- between-ancestry SNP: 1 common SNP per 5 Kb to 1 common SNP per 1 Kb
- African
- European
- East Asian
- Han Chinese
- Japanese
- between-ancestry SNP: 1 common SNP per 5 Kb to 1 common SNP per 1 Kb
- HapMap 3: more ancestries
- SNP, haplotype, tag SNP
- 1000 Genomes Project
- NGS
- identify 95% genetic variants with frequencies at least 1%
- final phase 77M SNPs
- browser
- Ensembl GRCh37
- Ensembl GRCh38 http://asia.ensembl.org/Homo_sapiens/Info/Index
- TWB = Taiwan BioBank
- browser: TaiwanView https://taiwanview.twbiobank.org.tw/index
- pricing: https://www.biobank.org.tw/about_price.php
- TPMI = Taiwan Precision Medicine Initiative https://tpmi.ibms.sinica.edu.tw/www/precision-medicine/
58.4 linkage analysis
- Mendel 1^{st} & 2^{nd} laws
- law of segregation ~ 3 : 1
- law of assortment ~ 9 : 3 : 3 : 1
- phenotypic model by R.A. Fisher
- P = G + E
- G is the genotypic component
- E is the environmental component
- P = G + E + G \cdot E
- G \cdot E is the interaction between the genotypic component and environmental component
- P = G + E
- linkage = co-segregation = cosegregation
- \theta = recombination fraction: 1% recombination = 1 crossover per 100 meioses = 1 cM(centiMorgan) on genetic map
- statistical hypothesis testing for in linkage mapping
- statistical hypothesis testing for categorical trait in linkage mapping: PLA = parametric linkage analysis
- H_0: no linkage \theta = 0.5
- H_1: linkage \theta < 0.5
- statistical hypothesis testing for quantitative trait in linkage mapping: VCLA = variance component linkage analysis
- H_0: no linkage \sigma^2_{q} = 0
- H_1: linkage \sigma^2_{q} > 0
- statistical hypothesis testing for categorical trait in linkage mapping: PLA = parametric linkage analysis
- study design
- case control
- trio
- affected / discordant sib-pair
- extended pedigree
- data format: linkage format
- family-based
- PID = pedigree Id
- IID = individual Id
- FID = father Id
- MID = mother Id
- gender
- affection
- marker
- M1 = marker 1
- M2 = marker 2
- …
- family-based
- single major locus model
- a two-allele A and a locus influences a dichotomous trait
- allele frequency
- p = \mathrm{P}\left(A\right)
- q = \mathrm{P}\left(a\right) = 1-p
- penetrance
- f_{AA} = \mathrm{P}\left(\text{affected}\mid AA\right)
- f_{Aa} = \mathrm{P}\left(\text{affected}\mid Aa\right) = f_{aA}
- f_{aa} = \mathrm{P}\left(\text{affected}\mid aa\right)
- disease mode of inheritance
- dominant model
- \begin{cases}f_{AA} = 1 \\ f_{Aa} = 1 \\ f_{aa} = 0\end{cases}
- recessive model
- \begin{cases}f_{AA} = 0 \\ f_{Aa} = 0 \\ f_{aa} = 1\end{cases}
- additive model
- \begin{cases}f_{AA} = 1 \\ f_{Aa} = \frac{1}{2} \\ f_{aa} = 0\end{cases}
- phenocopy model
- f_{aa} > 0 perhaps due to environmental cause
- liability model
- e.g. f_{AA},f_{Aa},f_{aa} are age-dependent
- dominant model
58.5 association analysis
- LD = linkage disequilibrium
- genotype & allele frequency
- diallelic marker
- p_{AA}
- p_{Aa}
- p_{aA}
- p_{aa}
- p_{A}
- p_{a}
- diallelic marker
- HWC = Hardy-Weinberg condition
- HWE = Hardy-Weinberg equilibrium
- HWD = Hardy-Weinberg disequilibrium
- gametic or haplotype frequency
- LE = linkage equilibrium
- LD = linkage disequilibrium