Commit ee64e75

PRS,TWAS
1 parent edb2e68 commit ee64e75

21 files changed

+804
-65
lines changed

categories/genetics/index.html

Lines changed: 16 additions & 2 deletions
@@ -63,7 +63,14 @@
 <h2 class="archive-title">2025</h2>
 
 <article class="archive-item">
-<a href="/post/gwas/" class="archive-item-link">Genome-wide association studies</a>
+<a href="/post/twas/" class="archive-item-link">Transcriptome-wide association studies</a>
+<span class="archive-item-date">
+2025-02-16
+</span>
+</article>
+
+<article class="archive-item">
+<a href="/post/prs/" class="archive-item-link">Polygenic Risk Score</a>
 <span class="archive-item-date">
 2025-02-16
 </span>
@@ -72,7 +79,14 @@ <h2 class="archive-title">2025</h2>
 <article class="archive-item">
 <a href="/post/ldsc/" class="archive-item-link">Linkage Disequilibrium Score Regression</a>
 <span class="archive-item-date">
-2025-01-05
+2025-02-16
+</span>
+</article>
+
+<article class="archive-item">
+<a href="/post/gwas/" class="archive-item-link">Genome-wide association studies</a>
+<span class="archive-item-date">
+2025-02-16
 </span>
 </article>
 

categories/genetics/index.xml

Lines changed: 21 additions & 7 deletions
@@ -6,21 +6,35 @@
 <description>Recent content in Genetics on A Hugo website</description>
 <generator>Hugo</generator>
 <language>en-US</language>
-<lastBuildDate>Sun, 16 Feb 2025 00:00:00 +0000</lastBuildDate>
+<lastBuildDate>Sun, 16 Feb 2025 15:19:26 -0900</lastBuildDate>
 <atom:link href="/categories/genetics/index.xml" rel="self" type="application/rss+xml" />
 <item>
-<title>Genome-wide association studies</title>
-<link>/post/gwas/</link>
-<pubDate>Sun, 16 Feb 2025 00:00:00 +0000</pubDate>
-<guid>/post/gwas/</guid>
-<description>&lt;h2 id=&#34;variants-trait-association&#34;&gt;Variants-trait association&lt;/h2&gt;&#xA;&lt;p&gt;The core objective of genetic studies is to identify which genetic variants contribute to disease risk. While establishing direct causation is challenging, we can detect statistical associations between genetic variants and traits by analyzing large-scale genomic data.&lt;/p&gt;&#xA;&lt;p&gt;Large biobanks (such as the UK Biobank) have collected genomic data from hundreds of thousands of samples. To study variant-trait associations, one approach is to apply linear or logistic regression for each genetic variant, treating genotypes as independent variables and the trait as the dependent variable.&lt;/p&gt;</description>
+<title>Transcriptome-wide association studies</title>
+<link>/post/twas/</link>
+<pubDate>Sun, 16 Feb 2025 15:19:26 -0900</pubDate>
+<guid>/post/twas/</guid>
+<description>&lt;h2 id=&#34;instrument-variable--twas&#34;&gt;Instrument variable &amp;amp; TWAS&lt;/h2&gt;&#xA;&lt;p&gt;Transcriptome-wide association studies (TWAS) aim to identify associations between gene expression and traits of interest. In an ideal world where we have both RNA-seq and trait data for tens of thousands of individuals, performing a TWAS analysis would be straightforward: simply regress the trait on gene expression. However, GTEx, the largest collection of expression data, has only ~700 RNA-seq samples, and it does not include trait values. This limitation precludes a direct association test between expression and traits.&lt;/p&gt;</description>
+</item>
+<item>
+<title>Polygenic Risk Score</title>
+<link>/post/prs/</link>
+<pubDate>Sun, 16 Feb 2025 15:19:26 -0600</pubDate>
+<guid>/post/prs/</guid>
+<description>&lt;h2 id=&#34;bayesian-regression-method-for-polygenic-score&#34;&gt;Bayesian regression method for polygenic score&lt;/h2&gt;&#xA;&lt;p&gt;Polygenic score (PRS) investigates the genetic liability of certain diseases. Given the training data, we might compute the polygenic score as &lt;code&gt;\(PRS_i = \sum_{j = 1}^{M} \hat \beta_j G_{ij}\)&lt;/code&gt; for the testing cohort. Most of the PRS methods paper, such as &lt;a href=&#34;https://www.nature.com/articles/s41467-019-09718-5&#34;&gt;PRS-CS&lt;/a&gt;, &lt;a href=&#34;https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4596916/&#34;&gt;LDPred&lt;/a&gt; aim to recover causal effects &lt;code&gt;\(\lambda\)&lt;/code&gt; from the observed marginal effect size estimates &lt;code&gt;\(\hat \beta_j\)&lt;/code&gt;. Here let&amp;rsquo;s consider a infinitesimal model (LDpred-inf).&lt;/p&gt;&#xA;&lt;p&gt;We assume the causal effect size &lt;code&gt;\(\lambda \sim MVN(0, \frac{h^2}{M}I)\)&lt;/code&gt; (called infinitesimal model). From &lt;a href=&#34;/post/gwas/&#34;&gt;this post&lt;/a&gt;, we also have &lt;code&gt;\(\hat \beta | \lambda \sim MVN(R \lambda, \frac{1 - h^2}{N}R)\)&lt;/code&gt;. The Bayesian inference recipe with conjugate prior normal distribution gives us (according to this &lt;a href=&#34;https://gregorygundersen.com/blog/2020/11/18/bayesian-mvn/&#34;&gt;document&lt;/a&gt; ):&#xA;$$&#xA;&lt;code&gt;\begin{split} p(\lambda \mid \hat \beta) &amp;amp;\propto f(\hat \beta \mid \lambda) \cdot f(\lambda) \\ &amp;amp;\propto \exp \{ - \frac{1}{2} (\hat \beta - R\lambda )^T (\frac{1 - h^2}{N}R)^{-1} (\hat \beta - R\lambda ) \} \cdot exp \{ - \frac{1}{2}\lambda^T (\frac{h^2}{M})^{-1} \lambda \} \\ &amp;amp;\propto \exp\{ - \frac{1}{2}[\frac{N}{1 - h^2}\cdot (\hat \beta - R\lambda )^T R^{-1}(\hat \beta - R\lambda ) +\frac{M}{h^2} \lambda^T \lambda ] \} \\ &amp;amp;\propto \exp \{- \frac{1}{2} [\frac{N}{1 - h^2} \cdot (\hat \beta^T R^{-1} \hat \beta - \hat \beta^T R^{-1} R \lambda -\lambda^T R R^{-1}\hat \beta + \lambda^T RR^{-1}R\lambda) + \frac{M}{h^2} \lambda^T \lambda] \} \\ &amp;amp;\propto \exp \{- \frac{1}{2} [\lambda^T(\frac{N}{1 - h^2}R + \frac{M}{h^2} I)\lambda - 2 \frac{N}{1 - h^2} \hat \beta^T \lambda ] \} \end{split}&lt;/code&gt;&#xA;$$&lt;/p&gt;</description>
 </item>
 <item>
 <title>Linkage Disequilibrium Score Regression</title>
 <link>/post/ldsc/</link>
-<pubDate>Sun, 05 Jan 2025 00:00:00 +0000</pubDate>
+<pubDate>Sun, 16 Feb 2025 15:19:26 -0400</pubDate>
 <guid>/post/ldsc/</guid>
 <description>&lt;h2 id=&#34;ldsc-derivation&#34;&gt;LDSC derivation&lt;/h2&gt;&#xA;&lt;p&gt;We discussed how to perform &lt;a href=&#34;/post/gwas/&#34;&gt;GWAS with scaled genotypes &amp;amp; phenotype&lt;/a&gt;. In this blog post, I present an important piece of result: Linkage Disequilibrium Score Regression (LDSC)&lt;/p&gt;&#xA;&lt;p&gt;LDSC was proposed in &lt;a href=&#34;https://www.nature.com/articles/ng.3211&#34;&gt;this&lt;/a&gt; landmark paper, in which it described how LD affect the probability of a variant being significant. Under infinitesimal model, LDSC states &lt;code&gt;\(\mathbb{E}[\chi_j^2] = \frac{Nh^2}{M} l_j + 1\)&lt;/code&gt;, where &lt;code&gt;\(l_j \equiv \sum_{k = 1}^M r_{jk}^2\)&lt;/code&gt; is the LD score. To carry out the derivation, one must treat the effect size as random: &lt;code&gt;\(\lambda_j \sim N(0, \frac{h^2}{M})\)&lt;/code&gt;.&lt;/p&gt;</description>
 </item>
+<item>
+<title>Genome-wide association studies</title>
+<link>/post/gwas/</link>
+<pubDate>Sun, 16 Feb 2025 15:19:26 -0300</pubDate>
+<guid>/post/gwas/</guid>
+<description>&lt;h2 id=&#34;variants-trait-association&#34;&gt;Variants-trait association&lt;/h2&gt;&#xA;&lt;p&gt;The core objective of genetic studies is to identify which genetic variants contribute to disease risk. While establishing direct causation is challenging, we can detect statistical associations between genetic variants and traits by analyzing large-scale genomic data.&lt;/p&gt;&#xA;&lt;p&gt;Large biobanks (such as the UK Biobank) have collected genomic data from hundreds of thousands of samples. To study variant-trait associations, one approach is to apply linear or logistic regression for each genetic variant, treating genotypes as independent variables and the trait as the dependent variable.&lt;/p&gt;</description>
+</item>
 </channel>
 </rss>

categories/index.xml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,12 +6,12 @@
66
<description>Recent content in Categories on A Hugo website</description>
77
<generator>Hugo</generator>
88
<language>en-US</language>
9-
<lastBuildDate>Sun, 16 Feb 2025 00:00:00 +0000</lastBuildDate>
9+
<lastBuildDate>Sun, 16 Feb 2025 15:19:26 -0900</lastBuildDate>
1010
<atom:link href="/categories/index.xml" rel="self" type="application/rss+xml" />
1111
<item>
1212
<title>Genetics</title>
1313
<link>/categories/genetics/</link>
14-
<pubDate>Sun, 16 Feb 2025 00:00:00 +0000</pubDate>
14+
<pubDate>Sun, 16 Feb 2025 15:19:26 -0900</pubDate>
1515
<guid>/categories/genetics/</guid>
1616
<description></description>
1717
</item>

category/index.html

Lines changed: 10 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,16 @@ <h1 class="article-title">Category</h1>
6262

6363
<div class="article-content">
6464

65-
<h3 id="linear-algebra">Linear algebra</h3>
65+
<h3 id="genetics">Genetics</h3>
66+
<ul>
67+
<li><a href="/post/gwas/">Genome-wide association studies</a></li>
68+
<li><a href="/post/ldsc/">Linkage Disequilibrium Score Regression</a></li>
69+
<li><a href="/post/prs/">Polygenic Risk Score</a></li>
70+
<li><a href="/post/twas/">Transcriptome-wide association studies</a></li>
71+
</ul>
72+
<p> 
73+
 </p>
74+
<h3 id="linear-algebra">Linear algebra</h3>
6675
<ul>
6776
<li><a href="/post/pca1/">Calculate PCA by hand (via eigen-decomposition)</a></li>
6877
<li><a href="/post/svd/">Calculate SVD by hand (and decompose Spongebob)</a></li>
@@ -103,13 +112,6 @@ <h3 id="hidden-markov-model">Hidden Markov Model</h3>
103112
<h3 id="deep-learning">Deep learning</h3>
104113
<p> 
105114
 </p>
106-
<h3 id="genetics">Genetics</h3>
107-
<ul>
108-
<li><a href="/post/gwas/">Genome-wide association studies</a></li>
109-
<li><a href="/post/ldsc/">Linkage Disequilibrium Score Regression</a></li>
110-
</ul>
111-
<p> 
112-
 </p>
113115
<h3 id="rna-seq">RNA-seq</h3>
114116
<ul>
115117
<li><a href="/post/gene_exp1/">Model the Gene Expression (1): A GLM framework</a></li>

index.html

Lines changed: 16 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,14 @@
6666
<h2 class="archive-title">2025</h2>
6767

6868
<article class="archive-item">
69-
<a href="/post/gwas/" class="archive-item-link">Genome-wide association studies</a>
69+
<a href="/post/twas/" class="archive-item-link">Transcriptome-wide association studies</a>
70+
<span class="archive-item-date">
71+
2025-02-16
72+
</span>
73+
</article>
74+
75+
<article class="archive-item">
76+
<a href="/post/prs/" class="archive-item-link">Polygenic Risk Score</a>
7077
<span class="archive-item-date">
7178
2025-02-16
7279
</span>
@@ -75,7 +82,14 @@ <h2 class="archive-title">2025</h2>
7582
<article class="archive-item">
7683
<a href="/post/ldsc/" class="archive-item-link">Linkage Disequilibrium Score Regression</a>
7784
<span class="archive-item-date">
78-
2025-01-05
85+
2025-02-16
86+
</span>
87+
</article>
88+
89+
<article class="archive-item">
90+
<a href="/post/gwas/" class="archive-item-link">Genome-wide association studies</a>
91+
<span class="archive-item-date">
92+
2025-02-16
7993
</span>
8094
</article>
8195
