定性性狀的連鎖研究_第1頁(yè)
定性性狀的連鎖研究_第2頁(yè)
定性性狀的連鎖研究_第3頁(yè)
定性性狀的連鎖研究_第4頁(yè)
定性性狀的連鎖研究_第5頁(yè)
已閱讀5頁(yè),還剩78頁(yè)未讀 繼續(xù)免費(fèi)閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)

文檔簡(jiǎn)介

1、一、一、outline IBD/IBS介紹介紹 受累同胞對(duì)分析受累同胞對(duì)分析(Affected Sib-pair) 受累親屬對(duì)分析受累親屬對(duì)分析(Affected relative-pair)一、一、outline IBD/IBS介紹介紹 受累同胞對(duì)分析受累同胞對(duì)分析(Affected Sib-pair) 受累親屬對(duì)分析受累親屬對(duì)分析(Affected relative-pair)二、二、IBD/IBS介紹介紹 IBD (identical by descent):來源同一,指來源同一,指的是子代中共有的一段的是子代中共有的一段DNA區(qū)域或共有的區(qū)域或共有的等等位基因位基因來源于一個(gè)共同的祖先

2、。來源于一個(gè)共同的祖先。 An allele is shared by two family members (e.g. siblings, uncle-niece). It can be elucidated that this allele was transmitted from a common ancestor.二、二、IBD/IBS介紹介紹 IBS (identical by state):狀態(tài)同一,只考慮狀態(tài)同一,只考慮家系成員之間遺傳標(biāo)記或等位基因的相似性,家系成員之間遺傳標(biāo)記或等位基因的相似性,而不管其是否來源于一個(gè)共同的祖先,也不需而不管其是否來源于一個(gè)共同的祖先,也不需進(jìn)

3、行親代的等位基因分型。進(jìn)行親代的等位基因分型。 An allele is shared by two family members (e.g. siblings, uncle-niece). Although the allele is of the same type , they may or may not share this allele from a common ancestral chromosome.二、二、IBD/IBS介紹介紹1 21 21 21 3IBD=2 IBS=2二、二、IBD/IBS介紹介紹1 21 21 21 1IBD=? IBS=2二、二、IBD/IBS介紹

4、介紹1/11/31/21/31/21/21/21/31/31/21/21/3IBS二、二、IBD/IBS介紹介紹1 21 21 21 2二、二、IBD/IBS介紹介紹1 21 21 21 21 21 21 21 21 21 21 21 21 21 21 21 2二、二、IBD/IBS介紹缺失數(shù)據(jù)介紹缺失數(shù)據(jù)1/22/32/22/3二、二、IBD/IBS介紹缺失數(shù)據(jù)介紹缺失數(shù)據(jù)1/32/32/31/22/31/32/31/?2/32/31/31/3P(0) = 0, P(1) = 1, P(2) = 0P(0) = 1, P(1) = 0, P(2) = 0二、二、IBD/IBS介紹缺失數(shù)據(jù)介紹

5、缺失數(shù)據(jù) 對(duì)于上面的群體,等位對(duì)于上面的群體,等位2的頻率為的頻率為0.25,等位,等位3的頻率為的頻率為0.5。根據(jù)。根據(jù)2、3的等位頻率可以計(jì)的等位頻率可以計(jì)算出兩種可能的基因型的相對(duì)頻率算出兩種可能的基因型的相對(duì)頻率: P(父親基因型為父親基因型為1/2)=0.25/(0.5+0.25)=0.333 P(父親基因型為父親基因型為1/3)=0.5/(0.5+0.25)=0.667二、二、IBD/IBS介紹缺失數(shù)據(jù)介紹缺失數(shù)據(jù) P(父親基因型為父親基因型為1/2)=0.25/(0.5+0.25)=0.333 P(父親基因型為父親基因型為1/3)=0.5/(0.5+0.25)=0.667 計(jì)算

6、出計(jì)算出IBD=0,1,2概率:概率: P(IBD=0)=0.333*0+0.667*1=0.667 P(IBD=1)=0.333*1+0.667*0=0.333 P(IBD=2)=0.333*0+0.667*0=0二、二、IBD/IBS介紹介紹-擴(kuò)展家系擴(kuò)展家系叔叔侄女IBS=1IBD1一、一、outline IBD/IBS介紹介紹 受累同胞對(duì)分析受累同胞對(duì)分析(Affected Sib-pair) 受累親屬對(duì)分析受累親屬對(duì)分析(Affected relative-pair)三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 受累同胞對(duì)分析受累同胞對(duì)分析(Affected Sib-Pair, AS

7、P):通過確定的父母和受累(非受累)后代通過確定的父母和受累(非受累)后代的基因型數(shù)據(jù)能夠確定多個(gè)受累個(gè)體的的基因型數(shù)據(jù)能夠確定多個(gè)受累個(gè)體的同胞關(guān)系,只有孩子具有表型。同胞關(guān)系,只有孩子具有表型。 當(dāng)父母的表型有缺失時(shí),可以使用非受當(dāng)父母的表型有缺失時(shí),可以使用非受累后代與受累的后代在同胞對(duì)之間比較累后代與受累的后代在同胞對(duì)之間比較共享的等位。共享的等位。三、受累同胞對(duì)三、受累同胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析 ASP分析的三種抽樣方法:分析的三種抽樣方法:1.同胞對(duì)都患病;同胞對(duì)都患病;2.同胞對(duì)一個(gè)患病,一個(gè)正常;同胞對(duì)一個(gè)患病,一個(gè)正常;3.同胞對(duì)都正常同胞對(duì)都正常三、受累同胞對(duì)三、受累

8、同胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析 觀察兩個(gè)受累同胞對(duì)標(biāo)記等位觀察兩個(gè)受累同胞對(duì)標(biāo)記等位 的遺傳特性。如果疾病與標(biāo)記的遺傳特性。如果疾病與標(biāo)記 有關(guān),那么在同胞之間有相似有關(guān),那么在同胞之間有相似 的基因型。的基因型。 利用同胞對(duì)之間共享的利用同胞對(duì)之間共享的IBD的數(shù)目來衡量的數(shù)目來衡量基因型的相似性。基因型的相似性。1/32/31/22/3三、受累同胞對(duì)三、受累同胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析 假設(shè)父代在標(biāo)記位點(diǎn)的基因型分別為假設(shè)父代在標(biāo)記位點(diǎn)的基因型分別為 (g1,g2) 和和 (g3,g4).假定假定sib1的基因型為的基因型為(g1,g3). Pr(sib2 has allele g1

9、) = Pr(sib2 has allele g2) = 0.5 Pr(sib2 has allele g3) = Pr(sib2 has allele g4) = 0.5 Pr(sib2 has (g1,g3) = Pr(IBD=2) = 0.52 = 0.25 Pr(sib2 has (g2,g4) = Pr(IBD=0) = 0.52 = 0.25 Pr(sib2 has (g1,g4) = Pr(sib2 has (g2,g3) = 0.52 = 0.25 Pr(IBD=1) = Pr(sib2 has (g1,g4) + Pr(sib2 has (g2,g3) = 0.5 根據(jù)孟德爾

10、分離法則同胞對(duì)共享情況如下:根據(jù)孟德爾分離法則同胞對(duì)共享情況如下: Both marker alleles 0.25 One marker allele 0.5 No marker allele 0.25三、受累同胞對(duì)三、受累同胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析 The average number of shared alleles is:(0.25 * 2) + (0.5 * 1) + (0.25 * 0) = 1However, if the marker is linked to the disease locus the overall amount of sharing between

11、 affected sibs will be increased !三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 如果疾病與標(biāo)記沒有連鎖,那么共享如果疾病與標(biāo)記沒有連鎖,那么共享IBD數(shù)目數(shù)目分別為分別為0,1,2的親屬對(duì)的比例為的親屬對(duì)的比例為1:2:1。 父親或者母親或者將父親或者母親或者將相同的等位相同的等位(共享(共享IBD)或者將或者將不同的等位不同的等位(不共享(不共享IBD)傳遞給兩個(gè))傳遞給兩個(gè)后代。如果疾病與標(biāo)記沒有連鎖,傳遞相同等后代。如果疾病與標(biāo)記沒有連鎖,傳遞相同等位給后代的父親或者母親的比例為位給后代的父親或者母親的比例為50,如果,如果有連鎖,那么此比例高于期望的有

12、連鎖,那么此比例高于期望的50。 如上圖,一個(gè)傳遞共享如上圖,一個(gè)傳遞共享IBD(“3”),一個(gè)傳),一個(gè)傳遞不共享遞不共享IBD(“2” or“1”)。)。1/32/31/22/3 Main Idea:如果一個(gè)標(biāo)記與疾病位點(diǎn)連如果一個(gè)標(biāo)記與疾病位點(diǎn)連鎖鎖患病的同胞對(duì)比隨機(jī)情況患病的同胞對(duì)比隨機(jī)情況(無連鎖無連鎖)更傾向于獲得相同的標(biāo)記等位。更傾向于獲得相同的標(biāo)記等位。三、受累同胞對(duì)三、受累同胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析Disease LocusClose to disease locus (0.5):affected sibs share on average more than 1 al

13、lele IBD if the genetic component is strong.Far away (=0.5):affected sibs share on average 1 marker allele IBD. IBD sharing probabilities: 0.25, 0.5, 0.25. IBS sharing probabilities: 依賴于標(biāo)記位點(diǎn)的依賴于標(biāo)記位點(diǎn)的等位頻率,一個(gè)常見(等位頻率,一個(gè)常見(common)的等位比一個(gè))的等位比一個(gè)稀少(稀少(rare)的等位在同胞對(duì)之間更可能是共享)的等位在同胞對(duì)之間更可能是共享的的IBS。 由于由于IBS沒有用到父

14、母的信息所以比沒有用到父母的信息所以比IBD分析的分析的檢驗(yàn)效能要低,而且如果等位頻率錯(cuò)誤可能會(huì)產(chǎn)檢驗(yàn)效能要低,而且如果等位頻率錯(cuò)誤可能會(huì)產(chǎn)生偏倚。生偏倚。三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 假設(shè)一個(gè)同胞對(duì)和一些位點(diǎn),父親或者母親向假設(shè)一個(gè)同胞對(duì)和一些位點(diǎn),父親或者母親向下傳遞相同等位(共享下傳遞相同等位(共享IBD)或者不同等位)或者不同等位(不共享(不共享IBD)給同胞對(duì)的概率是)給同胞對(duì)的概率是0.5。 N表示同胞對(duì)之間共享表示同胞對(duì)之間共享IBD的數(shù)目,可以將的數(shù)目,可以將N看作是兩次試驗(yàn)中成功的次數(shù),這里的成功是看作是兩次試驗(yàn)中成功的次數(shù),這里的成功是指父親或者母親將相同的

15、等位傳遞給同胞對(duì)。指父親或者母親將相同的等位傳遞給同胞對(duì)。 成功的概率為成功的概率為0.5。三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析)5 . 0 , 2(BinN nkqpknkNPknk, 1 ,0 ,)(.25.05.0)2(,5.0)5.01(5.012)1(,25.0)5.01()0(22NPNPNP 如果給定的是疾病位點(diǎn),受累同胞對(duì)共如果給定的是疾病位點(diǎn),受累同胞對(duì)共享享IBD的概率,相當(dāng)于計(jì)算條件概率:的概率,相當(dāng)于計(jì)算條件概率: z0 = P(N=0|ASP) z1 = P(N=1|ASP) z2 = P(N=2|ASP) 此條件概率依賴于疾病的遺傳模式此條件概率依賴于疾病的

16、遺傳模式三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 隱性疾病:隱性疾病: f0=f1=0, f2=1 顯性疾病:顯性疾病:f0= 0, f1= f2=1三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 父母在疾病位點(diǎn)的可能基因型為:父母在疾病位點(diǎn)的可能基因型為:aa,aa, Aa,aa,.AA,aa,aaaaaa Aaaa AAAaXAaAa AaAa AAAAAAAA AaAA AA 滿足哈代溫伯格平衡下相應(yīng)的基因型頻滿足哈代溫伯格平衡下相應(yīng)的基因型頻率為:率為: 43223223223422222p242pq2q 22pqpqqpqpqppqpppqxpqqq三、受累同胞對(duì)數(shù)據(jù)分析三、受累同

17、胞對(duì)數(shù)據(jù)分析P( N=k): k = 0, 1, 2 ?P( N=k): k = 0, 1, 2 ?P(Ci)Mating Typeq4aa,aaC14pq3Aa,aaC24p2q2Aa,AaC32p2q2AA,aaC44p3qAA,AaC5p4AA,AAC625. 0)0()()0()0|()|0( IBDPASPPIBDPIBDASPPASPNP6441(|0)(|0)|) ()1*iiiP ASP IBDPASP IBDC P CppAssume a genetic model of a recessive disease with full penetrance: f =(0,0,1)

18、.Explanation: both affected sibs must have 2 disease alleles and theseMust be of different parental origin 三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析P( N=k): k = 0, 1, 2 ?P(Ci)Mating Typeq4aa,aaC14pq3Aa,aaC24p2q2Aa,AaC32p2q2AA,aaC44p3qAA,AaC5p4AA,AAC62222344)1(25. 025. 0*)()0()0|()|0(ppqpqpppASPPIBDPIBDASPPASPNP 223443

19、222222346125. 0*14*)5 . 0(2*04*)25. 0(4*0*0)()|()(qpqpppqpqpqppqqCPCASPPASPPiii In a similar way:22)1(1)|2( ,)1(2)|1(pASPNPppASPNP P = 0.001P(N=0|ASP) = 0.000P(N=1|ASP) = 0.002P(N=2|ASP) = 0.998三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 隱性疾病:隱性疾病: z0=z1=0, z2=1 顯性疾病:顯性疾病:z0= 0, z1= z2=1 標(biāo)記位點(diǎn)與疾病無關(guān):標(biāo)記位點(diǎn)與疾病無關(guān): z0=0.25,z1=

20、0.5, z2=0.25三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 根據(jù)上述兩個(gè)假設(shè),提出關(guān)于受累同胞根據(jù)上述兩個(gè)假設(shè),提出關(guān)于受累同胞對(duì)分析的三種檢驗(yàn)方法:對(duì)分析的三種檢驗(yàn)方法:1.擬和優(yōu)度檢驗(yàn):擬和優(yōu)度檢驗(yàn):IBD=0,1,2的卡方檢驗(yàn),的卡方檢驗(yàn),自由度為自由度為2;2.均值檢驗(yàn):受累同胞之間共享等位數(shù)均均值檢驗(yàn):受累同胞之間共享等位數(shù)均值與期望值值與期望值1相比較;相比較;3.比例檢驗(yàn):檢驗(yàn)比例檢驗(yàn):檢驗(yàn)IBD=2的數(shù)目與期望值的數(shù)目與期望值相比較相比較三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 根據(jù)上述兩個(gè)假設(shè),提出關(guān)于受累同胞根據(jù)上述

21、兩個(gè)假設(shè),提出關(guān)于受累同胞對(duì)分析的三種檢驗(yàn)方法:對(duì)分析的三種檢驗(yàn)方法:1.擬和優(yōu)度檢驗(yàn):擬和優(yōu)度檢驗(yàn):IBD=0,1,2的卡方檢驗(yàn),的卡方檢驗(yàn),自由度為自由度為2;2.均值檢驗(yàn):受累同胞之間共享等位數(shù)均均值檢驗(yàn):受累同胞之間共享等位數(shù)均值與期望值值與期望值1相比較;相比較;3.比例檢驗(yàn):檢驗(yàn)比例檢驗(yàn):檢驗(yàn)IBD=2的數(shù)目與期望值的數(shù)目與期望值相比較相比較三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 擬和優(yōu)度檢驗(yàn)擬和優(yōu)度檢驗(yàn)(goodness of fit) 設(shè)一個(gè)受累同胞對(duì)設(shè)一個(gè)受累同胞對(duì)(ASP)共享共享IBD=0,1,2的概率分別為的概率分別為z0,z1,z2 在疾病與標(biāo)記沒有連鎖的零假設(shè)

22、下在疾病與標(biāo)記沒有連鎖的零假設(shè)下z0=1/4,z1=1/2,z2=1/4 n=受累同胞對(duì)的總數(shù)受累同胞對(duì)的總數(shù) n0=共享共享IBD(0)的同胞對(duì)數(shù)目的同胞對(duì)數(shù)目 n1=共享共享IBD(1)的同胞對(duì)數(shù)目的同胞對(duì)數(shù)目 n2=共享共享IBD(2)的同胞對(duì)數(shù)目的同胞對(duì)數(shù)目三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 擬和優(yōu)度檢驗(yàn)擬和優(yōu)度檢驗(yàn)(goodness of fit) 2021iiiieenS三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 擬和優(yōu)度檢驗(yàn)擬和優(yōu)度檢驗(yàn)(goodness of fit) NNnNnNn2221202442244 三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 根據(jù)上述兩

23、個(gè)假設(shè),提出關(guān)于受累同胞根據(jù)上述兩個(gè)假設(shè),提出關(guān)于受累同胞對(duì)分析的三種檢驗(yàn)方法:對(duì)分析的三種檢驗(yàn)方法:1.擬和優(yōu)度檢驗(yàn):擬和優(yōu)度檢驗(yàn):IBD=0,1,2的卡方檢驗(yàn),的卡方檢驗(yàn),自由度為自由度為2;2.均值檢驗(yàn):受累同胞之間共享等位數(shù)均均值檢驗(yàn):受累同胞之間共享等位數(shù)均值與期望值值與期望值1相比較;相比較;3.比例檢驗(yàn):檢驗(yàn)比例檢驗(yàn):檢驗(yàn)IBD=2的數(shù)目與期望值的數(shù)目與期望值相比較相比較 均值檢驗(yàn):受累同胞之間共享等均值檢驗(yàn):受累同胞之間共享等位數(shù)均值與期望值位數(shù)均值與期望值1相比較相比較,無連無連鎖的零假設(shè)下共享等位的期望為鎖的零假設(shè)下共享等位的期望為50。三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)

24、數(shù)據(jù)分析 均值檢驗(yàn)均值檢驗(yàn)(Mean Test) 22212nnnnS 服從標(biāo)準(zhǔn)正態(tài)分布三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 根據(jù)上述兩個(gè)假設(shè),提出關(guān)于受累同胞根據(jù)上述兩個(gè)假設(shè),提出關(guān)于受累同胞對(duì)分析的三種檢驗(yàn)方法:對(duì)分析的三種檢驗(yàn)方法:1.擬和優(yōu)度檢驗(yàn):擬和優(yōu)度檢驗(yàn):IBD=0,1,2的卡方檢驗(yàn),的卡方檢驗(yàn),自由度為自由度為2;2.均值檢驗(yàn):受累同胞之間共享等位數(shù)均均值檢驗(yàn):受累同胞之間共享等位數(shù)均值與期望值值與期望值1相比較;相比較;3.比例檢驗(yàn):檢驗(yàn)比例檢驗(yàn):檢驗(yàn)IBD=2的數(shù)目與期望值的數(shù)目與期望值相比較相比較 共享共享IBD=2的同胞對(duì)數(shù)目與期望的的同胞對(duì)數(shù)目與期望的進(jìn)行比較

25、。進(jìn)行比較。三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 比例檢驗(yàn)比例檢驗(yàn)(Proportion Test)16/34/23nnnS 案例:研究案例:研究119個(gè)患有糖尿病的同胞對(duì),研究個(gè)患有糖尿病的同胞對(duì),研究人員已經(jīng)測(cè)定同胞對(duì)在候選位點(diǎn)人員已經(jīng)測(cè)定同胞對(duì)在候選位點(diǎn)FGF3的基因的基因型,研究問題:胰島素依賴性糖尿病型,研究問題:胰島素依賴性糖尿病(IDDM)與與FGF3是否存在連鎖?是否存在連鎖? 測(cè)定測(cè)定119個(gè)同胞對(duì)在個(gè)同胞對(duì)在FGF3候選位點(diǎn)的基因型之候選位點(diǎn)的基因型之后確定后確定20個(gè)同胞對(duì)共享個(gè)同胞對(duì)共享0個(gè)個(gè)IBD,59個(gè)同胞對(duì)個(gè)同胞對(duì)共享共享1個(gè)個(gè)IBD,40個(gè)同胞對(duì)共享個(gè)同

26、胞對(duì)共享2個(gè)個(gè)IBD。 通過檢驗(yàn)決定在通過檢驗(yàn)決定在IDDM與與FGF3之間是否存在之間是否存在顯著連鎖。顯著連鎖。三、受累同胞對(duì)數(shù)據(jù)分析三、受累同胞對(duì)數(shù)據(jù)分析 擬和優(yōu)度檢驗(yàn)擬和優(yōu)度檢驗(yàn)(goodness of fit)三、受累同胞對(duì)三、受累同胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析 在利用上面三種檢驗(yàn)統(tǒng)計(jì)量時(shí),需要知在利用上面三種檢驗(yàn)統(tǒng)計(jì)量時(shí),需要知道在標(biāo)記位點(diǎn)同源等位基因的道在標(biāo)記位點(diǎn)同源等位基因的IBD值。值。而在實(shí)際問題中,往往不能由已有的信而在實(shí)際問題中,往往不能由已有的信息得到同胞對(duì)的確切息得到同胞對(duì)的確切IBD值,所以需要值,所以需要用估計(jì)值來代替。用估計(jì)值來代替。三、受累同胞對(duì)三、受累同

27、胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析 如果受累同胞對(duì)其中的一個(gè)或者是他們的父如果受累同胞對(duì)其中的一個(gè)或者是他們的父母在特定位點(diǎn)的基因型缺失,可以估計(jì)此人母在特定位點(diǎn)的基因型缺失,可以估計(jì)此人在該位點(diǎn)可能基因型的概率值,根據(jù)概率值在該位點(diǎn)可能基因型的概率值,根據(jù)概率值計(jì)算計(jì)算IBD。20IBD) alleles i (iiPIBDP(i alleles IBD) :家系中所有可能的基因型共享:家系中所有可能的基因型共享i個(gè)個(gè)IBD的的概率和。概率和。 Siblings share 0,1, or 2 alleles IBD at a marker locus with probabilities p0

28、=0.25, p1=0.5, and p2=0.25, respectively under the null hypothesis of no linkage, but what is the corresponding IBD-distribution for father and son ?a.p0 = 0, p1 = 1, and p 2= 0;b.p0 = 0, p1 = 0.5, and p 2= 0.5;c.p0 = 0.25, p1 = 0.5, and p 2= 0.25;d.p0 = 0.5, p1 = 0.5, and p 2= 0;三、受累同胞對(duì)三、受累同胞對(duì)(ASP)

29、數(shù)據(jù)分析數(shù)據(jù)分析 優(yōu)點(diǎn):優(yōu)點(diǎn): 1.不依賴于遺傳模式;不依賴于遺傳模式; 2.計(jì)算相對(duì)簡(jiǎn)單;計(jì)算相對(duì)簡(jiǎn)單; 3.同胞對(duì)數(shù)據(jù)相對(duì)容易得到;同胞對(duì)數(shù)據(jù)相對(duì)容易得到; 缺點(diǎn):缺點(diǎn): 1.效能較低;效能較低; 2.未對(duì)重組率進(jìn)行估計(jì);未對(duì)重組率進(jìn)行估計(jì);三、受累同胞對(duì)三、受累同胞對(duì)(ASP)數(shù)據(jù)分析數(shù)據(jù)分析 Parametric Assume genetic model known Non-parametric No assumptions about the genetic model The parametric model is more powerful when the genetic mo

30、del is correctly specified. Problem size limitations Parametric large pedigrees, small number of markers Non-parametric small pedigrees, many markers一、一、outline IBD/IBS介紹介紹 受累同胞對(duì)分析受累同胞對(duì)分析(Affected Sib-pair) 受累親屬對(duì)分析受累親屬對(duì)分析(Affected relative-pair)四、患病親屬對(duì)四、患病親屬對(duì)(ARP)數(shù)據(jù)分析數(shù)據(jù)分析 如果某種疾病被某一位點(diǎn)所控制,因?yàn)槿绻撤N疾病被某一位

31、點(diǎn)所控制,因?yàn)橛H屬對(duì)比非親屬對(duì)更傾向于具有相同的親屬對(duì)比非親屬對(duì)更傾向于具有相同的基因,那么在給定親屬對(duì)一個(gè)成員患病基因,那么在給定親屬對(duì)一個(gè)成員患病的條件下,另一個(gè)成員也患病的條件概的條件下,另一個(gè)成員也患病的條件概率要比群體的流行率高。率要比群體的流行率高。 單個(gè)位點(diǎn)模式單個(gè)位點(diǎn)模式四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 jijiijiiiggaffPggfgtg|:i:的的外外顯顯率率基基因因型型的的等等位位頻頻率率;等等位位個(gè)個(gè)等等位位;一一個(gè)個(gè)位位點(diǎn)點(diǎn)的的第第四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 假設(shè)隨機(jī)從群體中抽取一個(gè)個(gè)體,假設(shè)隨機(jī)從群體中抽取一個(gè)個(gè)體,1表示表示患病

32、狀態(tài),患病狀態(tài),0表示正常。表示正常。 在哈代溫伯格平衡成立的假設(shè)下,群在哈代溫伯格平衡成立的假設(shè)下,群體的流行率為體的流行率為 ijjiijijjijiijjittfggGPggGYPggGYPYPK|1, 11四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 另一個(gè)親人患病的概率另一個(gè)親人患病的概率一個(gè)親人患病的條件下一個(gè)親人患病的條件下復(fù)發(fā)風(fēng)險(xiǎn)復(fù)發(fā)風(fēng)險(xiǎn),:1|112 YYPKR RKKYYKYYP*,cov1, 121221 率為:率為:一對(duì)親屬對(duì)都患病的概一對(duì)親屬對(duì)都患病的概 21,/1YYCovKKKR 四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 211,1:KYYCovKKRRR 的

33、比值的比值復(fù)發(fā)風(fēng)險(xiǎn)與群體流行率復(fù)發(fā)風(fēng)險(xiǎn)與群體流行率 顯性遺傳方差顯性遺傳方差:可加遺傳方差;:可加遺傳方差;位的概率;位的概率;:親屬對(duì)有兩個(gè)同源等:親屬對(duì)有兩個(gè)同源等親緣系數(shù);親緣系數(shù);:2,2221DADAVVVVYYCov 四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 DARVVK222/11 系系。是是第第三三代代親親緣緣關(guān)關(guān)緣緣關(guān)關(guān)系系,第第一一代代表表兄兄弟弟侄侄子子是是第第二二代代親親、祖祖父父孫孫子子、叔叔叔叔緣緣關(guān)關(guān)系系,半半兄兄弟弟例例如如:父父子子是是第第一一代代親親代代親親緣緣關(guān)關(guān)系系;第第:同同胞胞對(duì)對(duì);同同卵卵雙雙胞胞胎胎;iiSM: 22211412 IBDPIB

34、DPIBDP四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 AAADASDAMVKVKVKVVKSVVKM8111:3. 54111:2. 42111:1. 32411. 211. 123222122第一代表兄弟第一代表兄弟祖父孫子祖父孫子父子對(duì)父子對(duì):同胞對(duì)同胞對(duì):同卵雙胞胎同卵雙胞胎 124141212321 SM Risch利用復(fù)發(fā)風(fēng)險(xiǎn)比將受累同胞對(duì)利用復(fù)發(fā)風(fēng)險(xiǎn)比將受累同胞對(duì)(ASP)的方法推廣到了患病親屬對(duì),這種方法稱的方法推廣到了患病親屬對(duì),這種方法稱為患病親屬對(duì)方法為患病親屬對(duì)方法(affected relative pair, ARP). R表示一般親屬關(guān)系表示一般親屬關(guān)系 IBD

35、T表示親屬對(duì)在性狀位點(diǎn)同源等位的個(gè)表示親屬對(duì)在性狀位點(diǎn)同源等位的個(gè)數(shù)數(shù)四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 IBDT對(duì)應(yīng)著下面三種血緣關(guān)系:對(duì)應(yīng)著下面三種血緣關(guān)系: IBDT=2等價(jià)于同卵雙胞胎等價(jià)于同卵雙胞胎 IBDT1等價(jià)于父子關(guān)系等價(jià)于父子關(guān)系 IBDT0等價(jià)于獨(dú)立關(guān)系等價(jià)于獨(dú)立關(guān)系四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析 患病親屬對(duì)患病親屬對(duì)(ARP)可以用來進(jìn)行連鎖分析,可以用來進(jìn)行連鎖分析,可以描述為:從群體中收集一組兩成員都可以描述為:從群體中收集一組兩成員都患病的親屬對(duì),確定親屬對(duì)同源等位的分患病的親屬對(duì),確定親屬對(duì)同源等位的分布,然后比較觀察到的同源等位分布與期布

36、,然后比較觀察到的同源等位分布與期望的同源等位分布。如果觀察值比期望值望的同源等位分布。如果觀察值比期望值相差較大,這種差別可能是由于疾病與基相差較大,這種差別可能是由于疾病與基因有關(guān)而引起。這樣兩成員都患病則他們因有關(guān)而引起。這樣兩成員都患病則他們的基因相似,從而他們的同源等位個(gè)數(shù)偏的基因相似,從而他們的同源等位個(gè)數(shù)偏大。大。四、患病親屬對(duì)數(shù)據(jù)分析四、患病親屬對(duì)數(shù)據(jù)分析Affected Relative Pairs Methods When the underlying genetic model cannot be specified 遺傳模型未知遺傳模型未知 Missing marker

37、 data 有缺失數(shù)據(jù)有缺失數(shù)據(jù) Extended pedigrees 擴(kuò)展家系擴(kuò)展家系 Discrete phenotypes 離散表型離散表型 Computational consideration Unavailability of suitable (better) package 無更好方法無更好方法General Idea of ARP to utilize procedures that rely less completely on the genetic model specification 不依賴于遺傳模式不依賴于遺傳模式 to use statistical appro

38、aches that is less sensitive to departure from normal distribution 敏感性較低敏感性較低 to employ Large Sample Theory 大數(shù)定律大數(shù)定律 to gain simplicity at expense of loss of power 功效低,計(jì)算簡(jiǎn)單功效低,計(jì)算簡(jiǎn)單 to make choice of no choice 五、受累家系成員五、受累家系成員(Affected Pedigree Member, APM)方法方法 目的:在一個(gè)家系中尋找所有受累家系目的:在一個(gè)家系中尋找所有受累家系成員之間的

39、成員之間的IBS。What is the IBS status of the 2 indicated individuals ?The 2 indicated individuals are IBS for two alleles.What is their IBD status ?The 2 indicated individuals are IBD for neither one of the alleles.五、受累家系成員五、受累家系成員(Affected Pedigree Member, APM)方法方法 zij :一個(gè)親屬對(duì)之間的相似性的統(tǒng)計(jì)量;:一個(gè)親屬對(duì)之間的相似性的統(tǒng)計(jì)量;

40、假假設(shè)其中一個(gè)親屬對(duì)的基因型為設(shè)其中一個(gè)親屬對(duì)的基因型為 (A1,A2) ,另一,另一個(gè)為個(gè)為 (B1,B2) 1, x and y represent the same allele IBS ( , )0, otherwisex y),(412121babaijBAz 111sisijijzz一般的相似性統(tǒng)計(jì)量計(jì)算如下:五、受累家系成員五、受累家系成員(Affected Pedigree Member, APM)方法方法 兩個(gè)親屬共享一個(gè)稀少的等位比共享一個(gè)普兩個(gè)親屬共享一個(gè)稀少的等位比共享一個(gè)普遍的等位更有意義。因此在考慮遍的等位更有意義。因此在考慮IBS是通常將是通常將等位頻率考慮加入進(jìn)

41、去:等位頻率考慮加入進(jìn)去:)(),(412121ababaijAfBAz 1)(1aAf可能的權(quán)重函數(shù):aAaPAf1)(2aAaPAf1)(3所有家系的全局統(tǒng)計(jì)量21/2() var()mmmmmmw ZE ZTwZwhere1/21/2(1) var()mmmwrZAdvantages 不必明確性狀的遺不必明確性狀的遺傳模式傳模式 除了同胞對(duì)之外還除了同胞對(duì)之外還包括受累的親屬包括受累的親屬 計(jì)算快速計(jì)算快速 可以進(jìn)行多點(diǎn)分析可以進(jìn)行多點(diǎn)分析Disadvantages 依賴依賴IBS而不是而不是IBD 對(duì)于標(biāo)記等位的等位頻率對(duì)于標(biāo)記等位的等位頻率敏感敏感 扔掉了一些潛在的信息扔掉了一些潛在

42、的信息 一類錯(cuò)誤會(huì)提高一類錯(cuò)誤會(huì)提高 權(quán)重權(quán)重 當(dāng)家系結(jié)構(gòu)復(fù)雜時(shí)排除標(biāo)準(zhǔn)的LOD分?jǐn)?shù)計(jì)算; 如果受累的親屬很多而不是同胞對(duì)可以應(yīng)用APM ; 如果進(jìn)行多點(diǎn)分析;應(yīng)用:應(yīng)用:說明說明 Excellent idea with sacrifice of statistical power 犧牲統(tǒng)計(jì)功效犧牲統(tǒng)計(jì)功效 May be suitable for first rapid blind screening for linkage 快速快速 Never used when an approach based in IBD is available IBD能用時(shí)能用時(shí)不用此方法不用此方法SimIBD

43、Analysis To computing an observed statistics based on all marker and pedigree information To use permutation to obtain empirical threshold values (randomly shuffling marker genotypes) to rely on three factors: the genotypes of both affected and unaffected, marker allele frequencies and pedigree stru

44、ctureSimIBD statistics 22,111()ijia jbabZapSimIBD statistics for a given pedigree 1/2111(1)rrpijijZZrThe overall SimIBD statistics for all pedigrees 1mppZZAdvantages Disadvantages Using IBD information Without requirement of specifying the genetic model Having a lower Type I error and high power rel

45、ative to APM Cannot be used for arbitrarily complex pedigree (software limitation) No extension to multipoint analysis (software limitation) High computing load due to permutations Usefulness When an extended pedigree structure precludes standard lod score calculations for a single gene disorder Whe

46、n many affected relatives other than sibling are available When genotypes for unaffected individuals who connect affected individuals are availableCommentary It is a clever tactic to use empirical distribution Well suitable for small sample size when CLT is not working Reducing Type I error due to p

47、olygenic background and common environmentsNPL (Non-parametric linkage)Analysis Based on IBD Using all pedigree information Using all marker information The NPL pairs statistics,0,1,2pairsijijSSwhereSorThe normalized statistics 1/2()var()pairspairspairspairsSE SZSThe NPLall statistics 212( )!faallih

48、iSb h where a = # affected in the pedigreeh = collections of alleles2f = total # of founder allelesbi(h) = total # of a specific founder allele (i) in the collection (h)Advantages Disadvantages A friendly and easy-to-use interface NPLall is the only statistics that use all affected relatives simulta

49、neously instead in pairs Fully using all the marker simultaneously NPL is limited to small pedigree (2n-f = 16 (due to Lander-Green algorithm) Computing speed and loadUsefulness When pedigree of moderate size are available. Even pedigree with only a single affected sib pair can be useful, since a la

50、rge number of markers can be examined simultaneously. Large pedigree can not examined. When many relatives other tan siblings are available When a large number of linked markers are being examined. This is particularly useful if pedigrees have somewhat sparse genotypingCommentary Collecting and geno

51、typing data sets of size sufficient for mapping complex diseases is a formidable task. Multipoint methods can extract inheritance information from all the markers. Robust, non-parametric linkage methods take account of all pedigree member. Who takes account of information on trait phenotypic distrib

52、ution?WPC Analysis WPC (weighted pairwise correlation) are based on a score test of Homogeneity Strata are marker genotypes within each family Unlinked markers to traits homogeneity across these strata Based on IBSWPC statistics 111nnLijijij iSW UU WRPC statistics 1/2()var()RRRSE SWRPCSThe overall W

53、RPC statistic 1nppWRPCAdvantages Disadvantages Without requirement of specification of genetic model derived for quantitative traits and has the greatest power when it is used for this purpose The use of rank allows easy conversion for qualitative traits, highlighting the flexibility of WPC The only method that allow inclusion environmenta

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫(kù)網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。

評(píng)論

0/150

提交評(píng)論