專利名稱:一種癌癥診斷用的陣列和分析方法
技術領域:
本發(fā)明涉及一種通過分析或檢測樣本中的癌相關基因或其產物而進行的癌診斷方法。具體而言,本發(fā)明涉及一種癌癥診斷用的陣列和分析方法。
背景技術:
很早就已經知道,癌癥的發(fā)病起因于細胞蛋白質的變異或數(shù)量的變化。近年來,基因工程學的發(fā)展使編碼特定蛋白質的基因的擴增或分析癌細胞中的基因變異成為可能,給癌研究領域帶來了飛躍性的發(fā)展。截止到目前,關于細胞癌變、癌細胞異常增殖的所謂癌基因的分析和鑒定還在持續(xù)進行中。另一方面,由于變異或表達下降導致癌變的癌抑制基因在近幾年備受注目,迄今為止,作為癌抑制基因,已經發(fā)現(xiàn)有視網膜胚細胞瘤的Rb基因、大腸癌的p53基因和APC基因、Wilms腫瘤的WT1基因等等。也報道了使用WT1基因的癌抑制劑(請參見專利文獻1)。
而且,已經逐漸明確,癌的發(fā)生、惡性化發(fā)展、轉移等并不是僅僅涉及一個基因的異常,而是涉及到多個基因的異常,并認為,進一步還會存在有更多的未鑒定的癌基因或癌抑制基因。雖然與癌相關、具有效果的基因已經了解了很多,但多數(shù)情況下,迄今其篩選鑒別的方法或者是通過將染色體DNA進行染色而將患者基因的變異可視化以后進行檢出(非專利文獻1),或者是通過LOH(雜合性丟失,Lossof Heterozygosity)分析基因缺失從而選定一個大致的范圍以后、再逐漸縮小到重要的基因區(qū)(專利文獻2)??墒牵@些方法的缺點是其需要判別的DNA缺失區(qū)域非常大,縮小精簡到重要的基因區(qū)域的工作需要花費大量的時間和勞力,因此,其作為檢出癌相關基因的方法是有局限的。而且,現(xiàn)有技術中的癌病變的分離、識別方法也使癌癥的判定非常困難。
非專利文獻1Yasuhide Yamashita,et al.,World J Gastroenterol,11(33)5129-5135,2005專利文獻1WO2003/002142號公報專利文獻2WO01/032859號公報發(fā)明內容本發(fā)明要解決的課題是提供一種癌診斷手段,其通過現(xiàn)有方法中還沒有發(fā)現(xiàn)的新的方法檢出癌相關基因,通過檢測出該癌相關基因的缺失、或該癌相關基因的變異或表達量異常而進行癌診斷。
為解決上述問題,本發(fā)明人全力進行了非小細胞肺癌病例的DNA的部分缺失區(qū)的探索和甲基化程度不同的基因的鑒定。非小細胞肺癌進一步被細分為腺癌、扁平上皮癌、大細胞癌、腺扁平上皮癌等組織型,占肺癌的8成以上。本發(fā)明人為了鑒定非小細胞肺癌中的缺失的DNA,采用新近開發(fā)的陣列CGH法(Inazawa J.,et al.,CancerSci.95(7),559,2004)篩選出了癌癥中高頻缺失的基因,結果成功分離了在基因組DNA中顯示同源缺失的基因,也即在非小細胞肺癌的基因組中不存在的基因,通過檢測出這些基因,使癌的診斷成為可能。基于以上這些發(fā)現(xiàn)完成了本發(fā)明。
即,根據(jù)本發(fā)明,本發(fā)明提供一種癌癥診斷用的陣列,其包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或部分區(qū)域的DNA,用于檢測出樣本試樣中的各基因的缺失。
根據(jù)本發(fā)明的另外一個方面,本發(fā)明提供一種癌癥診斷用的陣列,其包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或部分區(qū)域的DNA或RNA,用于分析樣本試樣中GMDS基因、ANKRD15基因、TEK基因或者EBI2基因。
優(yōu)選的是,所述分析為基因變異的檢測或基因表達量異常的檢測。
根據(jù)本發(fā)明的再另外一個方面,本發(fā)明提供一種分析方法,其采用針對GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的抗體或其片斷來分析樣本試樣中GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白。
優(yōu)選的是,所述分析為蛋白質表達量異常的檢測。
優(yōu)選的是,本發(fā)明的診斷用陣列用于肺癌的診斷。
本發(fā)明還提供了一種癌癥的診斷方法,其包括采用包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或部分區(qū)域的DNA來檢測出樣本試樣中的各基因的缺失的工序。
根據(jù)本發(fā)明的另一個方面,本發(fā)明提供一種癌癥的診斷方法,其包括采用包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或部分區(qū)域的DNA或RNA來分析樣本試樣中GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的工序。
優(yōu)選的是,所述分析為基因變異的檢測或基因表達量異常的檢測。
根據(jù)本發(fā)明的再一個方面,本發(fā)明提供一種癌癥的診斷方法,其包括采用針對于GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的抗體或其片斷來分析樣本試樣中GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的工序。
優(yōu)選的是,所述分析為蛋白質表達量異常的檢測。
優(yōu)選的是,本發(fā)明的診斷方法為肺癌的診斷方法。
根據(jù)本發(fā)明,新提供了一種通過檢測癌相關的GMDS基因、ANKRD15基因、TEK基因或者EBI2基因、或者檢測該基因所編碼的GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白而進行的癌診斷方法。無論是從基于癌癥個性的對癌癥的治療或預后改善的臨床角度考慮,還是從癌癥基礎研究的角度考慮,這些基因或基因產物的分析都是非常有用的。而且,通過測定GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的信使RNA的表達量,也使對非小細胞肺癌患者的篩選成為可能。
具體實施例方式
下面,就本發(fā)明的實施形式和實施方法詳細地說明本發(fā)明。
在本發(fā)明的癌診斷方法中,作為檢測對象的癌相關基因為GMDS基因、ANKRD15基因、TEK基因或者EBI2基因。
GMDS基因、ANKRD15基因、TEK基因和EBI2基因的堿基序列、以及GMDS蛋白、ANKRD15蛋白、TEK蛋白和EBI2蛋白的氨基酸序列均為已知,可以經由National Center for BiotechnologyInformation(NCBI)的數(shù)據(jù)庫取得。
GMDS已知為GMD、GDP-甘露糖4,6-脫水酶(EC 4.2.1.47)(GDP-D-甘露糖脫水酶),其在REFSEQ中的ID為NM_001500(序列編號1)。還列出了其氨基酸序列(序列編號2)。該基因包含于人基因組中的RP-11克隆79M24中,在UCSC基因組瀏覽器(UCSCGenome Browser)(http://genome.ucsc.edu/cgi-bin/hgGateway)可以瀏覽。
ANKRD15已知為DKFZp451G231、KANK、KIAA0172、MGC43128、錨蛋白重復結構域15(ankyrin repeat domain 15),其在REFSEQ中的ID為NM_015158、NM_153186(序列編號3和序列編號4)。還列出了其氨基酸序列(序列編號5和序列編號6)。該基因包含于人基因組中的RP-11克隆31F19中。已經報道在腎癌細胞中,其作為癌抑制基因行使功能(J.Biol.Chem.,Vol.277,Issue 39,36585-36591,2002)。
TEK已知為CD202B、TIE-2、VMCM(venous malformations,multiple cutaneous and mucosal)、VMCM1、TEK酪氨酸激酶、內皮因子,其在REFSEQ中的ID為NM 000459(序列編號7)。還列出了其氨基酸序列(序列編號8)。該基因包含于人基因組中的RP-11克隆33O15中。已知其是一種具有酪氨酸激酶活性的促血管生成素1的受體,所述促血管生成素1涉及血管內皮細胞的增殖。(Cell 118149-161,2004)。
EBI2已知為EBV誘導的偶聯(lián)G蛋白的受體2、Epstein-Barr病毒誘導的基因2(淋巴細胞特異性的偶聯(lián)G蛋白的受體),其在REFSEQ中的ID為NM_004951(序列編號9)。還列出了其氨基酸序列(序列編號10)。該基因包含于人基因組中的RP-11克隆72J7中。已知它是G蛋白的共同受體,由EB病毒的感染誘導產生,其配體尚未被發(fā)現(xiàn)(J.Virol.672209-2220,1993.)。
在本說明書中,所謂“基因”是指由上述序列限定的來源于人的基因(基因組DNA以及cDNA),所謂“蛋白質”是指由該基因編碼、由上述氨基酸序列限定的蛋白質。
GMDS基因、ANKRD15基因、TEK基因或者EBI2基因可以為本領域技術人員采用公知的技術由培養(yǎng)細胞等中獲得的cDNA,也可以根據(jù)本發(fā)明說明書的表1所記載的克隆信息利用PCR等方法進行化學合成。在根據(jù)PCR法獲得具有序列編號所記載的堿基序列的DNA的時候,將人染色體DNA或cDNA文庫作為模板使用,使用能夠擴增來源于表1記載的克隆信息的堿基序列的1對設計引物進行PCR擴增,PCR擴增的DNA片斷可以克隆入在大腸桿菌等宿主中能夠增殖的適當載體中。
上述探針或引物的制備、cDNA文庫的構建、cDNA文庫的篩選、以及目的基因的克隆等等操作對于本領域技術人員來說均是已知的,例如,可以參照<分子克隆試驗手冊>,第2版,冷泉港實驗室,冷泉港,紐約,1989,或Current Protocols in Molecular Biology,Supplement1-38,John Wiley & Sons(1987-1997)等記載的方法進行制備。
作為GMDS基因、ANKRD15基因、TEK基因或者EBI2基因,也可以采用這些基因的等同基因。在本發(fā)明中,所謂“等同基因”,是指具有在表1所示的源于克隆信息的堿基序列中有1個至多個堿基缺失、添加或取代的堿基序列且編碼癌相關蛋白質的堿基序列的基因、或者具有能與表1所示的源于克隆信息的堿基序列在嚴格條件下雜交的堿基序列且編碼癌相關蛋白質的堿基序列的基因。GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的等同基因還包括這些基因的片段。
上述“在表1所示的源于克隆信息的堿基序列中有1個至多個堿基缺失、添加或取代的堿基序列”中,對“1個至多個”的范圍并不作特別限定,例如,其意指1個至60個,優(yōu)選為1個至30個,更優(yōu)選為1-20個,進一步優(yōu)選為1至10個,特別優(yōu)選為1至5個左右。
因此,“GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的等同基因”只要具有上述定義的結構和功能,不拘來源,也可以來源于人以外的哺乳動物,也可以向來源于人等哺乳動物的基因人工導入變異。
上述“具有在表1所示的源于克隆信息的堿基序列中有1個至多個堿基缺失、添加或取代的堿基序列且編碼癌相關蛋白質的堿基序列的基因”可以通過化學合成、基因工程方法或誘變等本領域技術人員已知的任意方法進行制備。具體地,利用具有表1所示的源于克隆信息的堿基序列的DNA,通過在這些DNA中導入變異即可以獲得上述基因。例如,對于具有表1所示的源于克隆信息的堿基序列的DNA,可以采用與誘變劑接觸的方法、采用紫外線照射的方法、采用基因工程的方法等等來進行。作為基因工程的方法之一,定點特異誘變由于可以在特定位置引入特定變異而非常有用,可以參照<分子克隆試驗手冊>,第2版,冷泉港實驗室,冷泉港,紐約,1989,或Current Protocolsin Molecular Biology,Supplement 1-38,John Wiley & Sons(1987-1997)等記載的方法進行制備。
上述“在嚴格條件下雜交的堿基序列”是指將DNA作為探針使用、采用菌落雜交法、蝕斑雜交法、或者DNA印記雜交而得到的DNA序列。例如,可以列舉的是如此鑒定獲得的DNA采用將來源于菌落或蝕斑的DNA或該DNA的片段固定化的濾膜,在0.7-1.0M的NaCl存在下,在65℃進行雜交以后,使用0.1-2×SSC溶液(1×SSC溶液為1 50mM氯化鈉、15mM檸檬酸鈉)在65℃條件下清洗濾膜。雜交可以參照<分子克隆試驗手冊>,第2版,冷泉港實驗室,冷泉港,紐約,1989等記載的方法進行。
在嚴格(嚴謹)條件下雜交的DNA可以列舉出與作為探針使用的DNA堿基序列具有一定程度以上的同源性的DNA,例如,具有70%以上,優(yōu)選80%以上,更優(yōu)選90%以上,進一步優(yōu)選93%以上,特別優(yōu)選95%以上同源性的DNA。
上述的所謂“具有與表1所示的源于克隆信息的堿基序列在嚴格條件下雜交的堿基序列且編碼癌相關蛋白質的堿基序列的基因”,如上所述,可以在一定嚴謹度雜交條件下通過采用菌落雜交法、蝕斑雜交法、或者DNA印記雜交而獲得。
在本發(fā)明中,可以采用包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的一部分的DNA序列對樣本試樣中的各個基因進行分析。這里,所謂“GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的一部分”意思是指由表1所示的源于克隆信息的堿基序列中的例如約10-30個連續(xù)堿基序列組成的寡核苷酸。
篩選鑒別癌癥的檢測可以通過將包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或其一部分的DNA或RNA作為引物或探針對樣本試樣中的GMDS基因、ANKRD15基因、TEK基因或者EBI2基因進行分析,這里,所謂“對GMDS基因、ANKRD15基因、TEK基因或者EBI2基因進行分析”具體是指檢測基因組DNA的缺失、基因變異或基因表達量異常。
當以上述DNA或RNA作為引物進行基因變異的檢測時,例如,通過選定的2種序列作引物,用PCR擴增由樣本試樣制備的DNA的部分序列。
另一方面,基因表達量的異常的檢測可以通過采用包含上述RNA序列的探針進行RNA印記雜交或者RT-PCR(逆轉錄聚合酶鏈式反應)法進行。
樣本試樣可以采用疑似腫瘤的組織切片、血液、淋巴液、咳痰、肺清洗液、尿、糞便、組織培養(yǎng)上清等等。
本發(fā)明中,可以通過采用針對于GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的抗體或其片斷來分析樣本試樣中GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白,并由此進行癌的診斷。
本方法中采用的針對GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的抗體(以下簡稱“抗體”)可以以GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的全部或者一部分作為抗原按通常的方法制備。所謂“GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的一部分”是指記載于序列表中的GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白質的氨基酸序列中例如連續(xù)至少6個氨基酸、優(yōu)選至少約8-10個氨基酸、進一步優(yōu)選至少約11-20個氨基酸組成的多肽。作為抗原的GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的全部或者一部分可以通過生物學方法進行制備,也可以通過化學合成方法進行制備。
多克隆抗體可以通過將上述抗原多次接種于小鼠、豚鼠、兔等動物的皮下、肌肉內、腹腔內、靜脈內而充分免疫以后,從該動物采血、進行血清分離來進行制備。單克隆抗體可以通過將免疫小鼠的脾細胞和市場上銷售的小鼠骨髓瘤細胞通過細胞融合制得雜交瘤以后,從培養(yǎng)該雜交瘤的上清或接種該雜交瘤的小鼠腹水中制得。
通過采用按上述方法制得的針對GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的抗體或其片斷,可以分析獲得樣本試樣中GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的表達量。測定可以使用免疫印記法、酶免疫測定法(EIA)、放射免疫測定法(RIA)、熒光抗體法、免疫細胞染色等等免疫學方法、或者蛋白質印記法等等。這里,所謂“針對GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的抗體片段”是指該抗體的單鏈抗體片段(scFV)等。
樣本試樣可以采用疑似腫瘤的組織切片、血液、淋巴液、咳痰、肺清洗液、尿、糞便、組織培養(yǎng)上清等等。當測定的樣本試樣中的GMDS蛋白、ANKRD15蛋白、TEK蛋白或者EBI2蛋白的表達量降低的時候,其顯示在作為樣本的組織或細胞中GMDS、ANKRD15、TEK或者EBI2基因的表達被抑制,可以進行癌的診斷。
本發(fā)明的癌診斷方法中,作為診斷對象的癌的具體例子可以列舉為例如惡性黑色素瘤、惡性淋巴瘤、肺癌、食道癌、胃癌、大腸癌、直腸癌、結腸癌、輸尿管腫瘤、膽囊癌、膽管癌、膽道癌、乳房癌、肝癌、胰腺癌、睪丸癌、上頜癌、舌癌、唇癌、口腔癌、咽癌、喉癌、卵巢癌、子宮癌、前列腺癌、甲狀腺癌、腦瘤、卡波西肉瘤、血管瘤、白血病、真性紅細胞增多癥、神經胚細胞瘤、視網膜胚細胞瘤、骨髓瘤、膀胱瘤、肉瘤、骨肉瘤、肌肉瘤、皮膚癌、基底細胞癌、皮膚附屬器癌、轉移性皮膚癌、皮膚黑色素瘤等等,但是,并不僅僅限于這些。優(yōu)選的作為診斷對象的癌是肺癌,特別優(yōu)選的是非小細胞肺癌。
下面通過實施例進一步詳細地說明本發(fā)明,但本發(fā)明并不特別限定于以下這些實施例。
實施例(1)試驗材料使用的細胞株是扁平上皮細胞株EBC-1、LK-2、PC10、VMRC-LCP、LC-1sq、ACC-LC-73、腺癌細胞株11-18、A549、ABC-1、RERF-LC-OK、VMRC-LCD、SK-LC-3、RERF-LC-KJ、大細胞癌細胞株KNS-62、86-2、LU65、PC-13、ACC-LC-33、NCI-H460、LU99A。至于來源于臨床樣本的肺癌樣本,使用了53種腺癌的石蠟包埋樣本、59種包含鄰接正常部位的速凍樣本。來源于臨床樣本的癌樣本從日本國立癌癥中心、長野北信綜合醫(yī)院獲得,得到各患者的同意并得到各組織的倫理委員會的認可。而且,在采取樣本之前不對臨床樣本的提供者進行放射治療、化學治療、免疫治療。
(2)通過陣列CGH方法分離非小細胞肺癌細胞中的缺失DNA對于陣列CGH方法,采用MCG全基因組陣列-4500(MCG wholeGenome Array-4500)(Inazawa J.,et al.,Cancer Sci.95(7),559,2004)進行非小細胞肺癌細胞株中的同源缺失基因的篩選,將來源于癌細胞的DNA用Cy3標記、正常人對照樣本的DNA用Cy5標記,將這兩種被標記物混合。將雜交后兩者的熒光信號強度相除所得的值用以2為底的對數(shù)值表示,并顯示出所述對數(shù)值為2.0以下的BAC克隆和其中包含的基因(請參見表1)。在各種癌細胞中均發(fā)現(xiàn)缺失區(qū),從而檢測出了以癌抑制基因CDKN2為代表的基因缺失,而且,在非小細胞肺癌細胞株中,又發(fā)現(xiàn)了GMDS(6p25GDP-甘露糖脫水酶)、ANKRD15(9p24.3錨蛋白重復結構域15)、TEK(9p21.3TEK酪氨酸激酶,內皮因子,TIE2)、EBI2(13q32.2Epstein-Barr病毒誘導的基因2(淋巴細胞特異性的偶聯(lián)G蛋白的受體))基因缺失。這些基因缺失揭示出在非小細胞肺癌細胞中其是作為癌相關基因在發(fā)揮作用,可以作為非小細胞肺癌的基因缺失標記進行利用。
表1通過陣列CGH方法從非小細胞肺癌細胞中作為同源缺失DNA分離出的BAC克隆和該區(qū)域包含的基因
a來自于UCSC Genome Browser,2004年5月專輯。
b位于BAC附近的可能的腫瘤抑制基因。
(3)結論根據(jù)陣列CGH方法進行篩選,分離出了在基因組中顯示同源缺失的基因,即分離出了在非小細胞肺癌中不存在的基因組區(qū)域的基因。因此,通過檢測這些基因的存在與否可以進行癌的診斷。
序列表<110>富士膠片株式會社(Fuji Photo Film Co.)<120>一種癌癥診斷用的陣列和分析方法<130>FI-070292-59<160>10<170>PatentIn version 3.3<210>1<211>1698<212>DNA<213>Homo sapiens<400>1cccggccctc cctgcacggc ctcccgtgcg cccctgtcag actgtggcgg ccggtcgcgc 60ggtgcgctct ccctccctgc ccgcagcctg gagaggcgct tcgtgctgca cacccccgcg 120ttcctgccgg caccgcgcct gccctctgcc gcgctccgcc ctgccgccga ccgcacgccc 180gccgcgggac atggcacacg caccggcacg ctgccccagc gcccggggct ccggggacgg 240cgagatgggc aagcccagga acgtggcgct catcaccggt atcacaggcc aggatggttc 300ctacctggct gagttcctgc tggagaaagg ctatgaggtc catggaattg tacggcggtc 460cagttcattt aatacgggtc gaattgagca tctgtataag aatccccagg ctcacattga 420aggaaacatg aagttgcact atggcgatct cactgacagt acctgccttg tgaagatcat 480taatgaagta aagcccacag agatctacaa ccttggagcc cagagccacg tcaaaatttc 540ctttgacctc gctgagtaca ctgcggacgt tgacggagtt ggcactctac gacttctaga 600tgcagttaag acttgtggcc ttatcaactc tgtgaagttc taccaagcct caacaagtga 660actttatggg aaagtgcagg aaatacccca gaaggagacc acccctttct atccccggtc 720accctatggg gcagcaaaac tctatgccta ttggattgtg gtgaacttcc gtgaggcgta 780taatctcttt gcagtgaacg gcattctctt caatcatgag agtcccagaa gaggagctaa 840tttcgttact cgaaaaatta gccggtcagt agctaagatt taccttggac aactggaatg 900tttcagtttg ggaaatctgg atgccaaacg agattggggc catgccaagg actatgtgga 960ggctatgtgg ttgatgttgc agaatgatga gccggaggac ttcgttatag ctactgggga 1020ggtccatagt gtccgggaat ttgtcgagaa atcattcttg cacattggaa aaaccattgt 1080gtgggaagga aagaatgaaa atgaagtggg cagatgtaaa gagaccggca aagttcacgt 1140
gactgtggat ctcaagtact accggccaac tgaagtggac tttctgcagg gcgactgcac 1200caaagcgaaa cagaagctga actggaagcc ccgggtcgct ttcgatgagc tggtgaggga 1260gatggtgcac gccgacgtgg agctcatgag gacaaacccc aatgcctgag cagcgcctcg 1320gagcccggcc cgccctccgg ctacaatccc cgcagagtct ccggtgcaga cgcgctgcgg 1380ggatggggag cggcgtgcca atctgcgggt cccctgcggc ccctgctgcc gctgcgctgt 1440cccggccgca agagcggggc cgccccgccg aggtttgtag cagccgggat gtgaccctcc 1500agggtttggg tcgctttgcg tttgtcgaag cctcctctga atggctttgt gaaatcaaga 1560tgttttaatc acattcactt tacttgaaat tatgttgtta cacaacaaat tgtggggcct 1620tcaaattgtt tttctctttt catattaaaa atggtctttc tgtgaactag caaaaaaaaa 1680aaaaaaaaaa aaaaaaaa 1698<210>2<211>372<212>PRT<213>Homo sapiens<400>2Met Ala His Ala Pro Ala Arg Cys Pro Ser Ala Arg Gly Ser Gly Asp1 5 10 15Gly Glu Met Gly Lys Pro Arg Asn Val Ala Leu Ile Thr Gly Ile Thr20 25 30Gly Gln Asp Gly Ser Tyr Leu Ala Glu Phe Leu Leu Glu Lys Gly Tyr35 40 45Glu Val His Gly Ile Val Arg Arg Ser Ser Ser Phe Asn Thr Gly Arg50 55 60Ile Glu His Leu Tyr Lys Asn Pro Gln Ala His Ile Glu Gly Asn Met65 70 75 80Lys Leu His Tyr Gly Asp Leu Thr Asp Ser Thr Cys Leu Val Lys Ile85 90 95Ile Asn Glu Val Lys Pro Thr Glu Ile Tyr Asn Leu Gly Ala Gln Ser100 105 110His Val Lys Ile Ser Phe Asp Leu Ala Glu Tyr Thr Ala Asp Val Asp
115 120 125Gly Val Gly Thr Leu Arg Leu Leu Asp Ala Val Lys Thr Cys Gly Leu130 135 140Ile Asn Ser Val Lys Phe Tyr Gln Ala Ser Thr Ser Glu Leu Tyr Gly145 150 155 160Lys Val Gln Glu Ile Pro Gln Lys Glu Thr Thr Pro Phe Tyr Pro Arg165 170 175Ser Pro Tyr Gly Ala Ala Lys Leu Tyr Ala Tyr Trp Ile Val Val Asn180 185 190Phe Arg Glu Ala Tyr Asn Leu Phe Ala Val Asn Gly Ile Leu Phe Asn195 200 205His Glu Ser Pro Arg Arg Gly Ala Asn Phe Val Thr Arg Lys Ile Ser210 215 220Arg Ser Val Ala Lys Ile Tyr Leu Gly Gln Leu Glu Cys Phe Ser Leu225 230 235 240Gly Asn Leu Asp Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val245 250 255Glu Ala Met Trp Leu Met Leu Gln Asn Asp Glu Pro Glu Asp Phe Val260 265 270Ile Ala Thr Gly Glu Val His Ser Val Arg Glu Phe Val Glu Lys Ser275 280 285Phe Leu His Ile Gly Lys Thr Ile Val Trp Glu Gly Lys Asn Glu Asn290 295 300Glu Val Gly Arg Cys Lys Glu Thr Gly Lys Val His Val Thr Val Asp305 310 315 320Leu Lys Tyr Tyr Arg Pro Thr Glu Val Asp Phe Leu Gln Gly Asp Cys325 330 335Thr Lys Ala Lys Gln Lys Leu Asn Trp Lys Pro Arg Val Ala Phe Asp340 345 350Glu Leu Val Arg Glu Met Val His Ala Asp Val Glu Leu Met Arg Thr
355 360 365Asn Pro Asn Ala370<210>3<211>5083<212>DNA<213>Homo sapiens<400>3ggtccgcggc ggagcgagcg agcggccggc aggttgggag gagcggccga aggttgaatg 60cctttgagaa cttgatgcat aaaatttgca tgactcctca ctcctttctg gatctctcat 120tggactcaag ccagcatggc tcacaccaca aaggttaacg gcagtgcctc aggaaaagca 180ggtgatattc tcagtggaga ccaggacaag gaacagaaag acccttactt tgtggagacc 240ccctatggtt atcaactaga cttagatttc ctcaaatatg tggatgacat acagaaggga 300aataccatca aaagactgaa catccagaag aggcggaagc cgtccgtgcc atgcccagaa 360cccaggacca catctggtca gcaaggtata tggacttcca ctgaatccct ctcatcctcc 420aacagtgatg acaacaagca gtgccccaac ttcctcatag ccagaagtca agttacatca 480actccaatct caaagccacc tccccctctg gagacctcac tcccttttct taccatccca 540gaaaatcgac agctgccacc tccctcacca caactcccaa agcataacct tcatgtcacc 600aagacactga tggagacccg gagaagactg gaacaggaga gagccaccat gcagatgaca 660ccgggtgagt tcagaaggcc caggctggcc agttttggag gcatgggcac cacaagctcc 720ctcccttctt ttgtgggttc tggaaaccac aatcctgcca agcaccagct tcagaatgga 780taccaaggta atggggatta tggtagctat gccccagctg ctcccaccac ttcctccatg 840gggagctcca tccgccacag ccccctgagc tcagggatct ccaccccagt gaccaacgtg 900agccccatgc acctgcagca catccgcgag cagatggcca ttgctctgaa acgcctgaag 960gagctggagg agcaggtgcg aaccatccct gtgctccagg taaagatctc tgtcttgcaa 1020gaagagaaaa ggcagttggt ctcacagctg aaaaaccaaa gggctgcatc ccagatcaat 1080gtctgtggtg tgaggaagcg gtcctatagt gcggggaacg cctcccagct ggaacagctc 1140tcccgggccc gaagaagtgg cggggaatta tacattgact atgaggagga agaaatggag 1200accgtagaac agagcacgca gaggataaag gagttccggc aacttacagc agacatgcaa 1260gccctggagc agaagatcca ggacagcagc tgtgaggcct cctcagagct cagggagaat 1320
ggagagtgcc ggtctgtggc tgtgggtgcc gaggagaaca tgaacgacat cgtcgtgtac 1380cacagaggct ccaggtcctg taaggatgca gctgtaggga cacttgttga gatgagaaat 1440tgtggggtca gcgtgacaga ggccatgctt ggagtgatga ctgaagctga caaagaaatt 1500gagctgcaac agcagaccat agaatccttg aaggaaaaga tctatcgcct agaagtacag 1560cttagagaaa ccacccatga ccgggagatg actaaactga aacaagagct gcaggctgct 1620ggatcgagga aaaaggttga caaagccacg atggcccagc cgcttgtttt cagtaaggtg 1680gtggaggcag tggtgcagac cagagaccaa atggtcggca gtcacatgga cctggtggac 1740acgtgtgttg ggacctccgt ggaaacaaac agtgtaggca tctcctgcca gcctgaatgt 1800aagaataaag tcgtagggcc tgagctgcct atgaattggt ggattgttaa ggagagggtg 1860gaaatgcatg accgatgtgc tgggaggtct gtggaaatgt gtgacaagag tgtgagtgtg 1920gaagtcagcg tctgcgaaac aggcagcaac acagaggagt ctgtgaacga cctcacactc 1980ctcaagacaa acttgaatct caaagaagtg cggtctatcg gttgtggaga ttgttctgtt 2040gacgtgaccg tctgctctcc aaaggagtgc gcctcccggg gcgtgaacac tgaggctgtt 2100agccaggtgg aagctgccgt catggcagtg cctcgtactg cagaccagga cactagcaca 2160gatttggaac aggtgcacca gttcaccaac accgagacgg ccaccctcat agagtcctgc 2220accaacactt gtctaagcac tttggacaag cagaccagca cccagactgt ggagacgcgg 2280acagtagctg taggagaagg ccgtgtcaag gacatcaact cctccaccaa gacgcggtcc 2340attggtgttg gaacgttgct ttctggccat tctgggtttg acaggccatc agctgtgaag 2400accaaagagt caggtgtggg gcagataaat attaacgaca actatctggt tggtctcaaa 2460atgaggacta tagcttgtgg gccaccacag ttgactgtgg ggctgacagc cagcagaagg 2520agcgtggggg ttggggatga ccctgtaggg gaatctctgg agaaccccca gcctcaagct 2580ccacttggaa tgatgactgg cctggatcac tacattgagc gtatccagaa gctgctggca 2640gaacagcaga cactgctggc tgagaactac agtgaactgg cagaagcttt cggggaacct 2700cactcacaga tgggctccct caactctcag ctcatcagca ccctgtcgtc tatcaactct 2760gtcatgaaat ctgcaagcac tgaagagctg aggaaccctg acttccagaa aaccagtctg 2820ggtaaaatca caggcaatta tttgggatat acctgtaagt gtgggggcct tcagtcagga 2880agtcccttaa gctcccagac atcccagcct gagcaagaag tggggacctc agaaggaaag 2940ccaatcagca gcctggatgc cttccccact caggaaggta cgctgtctcc agtgaacctg 3000acagacgacc agatcgccgc tggcctctat gcatgtacaa acaatgaaag tacactgaag 3060tccatcatga agaagaaaga tggtaacaaa gattcaaatg gcgcaaaaaa gaatcttcag 3120
tttgttggca ttaatggagg gtatgaaaca acttcaagtg atgattccag ctcagatgaa 3180agctcttctt ccgagtcaga tgacgagtgt gatgtcattg agtatcctct tgaagaagag 3240gaggaggagg aggatgaaga cactcgggga atggcagaag ggcaccatgc agttaatatt 3300gaaggtttga agtctgccag ggtggaagat gaaatgcagg ttcaagaatg tgaacctgag 3360aaggtggaaa tcagagagag gtatgaatta agtgaaaaga tgttgtctgc atgcaactta 3420ctgaaaaata ctataaatga ccccaaagct ttgaccagca aagatatgag gttctgtctg 3480aacaccctcc agcacgagtg gttccgcgtg tccagtcaga agtcagccat tccagccatg 3540gtgggggact acatagctgc ttttgaggcc atttccccag atgtcctccg ctatgtcatc 3600aacttggcag acggcaacgg caacacagcc ctccattaca gcgtgtccca ctccaacttc 3660gagattgtga agctgctgtt agatgccgat gtgtgtaatg tggatcacca gaacaaggca 3720ggctacaccc ccatcatgtt ggcggccctc gccgctgtgg aagcagagaa ggacatgcgg 3780attgtggaag aactcttcgg ctgtggggat gtgaatgcca aagctagtca ggcgggacag 3840acggccctca tgctggcggt cagtcacgga cggatagaca tggtgaaggg ccttctggcc 3900tgtggggctg atgtcaacat ccaggatgac gagggctcca cggccctcat gtgtgccagc 3960gagcacgggc acgtggagat tgtcaagctg ctgctggccc agcccggctg caacggtcac 4020ctagaggaca acgatggcag cactgcgctc tcaatcgccc tggaagcagg acacaaggac 4080atcgctgttc ttctgtatgc ccatgtcaac tttgcaaaag cccagtctcc gggcacccct 4140aggcttggaa ggaagacgtc tcctggcccc acccaccgag gttcatttga ttgattgtat 4200gcaaatagcc ctttatttac atgccactat taagctgcta attgttcctg ttggggtgac 4260agatactgaa tgtatacgta ttgtgcctga gctcaccagc aaacagaagc atcaagccca 4320ggggtaaagg ctgaagcttt cacagtgcag agactgctag cctgggcaca cacacctcct 4380ttctggccgt cttctgtgta gggcacactt taacccagtc tctgttgctg ttgagtctct 4440gctccgtttt gtacagtcac agggaattct gatctgaagg ggcaccttct gttcactccc 4500acaaagtggt gtctggttct cactgagacg ttttaagatt tttccacaaa tatttatatg 4560tactaaatgt ggaaccatta gaaagttctt ccaaaatctc attccagcat agttttggat 4620ttttcttttg tcttatttta aaataaggaa gtcgagatga ctttgatcat tggtaacttg 4680ggcctgggcc agacaaagta taaaacttac aaaagaatat tctcatttgg tcttaactag 4740gtagatgtaa tatatgactt tttataaaaa gggtatctat atgaacttga cacagtattt 4800tcagcttttg tattccatac taaagccatg aagaactaca cgtaacatca tcatttgtat 4860taattgcaca actccaatgc taaaggttgg attgtgttag aggaatcggc tctgtatttg 4920
cctctagaga aacacagtgt tctctttgta tttatggatt cctttttacc gtgtcacatt 4980tactttggtc ctctatgtat ttaaatgttt gaagtgcctt agactcttgc catattttca 5040aaataaaatt ccattaagct ctaaaaaaaa aaaaaaaaaa aaa 5083<210>4<211>5263<212>DNA<213>Homo sapiens<400>4agagcctctc tgtagagatg cttaaagtgc tagctttatt taaaaatcag ctggcaaagc 60gggagtgaag aacacacatt ttcagaagat accggtttcc tgttttcatt aagaaaacag 120ggaatgcgga acagctgagt gcggcggggg tggtgtcact gcagccggag ggagacctcg 180ccggtgaaag ctcagcctat ggcatccgag cgtggccggc caagtttaca cgaatgttga 240aacttctcct cacggggtga ggatcgaggg cgcccgaggc cgggtccggc tgagctggag 300cgagctgtgc ggggcagcgc gggctggcgg ggagcgcggc gggcgccacg tggaaaagca 360ggtgatattc tcagtggaga ccaggacaag gaacagaaag acccttactt tgtggagacc 420ccctatggtt atcaactaga cttagatttc ctcaaatatg tggatgacat acagaaggga 480aataccatca aaagactgaa catccagaag aggcggaagc cgtccgtgcc atgcccagaa 540cccaggacca catctggtca gcaaggtata tggacttcca ctgaatccct ctcatcctcc 600aacagtgatg acaacaagca gtgccccaac ttcctcatag ccagaagtca agttacatca 660actccaatct caaagccacc tccccctctg gagacctcac tcccttttct taccatccca 720gaaaatcgac agctgccacc tccctcacca caactcccaa agcataacct tcatgtcacc 780aagacactga tggagacccg gagaagactg gaacaggaga gagccaccat gcagatgaca 840ccgggtgagt tcagaaggcc caggctggcc agttttggag gcatgggcac cacaagctcc 900ctcccttctt ttgtgggttc tggaaaccac aatcctgcca agcaccagct tcagaatgga 960taccaaggta atggggatta tggtagctat gccccagctg ctcccaccac ttcctccatg 1020gggagctcca tccgccacag ccccctgagc tcagggatct ccaccccagt gaccaacgtg 1080agccccatgc acctgcagca catccgcgag cagatggcca ttgctctgaa acgcctgaag 1140gagctggagg agcaggtgcg aaccatccct gtgctccagg taaagatctc tgtcttgcaa 1200gaagagaaaa ggcagttggt ctcacagctg aaaaaccaaa gggctgcatc ccagatcaat 1260gtctgtggtg tgaggaagcg gtcctatagt gcggggaacg cctcccagct ggaacagctc 1320
tcccgggccc gaagaagtgg cggggaatta tacattgact atgaggagga agaaatggag 1380accgtagaac agagcacgca gaggataaag gagttccggc aacttacagc agacatgcaa 1440gccctggagc agaagatcca ggacagcagc tgtgaggcct cctcagagct cagggagaat 1500ggagagtgcc ggtctgtggc tgtgggtgcc gaggagaaca tgaacgacat cgtcgtgtac 1560cacagaggct ccaggtcctg taaggatgca gctgtaggga cacttgttga gatgagaaat 1620tgtggggtca gcgtgacaga ggccatgctt ggagtgatga ctgaagctga caaagaaatt 1680gagctgcaac agcagaccat agaatccttg aaggaaaaga tctatcgcct agaagtacag 1740cttagagaaa ccacccatga ccgggagatg actaaactga aacaagagct gcaggctgct 1800ggatcgagga aaaaggttga caaagccacg atggcccagc cgcttgtttt cagtaaggtg 1860gtggaggcag tggtgcagac cagagaccaa atggtcggca gtcacatgga cctggtggac 1920acgtgtgttg ggacctccgt ggaaacaaac agtgtaggca tctcctgcca gcctgaatgt 1980aagaataaag tcgtagggcc tgagctgcct atgaattggt ggattgttaa ggagagggtg 2040gaaatgcatg accgatgtgc tgggaggtct gtggaaatgt gtgacaagag tgtgagtgtg 2100gaagtcagcg tctgcgaaac aggcagcaac acagaggagt ctgtgaacga cctcacactc 2160ctcaagacaa acttgaatct caaagaagtg cggtctatcg gttgtggaga ttgttctgtt 2220gacgtgaccg tctgctctcc aaaggagtgc gcctcccggg gcgtgaacac tgaggctgtt 2280agccaggtgg aagctgccgt catggcagtg cctcgtactg cagaccagga cactagcaca 2340gatttggaac aggtgcacca gttcaccaac accgagacgg ccaccctcat agagtcctgc 2400accaacactt gtctaagcac tttggacaag cagaccagca cccagactgt ggagacgcgg 2460acagtagctg taggagaagg ccgtgtcaag gacatcaact cctccaccaa gacgcggtcc 2520attggtgttg gaacgttgct ttctggccat tctgggtttg acaggccatc agctgtgaag 2580accaaagagt caggtgtggg gcagataaat attaacgaca actatctggt tggtctcaaa 2640atgaggacta tagcttgtgg gccaccacag ttgactgtgg ggctgacagc cagcagaagg 2700agcgtggggg ttggggatga ccctgtaggg gaatctctgg agaaccccca gcctcaagct 2760ccacttggaa tgatgactgg cctggatcac tacattgagc gtatccagaa gctgctggca 2820gaacagcaga cactgctggc tgagaactac agtgaactgg cagaagcttt cggggaacct 2880cactcacaga tgggctccct caactctcag ctcatcagca ccctgtcgtc tatcaactct 2940gtcatgaaat ctgcaagcac tgaagagctg aggaaccctg acttccagaa aaccagtctg 3000ggtaaaatca caggcaatta tttgggatat acctgtaagt gtgggggcct tcagtcagga 3060agtcccttaa gctcccagac atcccagcct gagcaagaag tggggacctc agaaggaaag 3120
ccaatcagca gcctggatgc cttccccact caggaaggta cgctgtctcc agtgaacctg 3180acagacgacc agatcgccgc tggcctctat gcatgtacaa acaatgaaag tacactgaag 3240tccatcatga agaagaaaga tggtaacaaa gattcaaatg gcgcaaaaaa gaatcttcag 3300tttgttggca ttaatggagg gtatgaaaca acttcaagtg atgattccag ctcagatgaa 3360agctcttctt ccgagtcaga tgacgagtgt gatgtcattg agtatcctct tgaagaagag 3420gaggaggagg aggatgaaga cactcgggga atggcagaag ggcaccatgc agttaatatt 3480gaaggtttga agtctgccag ggtggaagat gaaatgcagg ttcaagaatg tgaacctgag 3540aaggtggaaa tcagagagag gtatgaatta agtgaaaaga tgttgtctgc atgcaactta 3600ctgaaaaata ctataaatga ccccaaagct ttgaccagca aagatatgag gttctgtctg 3660aacaccctcc agcacgagtg gttccgcgtg tccagtcaga agtcagccat tccagccatg 3720gtgggggact acatagctgc ttttgaggcc atttccccag atgtcctccg ctatgtcatc 3780aacttggcag acggcaacgg caacacagcc ctccattaca gcgtgtccca ctccaacttc 3840gagattgtga agctgctgtt agatgccgat gtgtgtaatg tggatcacca gaacaaggca 3900ggctacaccc ccatcatgtt ggcggccctc gccgctgtgg aagcagagaa ggacatgcgg 3960attgtggaag aactcttcgg ctgtggggat gtgaatgcca aagctagtca ggcgggacag 4020acggccctca tgctggcggt cagtcacgga cggatagaca tggtgaaggg ccttctggcc 4080tgtggggctg atgtcaacat ccaggatgac gagggctcca cggccctcat gtgtgccagc 4140gagcacgggc acgtggagat tgtcaagctg ctgctggccc agcccggctg caacggtcac 4200ctagaggaca acgatggcag cactgcgctc tcaatcgccc tggaagcagg acacaaggac 4260atcgctgttc ttctgtatgc ccatgtcaac tttgcaaaag cccagtctcc gggcacccct 4320aggcttggaa ggaagacgtc tcctggcccc acccaccgag gttcatttga ttgattgtat 4380gcaaatagcc ctttatttac atgccactat taagctgcta attgttcctg ttggggtgac 4440agatactgaa tgtatacgta ttgtgcctga gctcaccagc aaacagaagc atcaagccca 4500ggggtaaagg ctgaagcttt cacagtgcag agactgctag cctgggcaca cacacctcct 4560ttctggccgt cttctgtgta gggcacactt taacccagtc tctgttgctg ttgagtctct 4620gctccgtttt gtacagtcac agggaattct gatctgaagg ggcaccttct gttcactccc 4680acaaagtggt gtctggttct cactgagacg ttttaagatt tttccacaaa tatttatatg 4740tactaaatgt ggaaccatta gaaagttctt ccaaaatctc attccagcat agttttggat 4800ttttcttttg tcttatttta aaataaggaa gtcgagatga ctttgatcat tggtaacttg 4860ggcctgggcc agacaaagta taaaacttac aaaagaatat tctcatttgg tcttaactag 4920
gtagatgtaa tatatgactt tttataaaaa gggtatctat atgaacttga cacagtattt 4980tcagcttttg tattccatac taaagccatg aagaactaca cgtaacatca tcatttgtat 5040taattgcaca actccaatgc taaaggttgg attgtgttag aggaatcggc tctgtatttg 5100cctctagaga aacacagtgt tctctttgta tttatggatt cctttttacc gtgtcacatt 5160tactttggtc ctctatgtat ttaaatgttt gaagtgcctt agactcttgc catattttca 5220aaataaaatt ccattaagct ctaaaaaaaa aaaaaaaaaa aaa 5263<210>5<211>1352<212>PRT<213>Homo sapiens<400>5Met Ala His Thr Thr Lys Val Asn Gly Ser Ala Ser Gly Lys Ala Gly1 5 10 15Asp Ile Leu Ser Gly Asp Gln Asp Lys Glu Gln Lys Asp Pro Tyr Phe20 25 30Val Glu Thr Pro Tyr Gly Tyr Gln Leu Asp Leu Asp Phe Leu Lys Tyr35 40 45Val Asp Asp Ile Gln Lys Gly Asn Thr Ile Lys Arg Leu Asn Ile Gln50 55 60Lys Arg Arg Lys Pro Ser Val Pro Cys Pro Glu Pro Arg Thr Thr Ser65 70 75 80Gly Gln Gln Gly Ile Trp Thr Ser Thr Glu Ser Leu Ser Ser Ser Asn85 90 95Ser Asp Asp Asn Lys Gln Cys Pro Asn Phe Leu Ile Ala Arg Ser Gln100 105 110Val Thr Ser Thr Pro Ile Ser Lys Pro Pro Pro Pro Leu Glu Thr Ser115 120 125Leu Pro Phe Leu Thr Ile Pro Glu Asn Arg Gln Leu Pro Pro Pro Ser130 135 140Pro Gln Leu Pro Lys His Asn Leu His Val Thr Lys Thr Leu Met Glu
145 150 155 160Thr Arg Arg Arg Leu Glu Gln Glu Arg Ala Thr Met Gln Met Thr Pro165 170 175Gly Glu Phe Arg Arg Pro Arg Leu Ala Ser Phe Gly Gly Met Gly Thr180 185 190Thr Ser Ser Leu Pro Ser Phe Val Gly Ser Gly Asn His Asn Pro Ala195 200 205Lys His Gln Leu Gln Asn Gly Tyr Gln Gly Asn Gly Asp Tyr Gly Ser210 215 220Tyr Ala Pro Ala Ala Pro Thr Thr Ser Ser Met Gly Ser Ser Ile Arg225 230 235 240His Ser Pro Leu Ser Ser Gly Ile Ser Thr Pro Val Thr Asn Val Ser245 250 255Pro Met His Leu Gln His Ile Arg Glu Gln Met Ala Ile Ala Leu Lys260 265 270Arg Leu Lys Glu Leu Glu Glu Gln Val Arg Thr Ile Pro Val Leu Gln275 280 285Val Lys Ile Ser Val Leu Gln Glu Glu Lys Arg Gln Leu Val Ser Gln290 295 300Leu Lys Asn Gln Arg Ala Ala Ser Gln Ile Asn Val Cys Gly Val Arg305 310 315 320Lys Arg Ser Tyr Ser Ala Gly Asn Ala Ser Gln Leu Glu Gln Leu Ser325 330 335Arg Ala Arg Arg Ser Gly Gly Glu Leu Tyr Ile Asp Tyr Glu Glu Glu340 345 350Glu Met Glu Thr Val Glu Gln Ser Thr Gln Arg Ile Lys Glu Phe Arg355 360 365Gln Leu Thr Ala Asp Met Gln Ala Leu Glu Gln Lys Ile Gln Asp Ser370 375 380Ser Cys Glu Ala Ser Ser Glu Leu Arg Glu Asn Gly Glu Cys Arg Ser
385 390 395 400Val Ala Val Gly Ala Glu Glu Asn Met Asn Asp Ile Val Val Tyr His405 410 415Arg Gly Ser Arg Ser Cys Lys Asp Ala Ala Val Gly Thr Leu Val Glu420 425 430Met Arg Asn Cys Gly Val Ser Val Thr Glu Ala Met Leu Gly Val Met435 440 445Thr Glu Ala Asp Lys Glu Ile Glu Leu Gln Gln Gln Thr Ile Glu Ser450 455 460Leu Lys Glu Lys Ile Tyr Arg Leu Glu Val Gln Leu Arg Glu Thr Thr465 470 475 480His Asp Arg Glu Met Thr Lys Leu Lys Gln Glu Leu Gln Ala Ala Gly485 490 495Ser Arg Lys Lys Val Asp Lys Ala Thr Met Ala Gln Pro Leu Val Phe500 505 510Ser Lys Val Val Glu Ala Val Val Gln Thr Arg Asp Gln Met Val Gly515 520 525Ser His Met Asp Leu Val Asp Thr Cys Val Gly Thr Ser Val Glu Thr530 535 540Asn Ser Val Gly Ile Ser Cys Gln Pro Glu Cys Lys Asn Lys Val Val545 550 555 560Gly Pro Glu Leu Pro Met Asn Trp Trp Ile Val Lys Glu Arg Val Glu565 570 575Met His Asp Arg Cys Ala Gly Arg Ser Val Glu Met Cys Asp Lys Ser580 585 590Val Ser Val Glu Val Ser Val Cys Glu Thr Gly Ser Asn Thr Glu Glu595 600 605Ser Val Asn Asp Leu Thr Leu Leu Lys Thr Asn Leu Asn Leu Lys Glu610 615 620Val Arg Ser Ile Gly Cys Gly Asp Cys Ser Val Asp Val Thr Val Cys
625 630 635 640Ser Pro Lys Glu Cys Ala Ser Arg Gly Val Asn Thr Glu Ala Val Ser645 650 655Gln Val Glu Ala Ala Val Met Ala Val Pro Arg Thr Ala Asp Gln Asp660 665 670Thr Ser Thr Asp Leu Glu Gln Val His Gln Phe Thr Asn Thr Glu Thr675 680 685Ala Thr Leu Ile Glu Ser Cys Thr Asn Thr Cys Leu Ser Thr Leu Asp690 695 700Lys Gln Thr Ser Thr Gln Thr Val Glu Thr Arg Thr Val Ala Val Gly705 710 715 720Glu Gly Arg Val Lys Asp Ile Asn Ser Ser Thr Lys Thr Arg Ser Ile725 730 735Gly Val Gly Thr Leu Leu Ser Gly His Ser Gly Phe Asp Arg Pro Ser740 745 750Ala Val Lys Thr Lys Glu Ser Gly Val Gly Gln Ile Asn Ile Asn Asp755 760 765Asn Tyr Leu Val Gly Leu Lys Met Arg Thr Ile Ala Cys Gly Pro Pro770 775 780Gln Leu Thr Val Gly Leu Thr Ala Ser Arg Arg Ser Val Gly Val Gly785 790 795 800Asp Asp Pro Val Gly Glu Ser Leu Glu Asn Pro Gln Pro Gln Ala Pro805 810 815Leu Gly Met Met Thr Gly Leu Asp His Tyr Ile Glu Arg Ile Gln Lys820 825 830Leu Leu Ala Glu Gln Gln Thr Leu Leu Ala Glu Asn Tyr Ser Glu Leu835 840 845Ala Glu Ala Phe Gly Glu Pro His Ser Gln Met Gly Ser Leu Asn Ser850 855 860Gln Leu Ile Ser Thr Leu Ser Ser Ile Asn Ser Val Met Lys Ser Ala
865 870 875 880Ser Thr Glu Glu Leu Arg Asn Pro Asp Phe Gln Lys Thr Ser Leu Gly885 890 895Lys Ile Thr Gly Asn Tyr Leu Gly Tyr Thr Cys Lys Cys Gly Gly Leu900 905 910Gln Ser Gly Ser Pro Leu Ser Ser Gln Thr Ser Gln Pro Glu Gln Glu915 920 925Val Gly Thr Ser Glu Gly Lys Pro Ile Ser Ser Leu Asp Ala Phe Pro930 935 940Thr Gln Glu Gly Thr Leu Ser Pro Val Asn Leu Thr Asp Asp Gln Ile945 950 955 960Ala Ala Gly Leu Tyr Ala Cys Thr Asn Asn Glu Ser Thr Leu Lys Ser965 970 975Ile Met Lys Lys Lys Asp Gly Asn Lys Asp Ser Asn Gly Ala Lys Lys980 985 990Asn Leu Gln Phe Val Gly Ile Asn Gly Gly Tyr Glu Thr Thr Ser Ser995 10001005Asp Asp Ser Ser Ser Asp Glu Ser Ser Ser Ser Glu Ser Asp Asp1010 1015 1020Glu Cys Asp Val Ile Glu Tyr Pro Leu Glu Glu Glu Glu Glu Glu1025 1030 1035Glu Asp Glu Asp Thr Arg Gly Met Ala Glu Gly His His Ala Val1040 1045 1050Asn Ile Glu Gly Leu Lys Ser Ala Arg Val Glu Asp Glu Met Gln1055 1060 1065Val Gln Glu Cys Glu Pro Glu Lys Val Glu Ile Arg Glu Arg Tyr1070 1075 1080Glu Leu Ser Glu Lys Met Leu Ser Ala Cys Asn Leu Leu Lys Asn1085 1090 1095Thr Ile Asn Asp Pro Lys Ala Leu Thr Ser Lys Asp Met Arg Phe
1100 1105 1110Cys Leu Asn Thr Leu Gln His Glu Trp Phe Arg Val Ser Ser Gln1115 1120 1125Lys Ser Ala Ile Pro Ala Met Val Gly Asp Tyr Ile Ala Ala Phe1130 1135 1140Glu Ala Ile Ser Pro Asp Val Leu Arg Tyr Val Ile Asn Leu Ala1145 1150 1155Asp Gly Asn Gly Asn Thr Ala Leu His Tyr Ser Val Ser His Ser1160 1165 1170Asn Phe Glu Ile Val Lys Leu Leu Leu Asp Ala Asp Val Cys Asn1175 1180 1185Val Asp His Gln Asn Lys Ala Gly Tyr Thr Pro Ile Met Leu Ala1190 1195 1200Ala Leu Ala Ala Val Glu Ala Glu Lys Asp Met Arg Ile Val Glu1205 1210 1215Glu Leu Phe Gly Cys Gly Asp Val Asn Ala Lys Ala Ser Gln Ala1220 1225 1230Gly Gln Thr Ala Leu Met Leu Ala Val Ser His Gly Arg Ile Asp1235 1240 1245Met Val Lys Gly Leu Leu Ala Cys Gly Ala Asp Val Asn Ile Gln1250 1255 1260Asp Asp Glu Gly Ser Thr Ala Leu Met Cys Ala Ser Glu His Gly1265 1270 1275His Val Glu Ile Val Lys Leu Leu Leu Ala Gln Pro Gly Cys Asn1280 1285 1290Gly His Leu Glu Asp Asn Asp Gly Ser Thr Ala Leu Ser Ile Ala1295 1300 1305Leu Glu Ala Gly His Lys Asp Ile Ala Val Leu Leu Tyr Ala His1310 1315 1320Val Asn Phe Ala Lys Ala Gln Ser Pro Gly Thr Pro Arg Leu Gly
1325 1330 1335Arg Lys Thr Ser Pro Gly Pro Thr His Arg Gly Ser Phe Asp1340 1345 1350<210>6<211>1194<212>PRT<213>Homo sapiens<400>6Met Glu Thr Arg Arg Arg Leu Glu Gln Glu Arg Ala Thr Met Gln Met1 5 10 15Thr Pro Gly Glu Phe Arg Arg Pro Arg Leu Ala Ser Phe Gly Gly Met20 25 30Gly Thr Thr Ser Ser Leu Pro Ser Phe Val Gly Ser Gly Asn His Asn35 40 45Pro Ala Lys His Gln Leu Gln Asn Gly Tyr Gln Gly Asn Gly Asp Tyr50 55 60Gly Ser Tyr Ala Pro Ala Ala Pro Thr Thr Ser Ser Met Gly Ser Ser65 70 75 80Ile Arg His Ser Pro Leu Ser Ser Gly Ile Ser Thr Pro Val Thr Asn85 90 95Val Ser Pro Met His Leu Gln His Ile Arg Glu Gln Met Ala Ile Ala100 105 110Leu Lys Arg Leu Lys Glu Leu Glu Glu Gln Val Arg Thr Ile Pro Val115 120 125Leu Gln Val Lys Ile Ser Val Leu Gln Glu Glu Lys Arg Gln Leu Val130 135 140Ser Gln Leu Lys Asn Gln Arg Ala Ala Ser Gln Ile Asn Val Cys Gly145 150 155 160Val Arg Lys Arg Ser Tyr Ser Ala Gly Asn Ala Ser Gln Leu Glu Gln165 170 175
Leu Ser Arg Ala Arg Arg Ser Gly Gly Glu Leu Tyr Ile Asp Tyr Glu180 185 190Glu Glu Glu Met Glu Thr Val Glu Gln Ser Thr Gln Arg Ile Lys Glu195 200 205Phe Arg Gln Leu Thr Ala Asp Met Gln Ala Leu Glu Gln Lys Ile Gln210 215 220Asp Ser Ser Cys Glu Ala Ser Ser Glu Leu Arg Glu Asn Gly Glu Cys225 230 235 240Arg Ser Val Ala Val Gly Ala Glu Glu Asn Met Asn Asp Ile Val Val245 250 255Tyr His Arg Gly Ser Arg Ser Cys Lys Asp Ala Ala Val Gly Thr Leu260 265 270Val Glu Met Arg Asn Cys Gly Val Ser Val Thr Glu Ala Met Leu Gly275 280 285Val Met Thr Glu Ala Asp Lys Glu Ile Glu Leu Gln Gln Gln Thr Ile290 295 300Glu Ser Leu Lys Glu Lys Ile Tyr Arg Leu Glu Val Gln Leu Arg Glu305 310 315 320Thr Thr His Asp Arg Glu Met Thr Lys Leu Lys Gln Glu Leu Gln Ala325 330 335Ala Gly Ser Arg Lys Lys Val Asp Lys Ala Thr Met Ala Gln Pro Leu340 345 350Val Phe Ser Lys Val Val Glu Ala Val Val Gln Thr Arg Asp Gln Met355 360 365Val Gly Ser His Met Asp Leu Val Asp Thr Cys Val Gly Thr Ser Val370 375 380Glu Thr Asn Ser Val Gly Ile Ser Cys Gln Pro Glu Cys Lys Asn Lys385 390 395 400Val Val Gly Pro Glu Leu Pro Met Asn Trp Trp Ile Val Lys Glu Arg405 410 415
Val Glu Met His Asp Arg Cys Ala Gly Arg Ser Val Glu Met Cys Asp420 425 430Lys Ser Val Ser Val Glu Val Ser Val Cys Glu Thr Gly Ser Asn Thr435 440 445Glu Glu Ser Val Asn Asp Leu Thr Leu Leu Lys Thr Asn Leu Asn Leu450 455 460Lys Glu Val Arg Ser Ile Gly Cys Gly Asp Cys Ser Val Asp Val Thr465 470 475 480Val Cys Ser Pro Lys Glu Cys Ala Ser Arg Gly Val Asn Thr Glu Ala485 490 495Val Ser Gln Val Glu Ala Ala Val Met Ala Val Pro Arg Thr Ala Asp500 505 510Gln Asp Thr Ser Thr Asp Leu Glu Gln Val His Gln Phe Thr Asn Thr515 520 525Glu Thr Ala Thr Leu Ile Glu Ser Cys Thr Asn Thr Cys Leu Ser Thr530 535 540Leu Asp Lys Gln Thr Ser Thr Gln Thr Val Glu Thr Arg Thr Val Ala545 550 555 560Val Gly Glu Gly Arg Val Lys Asp Ile Asn Ser Ser Thr Lys Thr Arg565 570 575Ser Ile Gly Val Gly Thr Leu Leu Ser Gly His Ser Gly Phe Asp Arg580 585 590Pro Ser Ala Val Lys Thr Lys Glu Ser Gly Val Gly Gln Ile Asn Ile595 600 605Asn Asp Asn Tyr Leu Val Gly Leu Lys Met Arg Thr Ile Ala Cys Gly610 615 620Pro Pro Gln Leu Thr Val Gly Leu Thr Ala Ser Arg Arg Ser Val Gly625 630 635 640Val Gly Asp Asp Pro Val Gly Glu Ser Leu Glu Asn Pro Gln Pro Gln645 650 655
Ala Pro Leu Gly Met Met Thr Gly Leu Asp His Tyr Ile Glu Arg Ile660 665 670Gln Lys Leu Leu Ala Glu Gln Gln Thr Leu Leu Ala Glu Asn Tyr Ser675 680 685Glu Leu Ala Glu Ala Phe Gly Glu Pro His Ser Gln Met Gly Ser Leu690 695 700Asn Ser Gln Leu Ile Ser Thr Leu Ser Ser Ile Asn Ser Val Met Lys705 710 715 720Ser Ala Ser Thr Glu Glu Leu Arg Asn Pro Asp Phe Gln Lys Thr Ser725 730 735Leu Gly Lys Ile Thr Gly Asn Tyr Leu Gly Tyr Thr Cys Lys Cys Gly740 745 750Gly Leu Gln Ser Gly Ser Pro Leu Ser Ser Gln Thr Ser Gln Pro Glu755 760 765Gln Glu Val Gly Thr Ser Glu Gly Lys Pro Ile Ser Ser Leu Asp Ala770 775 780Phe Pro Thr Gln Glu Gly Thr Leu Ser Pro Val Asn Leu Thr Asp Asp785 790 795 800Gln Ile Ala Ala Gly Leu Tyr Ala Cys Thr Asn Asn Glu Ser Thr Leu805 810 815Lys Ser Ile Met Lys Lys Lys Asp Gly Asn Lys Asp Ser Asn Gly Ala820 825 830Lys Lys Asn Leu Gln Phe Val Gly Ile Asn Gly Gly Tyr Glu Thr Thr835 840 845Ser Ser Asp Asp Ser Ser Ser Asp Glu Ser Ser Ser Ser Glu Ser Asp850 855 860Asp Glu Cys Asp Val Ile Glu Tyr Pro Leu Glu Glu Glu Glu Glu Glu865 870 875 880Glu Asp Glu Asp Thr Arg Gly Met Ala Glu Gly His His Ala Val Asn885 890 895
Ile Glu Gly Leu Lys Ser Ala Arg Val Glu Asp Glu Met Gln Val Gln900 905 910Glu Cys Glu Pro Glu Lys Val Glu Ile Arg Glu Arg Tyr Glu Leu Ser915 920 925Glu Lys Met Leu Ser Ala Cys Asn Leu Leu Lys Asn Thr Ile Asn Asp930 935 940Pro Lys Ala Leu Thr Ser Lys Asp Met Arg Phe Cys Leu Asn Thr Leu945 950 955 960Gln His Glu Trp Phe Arg Val Ser Ser Gln Lys Ser Ala Ile Pro Ala965 970 975Met Val Gly Asp Tyr Ile Ala Ala Phe Glu Ala Ile Ser Pro Asp Val980 985 990Leu Arg Tyr Val Ile Asn Leu Ala Asp Gly Asn Gly Asn Thr Ala Leu995 1000 1005His Tyr Ser Val Ser His Ser Asn Phe Glu Ile Val Lys Leu Leu1010 1015 1020Leu Asp Ala Asp Val Cys Asn Val Asp His Gln Asn Lys Ala Gly1025 1030 1035Tyr Thr Pro Ile Met Leu Ala Ala Leu Ala Ala Val Glu Ala Glu1040 1045 1050Lys Asp Met Arg Ile Val Glu Glu Leu Phe Gly Cys Gly Asp Val1055 1060 1065Asn Ala Lys Ala Ser Gln Ala Gly Gln Thr Ala Leu Met Leu Ala1070 1075 1080Val Ser His Gly Arg Ile Asp Met Val Lys Gly Leu Leu Ala Cys1085 1090 1095Gly Ala Asp Val Asn Ile Gln Asp Asp Glu Gly Ser Thr Ala Leu1100 1105 1110Met Cys Ala Ser Glu His Gly His Val Glu Ile Val Lys Leu Leu1115 1120 1125
Leu Ala Gln Pro Gly Cys Asn Gly His Leu Glu Asp Asn Asp Gly1130 1135 1140Ser Thr Ala Leu Ser Ile Ala Leu Glu Ala Gly His Lys Asp Ile1145 1150 1155Ala Val Leu Leu Tyr Ala His Val Asn Phe Ala Lys Ala Gln Ser1160 1165 1170Pro Gly Thr Pro Arg Leu Gly Arg Lys Thr Ser Pro Gly Pro Thr1175 1180 1185His Arg Gly Ser Phe Asp1190<210>7<211>4138<212>DNA<213>Homo sapiens<400>7cttctgtgct gttccttctt gcctctaact tgtaaacaag acgtactagg acgatgctaa 60tggaaagtca caaaccgctg ggtttttgaa aggatccttg ggacctcatg cacatttgtg 120gaaactggat ggagagattt ggggaagcat ggactcttta gccagcttag ttctctgtgg 180agtcagcttg ctcctttctg gaactgtgga aggtgccatg gacttgatct tgatcaattc 240cctacctctt gtatctgatg ctgaaacatc tctcacctgc attgcctctg ggtggcgccc 300ccatgagccc atcaccatag gaagggactt tgaagcctta atgaaccagc accaggatcc 360gctggaagtt actcaagatg tgaccagaga atgggctaaa aaagttgttt ggaagagaga 420aaaggctagt aagatcaatg gtgcttattt ctgtgaaggg cgagttcgag gagaggcaat 480caggatacga accatgaaga tgcgtcaaca agcttccttc ctaccagcta ctttaactat 540gactgtggac aagggagata acgtgaacat atctttcaaa aaggtattga ttaaagaaga 600agatgcagtg atttacaaaa atggttcctt catccattca gtgccccggc atgaagtacc 660tgatattcta gaagtacacc tgcctcatgc tcagccccag gatgctggag tgtactcggc 720caggtatata ggaggaaacc tcttcacctc ggccttcacc aggctgatag tccggagatg 780tgaagcccag aagtggggac ctgaatgcaa ccatctctgt actgcttgta tgaacaatgg 840tgtctgccat gaagatactg gagaatgcat ttgccctcct gggtttatgg gaaggacgtg 900
tgagaaggct tgtgaactgc acacgtttgg cagaacttgt aaagaaaggt gcagtggaca 960agagggatgc aagtcttatg tgttctgtct ccctgacccc tatgggtgtt cctgtgccac 1020aggctggaag ggtctgcagt gcaatgaagc atgccaccct ggtttttacg ggccagattg 1080taagcttagg tgcagctgca acaatgggga gatgtgtgat cgcttccaag gatgtctctg 1140ctctccagga tggcaggggc tccagtgtga gagagaaggc ataccgagga tgaccccaaa 1200gatagtggat ttgccagatc atatagaagt aaacagtggt aaatttaatc ccatttgcaa 1260agcttctggc tggccgctac ctactaatga agaaatgacc ctggtgaagc cggatgggac 1320agtgctccat ccaaaagact ttaaccatac ggatcatttc tcagtagcca tattcaccat 1380ccaccggatc ctcccccctg actcaggagt ttgggtctgc agtgtgaaca cagtggctgg 1440gatggtggaa aagcccttca acatttctgt taaagttctt ccaaagcccc tgaatgcccc 1500aaacgtgatt gacactggac ataactttgc tgtcatcaac atcagctctg agccttactt 1560tggggatgga ccaatcaaat ccaagaagct tctatacaaa cccgttaatc actatgaggc 1620ttggcaacat attcaagtga caaatgagat tgttacactc aactatttgg aacctcggac 1680agaatatgaa ctctgtgtgc aactggtccg tcgtggagag ggtggggaag ggcatcctgg 1740acctgtgaga cgcttcacaa cagcttctat cggactccct cctccaagag gtctaaatct 1800cctgcctaaa agtcagacca ctctaaattt gacctggcaa ccaatatttc caagctcgga 1860agatgacttt tatgttgaag tggagagaag gtctgtgcaa aaaagtgatc agcagaatat 1920taaagttcca ggcaacttga cttcggtgct acttaacaac ttacatccca gggagcagta 1980cgtggtccga gctagagtca acaccaaggc ccagggggaa tggagtgaag atctcactgc 2040ttggaccctt agtgacattc ttcctcctca accagaaaac atcaagattt ccaacattac 2100acactcctcg gctgtgattt cttggacaat attggatggc tattctattt cttctattac 2160tatccgttac aaggttcaag gcaagaatga agaccagcac gttgatgtga agataaagaa 2220tgccaccatc attcagtatc agctcaaggg cctagagcct gaaacagcat accaggtgga 2280catttttgca gagaacaaca tagggtcaag caacccagcc ttttctcatg aactggtgac 2340cctcccagaa tctcaagcac cagcggacct cggagggggg aagatgctgc ttatagccat 2400ccttggctct gctggaatga cctgcctgac tgtgctgttg gcctttctga tcatattgca 2460attgaagagg gcaaatgtgc aaaggagaat ggcccaagcc ttccaaaacg tgagggaaga 2520accagctgtg cagttcaact cagggactct ggccctaaac aggaaggtca aaaacaaccc 2580agatcctaca atttatccag tgcttgactg gaatgacatc aaatttcaag atgtgattgg 2640ggagggcaat tttggccaag ttcttaaggc gcgcatcaag aaggatgggt tacggatgga 2700
tgctgccatc aaaagaatga aagaatatgc ctccaaagat gatcacaggg actttgcagg 2760agaactggaa gttctttgta aacttggaca ccatccaaac atcatcaatc tcttaggagc 2820atgtgaacat cgaggctact tgtacctggc cattgagtac gcgccccatg gaaaccttct 2880ggacttcctt cgcaagagcc gtgtgctgga gacggaccca gcatttgcca ttgccaatag 2940caccgcgtcc acactgtcct cccagcagct ccttcacttc gctgccgacg tggcccgggg 3000catggactac ttgagccaaa aacagtttat ccacagggat ctggctgcca gaaacatttt 3060agttggtgaa aactatgtgg caaaaatagc agattttgga ttgtcccgag gtcaagaggt 3120gtacgtgaaa aagacaatgg gaaggctccc agtgcgctgg atggccatcg agtcactgaa 3180ttacagtgtg tacacaacca acagtgatgt atggtcctat ggtgtgttac tatgggagat 3240tgttagctta ggaggcacac cctactgcgg gatgacttgt gcagaactct acgagaagct 3300gccccagggc tacagactgg agaagcccct gaactgtgat gatgaggtgt atgatctaat 3360gagacaatgc tggcgggaga agccttatga gaggccatca tttgcccaga tattggtgtc 3420cttaaacaga atgttagagg agcgaaagac ctacgtgaat accacgcttt atgagaagtt 3480tacttatgca ggaattgact gttctgctga agaagcggcc taggacagaa catctgtata 3540ccctctgttt ccctttcact ggcatgggag acccttgaca actgctgaga aaacatgcct 3600ctgccaaagg atgtgatata taagtgtaca tatgtgctgg aattctaaca agtcataggt 3660taatatttaa gacactgaaa aatctaagtg atataaatca gattcttctc tctcatttta 3720tccctcacct gtagcatgcc agtcccgttt catttagtca tgtgaccact ctgtcttgtg 3780tttccacagc ctgcaagttc agtccaggat gctaacatct aaaaatagac ttaaatctca 3840ttgcttacaa gcctaagaat ctttagagaa gtatacataa gtttaggata aaataatggg 3900attttctttt cttttctctg gtaatattga cttgtatatt ttaagaaata acagaaagcc 3960tgggtgacat ttgggagaca tgtgacattt atatattgaa ttaatatccc tacatgtatt 4020gcacattgta aaaagtttta gttttgatga gttgtgagtt taccttgtat actgtaggca 4080cactttgcac tgatatatca tgagtgaata aatgtcttgc ctactcaaaa aaaaaaaa 4138<210>8<211>1124<212>PRT<213>Homo sapiens<400>8Met Asp Ser Leu Ala Ser Leu Val Leu Cys Gly Val Ser Leu Leu Leu
1 5 10 15Ser Gly Thr Val Glu Gly Ala Met Asp Leu Ile Leu Ile Asn Ser Leu20 25 30Pro Leu Val Ser Asp Ala Glu Thr Ser Leu Thr Cys Ile Ala Ser Gly35 40 45Trp Arg Pro His Glu Pro Ile Thr Ile Gly Arg Asp Phe Glu Ala Leu50 55 60Met Asn Gln His Gln Asp Pro Leu Glu Val Thr Gln Asp Val Thr Arg65 70 75 80Glu Trp Ala Lys Lys Val Val Trp Lys Arg Glu Lys Ala Ser Lys Ile85 90 95Asn Gly Ala Tyr Phe Cys Glu Gly Arg Val Arg Gly Glu Ala Ile Arg100 105 110Ile Arg Thr Met Lys Met Arg Gln Gln Ala Ser Phe Leu Pro Ala Thr115 120 125Leu Thr Met Thr Val Asp Lys Gly Asp Asn Val Asn Ile Ser Phe Lys130 135 140Lys Val Leu Ile Lys Glu Glu Asp Ala Val Ile Tyr Lys Asn Gly Ser145 150 155 160Phe Ile His Ser Val Pro Arg His Glu Val Pro Asp Ile Leu Glu Val165 170 175His Leu Pro His Ala Gln Pro Gln Asp Ala Gly Val Tyr Ser Ala Arg180 185 190Tyr Ile Gly Gly Asn Leu Phe Thr Ser Ala Phe Thr Arg Leu Ile Val195 200 205Arg Arg Cys Glu Ala Gln Lys Trp Gly Pro Glu Cys Asn His Leu Cys210 215 220Thr Ala Cys Met Asn Asn Gly Val Cys His Glu Asp Thr Gly Glu Cys225 230 235 240Ile Cys Pro Pro Gly Phe Met Gly Arg Thr Cys Glu Lys Ala Cys Glu
245 250 255Leu His Thr Phe Gly Arg Thr Cys Lys Glu Arg Cys Ser Gly Gln Glu260 265 270Gly Cys Lys Ser Tyr Val Phe Cys Leu Pro Asp Pro Tyr Gly Cys Ser275 280 285Cys Ala Thr Gly Trp Lys Gly Leu Gln Cys Asn Glu Ala Cys His Pro290 295 300Gly Phe Tyr Gly Pro Asp Cys Lys Leu Arg Cys Ser Cys Asn Asn Gly305 310 315 320Glu Met Cys Asp Arg Phe Gln Gly Cys Leu Cys Ser Pro Gly Trp Gln325 330 335Gly Leu Gln Cys Glu Arg Glu Gly Ile Pro Arg Met Thr Pro Lys Ile340 345 350Val Asp Leu Pro Asp His Ile Glu Val Asn Ser Gly Lys Phe Asn Pro355 360 365Ile Cys Lys Ala Ser Gly Trp Pro Leu Pro Thr Asn Glu Glu Met Thr370 375 380Leu Val Lys Pro Asp Gly Thr Val Leu His Pro Lys Asp Phe Asn His385 390 395 400Thr Asp His Phe Ser Val Ala Ile Phe Thr Ile His Arg Ile Leu Pro405 410 415Pro Asp Ser Gly Val Trp Val Cys Ser Val Asn Thr Val Ala Gly Met420 425 430Val Glu Lys Pro Phe Asn Ile Ser Val Lys Val Leu Pro Lys Pro Leu435 440 445Asn Ala Pro Asn Val Ile Asp Thr Gly His Asn Phe Ala Val Ile Asn450 455 460Ile Ser Ser Glu Pro Tyr Phe Gly Asp Gly Pro Ile Lys Ser Lys Lys465 470 475 480Leu Leu Tyr Lys Pro Val Asn His Tyr Glu Ala Trp Gln His Ile Gln
485 490 495Val Thr Asn Glu Ile Val Thr Leu Asn Tyr Leu Glu Pro Arg Thr Glu500 505 510Tyr Glu Leu Cys Val Gln Leu Val Arg Arg Gly Glu Gly Gly Glu Gly515 520 525His Pro Gly Pro Val Arg Arg Phe Thr Thr Ala Ser Ile Gly Leu Pro530 535 540Pro Pro Arg Gly Leu Asn Leu Leu Pro Lys Ser Gln Thr Thr Leu Asn545 550 555 560Leu Thr Trp Gln Pro Ile Phe Pro Ser Ser Glu Asp Asp Phe Tyr Val565 570 575Glu Val Glu Arg Arg Ser Val Gln Lys Ser Asp Gln Gln Asn Ile Lys580 585 590Val Pro Gly Asn Leu Thr Ser Val Leu Leu Asn Asn Leu His Pro Arg595 600 605Glu Gln Tyr Val Val Arg Ala Arg Val Asn Thr Lys Ala Gln Gly Glu610 615 620Trp Ser Glu Asp Leu Thr Ala Trp Thr Leu Ser Asp Ile Leu Pro Pro625 630 635 640Gln Pro Glu Asn Ile Lys Ile Ser Asn Ile Thr His Ser Ser Ala Val645 650 655Ile Ser Trp Thr Ile Leu Asp Gly Tyr Ser Ile Ser Ser Ile Thr Ile660 665 670Arg Tyr Lys Val Gln Gly Lys Asn Glu Asp Gln His Val Asp Val Lys675 680 685Ile Lys Asn Ala Thr Ile Ile Gln Tyr Gln Leu Lys Gly Leu Glu Pro690 695 700Glu Thr Ala Tyr Gln Val Asp Ile Phe Ala Glu Asn Asn Ile Gly Ser705 710 715 720Ser Asn Pro Ala Phe Ser His Glu Leu Val Thr Leu Pro Glu Ser Gln
725 730 735Ala Pro Ala Asp Leu Gly Gly Gly Lys Met Leu Leu Ile Ala Ile Leu740 745 750Gly Ser Ala Gly Met Thr Cys Leu Thr Val Leu Leu Ala Phe Leu Ile755 760 765Ile Leu Gln Leu Lys Arg Ala Asn Val Gln Arg Arg Met Ala Gln Ala770 775 780Phe Gln Asn Val Arg Glu Glu Pro Ala Val Gln Phe Asn Ser Gly Thr785 790 795 800Leu Ala Leu Asn Arg Lys Val Lys Asn Asn Pro Asp Pro Thr Ile Tyr805 810 815Pro Val Leu Asp Trp Asn Asp Ile Lys Phe Gln Asp Val Ile Gly Glu820 825 830Gly Asn Phe Gly Gln Val Leu Lys Ala Arg Ile Lys Lys Asp Gly Leu835 840 845Arg Met Asp Ala Ala Ile Lys Arg Met Lys Glu Tyr Ala Ser Lys Asp850 855 860Asp His Arg Asp Phe Ala Gly Glu Leu Glu Val Leu Cys Lys Leu Gly865 870 875 880His His Pro Asn Ile Ile Asn Leu Leu Gly Ala Cys Glu His Arg Gly885 890 895Tyr Leu Tyr Leu Ala Ile Glu Tyr Ala Pro His Gly Asn Leu Leu Asp900 905 910Phe Leu Arg Lys Ser Arg Val Leu Glu Thr Asp Pro Ala Phe Ala Ile915 920 925Ala Asn Ser Thr Ala Ser Thr Leu Ser Ser Gln Gln Leu Leu His Phe930 935 940Ala Ala Asp Val Ala Arg Gly Met Asp Tyr Leu Ser Gln Lys Gln Phe945 950 955 960Ile His Arg Asp Leu Ala Ala Arg Asn Ile Leu Val Gly Glu Asn Tyr
965 970 975Val Ala Lys Ile Ala Asp Phe Gly Leu Ser Arg Gly Gln Glu Val Tyr980 985 990Val Lys Lys Thr Met Gly Arg Leu Pro Val Arg Trp Met Ala Ile Glu995 1000 1005Ser Leu Asn Tyr Ser Val Tyr Thr Thr Asn Ser Asp Val Trp Ser1010 1015 1020Tyr Gly Val Leu Leu Trp Glu Ile Val Ser Leu Gly Gly Thr Pro1025 1030 1035Tyr Cys Gly Met Thr Cys Ala Glu Leu Tyr Glu Lys Leu Pro Gln1040 1045 1050Gly Tyr Arg Leu Glu Lys Pro Leu Asn Cys Asp Asp Glu Val Tyr1055 1060 1065Asp Leu Met Arg Gln Cys Trp Arg Glu Lys Pro Tyr Glu Arg Pro1070 1075 1080Ser Phe Ala Gln Ile Leu Val Ser Leu Asn Arg Met Leu Glu Glu1085 1090 1095Arg Lys Thr Tyr Val Asn Thr Thr Leu Tyr Glu Lys Phe Thr Tyr1100 1105 1110Ala Gly Ile Asp Cys Ser Ala Glu Glu Ala Ala1115 1120<210>9<211>1746<212>DNA<213>Homo sapiens<400>9tgcacagcct ctcttaaaac aacatgccac agttaaaact tacttcagtt tggacaacta 60ctcacagcta ctacacagag acccgaacga gtcactgata tacacctgga ccaccaccaa 120tggatataca aatggcaaac aattttactc cgccctctgc aactcctcag ggaaatgact 180gtgacctcta tgcacatcac agcacggcca ggatagtaat gcctctgcat tacagcctcg 240
tcttcatcat tgggctcgtg ggaaacttac tagccttggt cgtcattgtt caaaacagga 300aaaaaatcaa ctctaccacc ctctattcaa caaatttggt gatttctgat atacttttta 360ccaccgcttt gcctacacga atagcctact atgcaatggg ctttgactgg agaatcggag 420atgccttgtg taggataact gcgctagtgt tttacatcaa cacatatgca ggtgtgaact 480ttatgacctg cctgagtatt gaccgcttca ttgctgtggt gcaccctcta cgctacaaca 540agataaaaag gattgaacat gcaaaaggcg tgtgcatatt tgtctggatt ctagtatttg 600ctcagacact cccactcctc atcaacccta tgtcaaagca ggaggctgaa aggattacat 660gcatggagta tccaaacttt gaagaaacta aatctcttcc ctggattctg cttggggcat 720gtttcatagg atatgtactt ccacttataa tcattctcat ctgctattct cagatctgct 780gcaaactctt cagaactgcc aaacaaaacc cactcactga gaaatctggt gtaaacaaaa 840aggctctcaa cacaattatt cttattattg ttgtgtttgt tctctgtttc acaccttacc 900atgttgcaat tattcaacat atgattaaga agcttcgttt ctctaatttc ctggaatgta 960gccaaagaca ttcgttccag atttctctgc actttacagt atgcctgatg aacttcaatt 1020gctgcatgga cccttttatc tacttctttg catgtaaagg gtataagaga aaggttatga 1080ggatgctgaa acggcaagtc agtgtatcga tttctagtgc tgtgaagtca gcccctgaag 1140aaaattcacg tgaaatgaca gaaacgcaga tgatgataca ttccaagtct tcaaatggaa 1200agtgaaatgg attgtatttt ggtttatagt gacgtaaact gtatgacaaa ctttgcagga 1260cttcccttat aaagcaaaat aattgttcag cttccaatta gtattctttt atatttcttt 1320cattgggcac tttcccatct ccaactcgga agtaagccca agagaacaac ataaagcaaa 1380caacataaag cacaataaaa atgcaaataa atatttcatt tttatttgta aacgaataca 1440ccaaaaggag gcgctcttaa taactcccaa tgtaaaaagt tttgttttaa taaaaaattt 1500aattattatt tcttgccaac aaatggctag aaaggactga atagattata tattgccaga 1560tgttaatact gtaacatact ttttaaataa catatttctt aaatccaaat ttctctcaat 1620gttagattta attccctcaa taacaccaat gttttgtttt gtttcgttct gggtcataaa 1680actttgttaag gaactcttt tggaataaag agcaggatgc tgcaaagtta aaaaaaaaaa 1740aaaaaa1746<210>10<211>361<212>PRT<213>Homo sapiens
<400>10Met Asp Ile Gln Met Ala Asn Asn Phe Thr Pro Pro Ser Ala Thr Pro1 5 10 15Gln Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser Thr Ala Arg Ile20 25 30Val Met Pro Leu His Tyr Ser Leu Val Phe Ile Ile Gly Leu Val Gly35 40 45Asn Leu Leu Ala Leu Val Val Ile Val Gln Asn Arg Lys Lys Ile Asn50 55 60Ser Thr Thr Leu Tyr Ser Thr Asn Leu Val Ile Ser Asp Ile Leu Phe65 70 75 80Thr Thr Ala Leu Pro Thr Arg Ile Ala Tyr Tyr Ala Met Gly Phe Asp85 90 95Trp Arg Ile Gly Asp Ala Leu Cys Arg Ile Thr Ala Leu Val Phe Tyr100 105 110Ile Asn Thr Tyr Ala Gly Val Asn Phe Met Thr Cys Leu Ser Ile Asp115 120 125Arg Phe Ile Ala Val Val His Pro Leu Arg Tyr Asn Lys Ile Lys Arg130 135 140Ile Glu His Ala Lys Gly Val Cys Ile Phe Val Trp Ile Leu Val Phe145 150 155 160Ala Gln Thr Leu Pro Leu Leu Ile Asn Pro Met Ser Lys Gln Glu Ala165 170 175Glu Arg Ile Thr Cys Met Glu Tyr Pro Asn Phe Glu Glu Thr Lys Ser180 185 190Leu Pro Trp Ile Leu Leu Gly Ala Cys Phe Ile Gly Tyr Val Leu Pro195 200 205Leu Ile Ile Ile Leu Ile Cys Tyr Ser Gln Ile Cys Cys Lys Leu Phe210 215 220Arg Thr Ala Lys Gln Asn Pro Leu Thr Glu Lys Ser Gly Val Asn Lys
225 230 235 240Lys Ala Leu Asn Thr Ile Ile Leu Ile Ile Val Val Phe Val Leu Cys245 250 255Phe Thr Pro Tyr His Val Ala Ile Ile Gln His Met Ile Lys Lys Leu260 265 270Arg Phe Ser Asn Phe Leu Glu Cys Ser Gln Arg His Ser Phe Gln Ile275 280 285Ser Leu His Phe Thr Val Cys Leu Met Asn Phe Asn Cys Cys Met Asp290 295 300Pro Phe Ile Tyr Phe Phe Ala Cys Lys Gly Tyr Lys Arg Lys Val Met305 310 315 320Arg Met Leu Lys Arg Gln Val Ser Val Ser Ile Ser Ser Ala Val Lys325 330 335Ser Ala Pro Glu Glu Asn Ser Arg Glu Met Thr Glu Thr Gln Met Met340 345 350Ile His Ser Lys Ser Ser Asn Gly Lys355 360
權利要求
1.一種癌癥診斷用的陣列,其包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或部分區(qū)域的DNA,用于檢測出樣本試樣中的所述各基因的缺失。
2.一種癌癥診斷用的陣列,其包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或部分區(qū)域的DNA或RNA,用于分析樣本試樣中GMDS基因、ANKRD15基因、TEK基因或者EBI2基因。
3.權利要求2所述的陣列,其中所述分析為基因變異的檢測或基因表達量異常的檢測。
4.一種分析方法,其采用針對GMDS蛋白質、ANKRD15蛋白質、TEK蛋白質或者EBI2蛋白質的抗體或其片斷,來分析樣本試樣中GMDS蛋白質、ANKRD15蛋白質、TEK蛋白質或者EBI2蛋白質。
5.權利要求4所述的方法,其中所述分析為蛋白質表達量異常的檢測。
6.權利要求1-3任意一項所述的陣列,該陣列用于肺癌的診斷。
全文摘要
本發(fā)明提供一種癌診斷用陣列,其包含GMDS基因、ANKRD15基因、TEK基因或者EBI2基因的全部或部分區(qū)域的DNA,用于檢測出樣本試樣中的各基因的缺失或用于分析樣本試樣中的GMDS基因、ANKRD15基因、TEK基因或者EBI2基因。本發(fā)明還提供一種分析方法,該方法采用針對GMDS蛋白質、ANKRD15蛋白質、TEK蛋白質或者EBI2蛋白質的抗體或其片斷,來分析樣本試樣中GMDS蛋白質、ANKRD15蛋白質、TEK蛋白質或者EBI2蛋白質。本發(fā)明的陣列特別適用于肺癌的診斷。
文檔編號C12Q1/68GK101092649SQ20071009587
公開日2007年12月26日 申請日期2007年4月12日 優(yōu)先權日2006年4月12日
發(fā)明者稻澤讓治, 井本逸勢, 和泉宏幸, 橫井佐奈 申請人:富士膠片株式會社, 國立大學法人東京醫(yī)科齒科大學