Aminokiselinska sekvenca
uredi
Dužina polipeptidnog lanca je 4.388 aminokiselina , a molekulska težina 491.916 Da .[ 6]
10 20 30 40 50
MLEGLVAWVL NTYLGKYVNN LNTDQLSVAL LKGAVELENL PLKKDALKEL
ELPFEVKAGF IGKVTLQIPF YRPHVDPWVI SISSLHLIGA PEKIQDFNDE
KEKLLERERK KALLQALEEK WKNDRQQKGE SYWYSVTASV VTRIVENIEL
KIQDVHLRFE DGVTNPSHPF AFGICIKNVS MQNAVNEPVQ KLMRKKQLDV
AEFSIYWDVD CTLLGDLPQM ELQEAMARSM ESRSHHYVLE PVFASALLKR
NCSKKPLRSR HSPRIDCDIQ LETIPLKLSQ LQYRQIMEFL KELERKERQV
KFRRWKPKVA ISKNCREWWY FALNANLYEI REQRKRCTWD FMLHRARDAV
SYTDKYFNKL KGGLLSTDDK EEMCRIEEEQ SFEELKILRE LVHDRFHKQE
ELAESLREPQ FDSPGACPGA PEPGGGSGML QYLQSWFPGW GGWYGQQTPE
GNVVEGLSAE QQEQWIPEEI LGTEEFFDPT ADASCMNTYT KRDHVFAKLN
LQLQRGTVTL LHKEQGTPQM NESAFMQLEF SDVKLLAESL PRRNSSLLSV
RLGGLFLRDL ATEGTMFPLL VFPNPQKEVG RVSQSFGLQT TSADRSDHYP
AADPDGPVFE MLYERNPAHS HFERRLNVST RPLNIIYNPQ AIKKVADFFY
KGKVHTSGFG YQSELELRVA EAARRQYNKL KMQTKAEIRQ TLDRLLVGDF
IEESKRWTVR LDISAPQVIF PDDFKFKNPV LVVVDLGRML LTNTQDNSRR
KSRDGSASEE TQFSDDEYKT PLATPPNTPP PESSSSNGEK TPPFSGVEFS
EEQLQAHLMS TKMYERYSLS FMDLQIMVGR VKDNWKHVQD IDVGPTHVVE
KFNVHLQLER RLIYTSDPKY PGAVLSGNLP DLKIHINEDK ISALKNCFAL
LTTPEMKTSD TQIKEKIFPQ EEQRGSLQDS VMNLTQSIVL LEQHTREVLV
ESQLLLAEFK VNCMQLGVES NGRYISVLKV FGTNAHFVKR PYDAEVSLTV
HGLLLVDTMQ TYGADFDLLM ASHKNLSFDI PTGSLRDSRA QSPVSGPNVA
HLTDGATLND RSATSVSLDK ILTKEQESLI KLEYQFVSSE CPSMNLDSTL
QVISLQVNNL DIILNPETIV ELIGFLQKSF PKEKDDLSPQ PLMTDFERSF
REQGTYQSTY EQNTEVAVEI HRLNLLLLRT VGMANREKYG RKIATASIGG
TKVNVSMGST FDMNGSLGCL QLMDLTQDNV KNQYVVSIGN SVGYENIISD
IGYFESVFVR MEDAALTEAL SFTFVERSKQ ECFLNLKMAS LHYNHSAKFL
KELTLSMDEL EENFRGMLKS AATKVTTVLA TKTAEYSEMV SLFETPRKTR
EPFILEENEI YGFDLASSHL DTVKLILNIN IESPVVSIPR KPGSPELLVG
HLGQIFIQNF VAGDDESRSD RLQVEIKDIK LYSLNCTQLA GREAVGSEGS
RMFCPPSGSG SANSQEEAHF TRHDFFESLH RGQAFHILNN TTIQFKLEKI
PIERESELTF SLSPDDLGTS SIMKIEGKFV NPVQVVLAKH VYEQVLQTLD
NLVYSEDLNK YPASATSSPC PDSPLPPLST CGESSVERKE NGLFSHSSLS
NTSQKSLSVK EVKSFTQIQA TFCISELQVQ LSGDLTLGAQ GLVSLKFQDF
EVEFSKDHPQ TLSIQIALHS LLMEDLLEKN PDSKYKNLMV SRGAPKPSSL
AQKEYLSQSC PSVSNVEYPD MPRSLPSHME EAPNVFQLYQ RPTSASRKKQ
KEVQDKDYPL TPPPSPTVDE PKILVGKSKF DDSLVHINIF LVDKKHPEFS
SSYNRVNRSI DVDFNCLDVL ITLQTWVVIL DFFGIGSTAD NHAMRLPPEG
ILHNVKLEPH ASMESGLQDP VNTKLDLKVH SLSLVLNKTT SELAKANVSK
LVAHLEMIEG DLALQGSIGS LSLSDLTCHG EFYRERFTTS GEEALIFQTF
KYGRPDPLLR REHDIRVSLR MASVQYVHTQ RFQAEVVAFI QHFTQLQDVL
GRQRAAIEGQ TVRDQAQRCS RVLLDIEAGA PVLLIPESSR SNNLIVANLG
KLKVKNKFLF AGFPGTFSLQ DKESVPSASP TGIPKHSLRK TTSTEEPRGT
HSQGQFTMPL AGMSLGSLKS EFVPSTSTKQ QGPQPTLSVG QESSSPEDHV
CLLDCVVVDL QDMDIFAAER HPREYSKAPE DSSGDLIFPS YFVRQTGGSL
LTEPCRLKLQ VERNLDKEIS HTVPDISIHG NLSSVHCSLD LYKYKLIRGL
LENNLGEPIE EFMRPYDLQD PRIHTVLSGE VYTCMCFLID MVNVSLELKD
PKRKEGAGSL ARFDFKKCKL LYESFSNQTK SINLVSHSMM AFDTRYAGQK
TSPGMTNVFS CIFQPAKNSS TTQGSIQIEL HFRSTKDSSC FTVVLNNLRV
FLIFDWLLLV HDFLHTPSDI KKQNHVTPSR HRNSSSESAI VPKTVKSGVV
TKRSSLPVSN ERHLEVKVNV TGTEFVVIED VSCFDTNAII LKGTTVLTYK
PRFVDRPFSG SLFGIEVFSC RLGNEHDTAL SIVDPVQIQM ELVGNSSYQN
SSGLMDAFNS EDFPPVLEIQ LQALDIRLSY NDVQLFLAIA KSIPEQANAA
VPDSVALESD SVGTYLPGAS RVGEEIREGT RHTLDPVLEL QLARLQELGF
SMDDCRKALL ACQGQLKKAA SWLFKNAEPL KSLSLASTSR DSPGAVAAPL
ISGVEIKAES VCICFIDDCM DCDVPLAELT FSRLNFLQRV RTSPEGYAHF
TLSGDYYNRA LSGWEPFIEP WPCSVSWQQQ AASRLHPPRL KLEAKAKPRL
DINITSVLID QYVSTKESWM ADYCKDDKDI ESAKSEDWMG SSVDPPCFGQ
SLPLVYLRTR STASLTNLEH QIYARAEVKT PKRRQPFVPF ALRNHTGCTL
WFATLTTTPT RAALSHSGSP GVVPEGNGTF LDDTHNVSEW REVLTGEEIP
FEFEARGKLR HRHTHDLRIH QLQVRVNGWE QVSPVSVDKV GTFFRYAAPD
KNSSSSTIGS PSSRTNIIHP QVYFSSLPPV RVVFAVTMEG SARKVITVRS
ALIVRNRLET PMELRLDSPS APDKPVVLPA IMPGDSFAVP LHLTSWRLQA
RPKGLGVFFC KAPIHWTNVV KTAEISSSKR ECHSMDTEKS RFFRFCVAIK
KENYPDYMPS NIFSDSAKQI FRQPGHTIYL LPTVVICNLL PCELDFYVKG
MPINGTLKPG KEAALHTADT SQNIELGVSL ENFPLCKELL IPPGTQNYMV
RMRLYDVNRR QLNLTIRIVC RAEGSLKIFI SAPYWLINKT GLPLIFRQDN
AKTDAAGQFE EHELARSLSP LLFCYADKEQ PNLCTMRIGR GIHPEGMPGW
CQGFSLDGGS GVRALKVIQQ GNRPGLIYNI GIDVKKGRGR YIDTCMVIFA
PRYLLDNKSS HKLAFAQREF ARGQGTANPE GYISTLPGSS VVFHWPRNDY
DQLLCVRLMD VPNCIWSGGF EVNKNNSFHI NMRDTLGKCF FLRVEITLRG
ATYRISFSDT DQLPPPFRID NFSKVPVVFT QHGVAEPRLR TEVKPMTSLD
YAWDEPTLPP FITLTVKGAG SSEINCNMND FQDNRQLYYE NFIYIAATYT
FSGLQEGTGR PVASNKAITC AELVLDVSPK TQRVILKKKE PGKRSQLWRM
TGTGMLAHEG SSVPHNPNKP SAARSTEGSA ILDIAGLAAV TDNRYEPLML
RKPDRRRSTT QTWSFREGKL TCGLHGLVVQ AKGGLSGLFD GAEVVLGPDT
SMELLGPVPP EQQFINQKMR PGSGMLSIRV IPDGPTRALQ ITDFCHRKSS
RSYEVDELPV TEQELQKLKN PDTEQELEVL VRLEGGIGLS LINKVPEELV
FASLTGINVH YTQLATSHML ELSIQDVQVD NQLIGTTQPF MLYVTPLSNE
NEVIETGPAV QVNAVKFPSK SALTNIYKHL MITAQRFTVQ IEEKLLLKLL
SFFGYDQAES EVEKYDENLH EKTAEQGGTP IRYYFENLKI SIPQIKLSVF
TSNKLPLDLK ALKSTLGFPL IRFEDAVINL DPFTRVHPYE TKEFIINDIL
KHFQEELLSQ AARILGSVDF LGNPMGLLND VSEGVTGLIK YGNVGGLIRN
VTHGVSNSAA KFAGTLSDGL GKTMDNRHQS EREYIRYHAA TSGEHLVAGI
HGLAHGIIGG LTSVITSTVE GVKTEGGVSG FISGLGKGLV GTVTKPVAGA
LDFASETAQA VRDTATLSGP RTQAQRVRKP RCCTGPQGLL PRYSESQAEG
QEQLFKLTDN IQDEFFIAVE NIDSYCVLIS SKAVYFLKSG DYVDREAIFL
EVKYDDLYHC LVSKDHGKVY VQVTKKAVST SSGVSIPGPS HQKPMVHVKS
EVLAVKLSQE INYAKSLYYE QQLMLRLSEN REQLELDS
Dopunska literatura
uredi
Andersson B, Wentland MA, Ricafrente JY, et al. (1996). "A "double adaptor" method for improved shotgun library construction". Anal. Biochem . 236 (1): 107–13. doi :10.1006/abio.1996.0138 . PMID 8619474 .
Yu W, Andersson B, Worley KC, et al. (1997). "Large-Scale Concatenation cDNA Sequencing" . Genome Res . 7 (4): 353–8. doi :10.1101/gr.7.4.353 . PMC 139146 . PMID 9110174 .
Seki N, Ohira M, Nagase T, et al. (1998). "Characterization of cDNA clones in size-fractionated cDNA libraries from human brain" . DNA Res . 4 (5): 345–9. doi :10.1093/dnares/4.5.345 . PMID 9455484 .
Nakayama M, Kikuno R, Ohara O (2003). "Protein–Protein Interactions Between Large Proteins: Two-Hybrid Screening Using a Functionally Classified Library Composed of Long cDNAs" . Genome Res . 12 (11): 1773–84. doi :10.1101/gr.406902 . PMC 187542 . PMID 12421765 .
Strausberg RL, Feingold EA, Grouse LH, et al. (2003). "Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences" . Proc. Natl. Acad. Sci. U.S.A . 99 (26): 16899–903. doi :10.1073/pnas.242603899 . PMC 139241 . PMID 12477932 .
Ota T, Suzuki Y, Nishikawa T, et al. (2004). "Complete sequencing and characterization of 21,243 full-length human cDNAs" . Nat. Genet . 36 (1): 40–5. doi :10.1038/ng1285 . PMID 14702039 .
Gerhard DS, Wagner L, Feingold EA, et al. (2004). "The Status, Quality, and Expansion of the NIH Full-Length cDNA Project: The Mammalian Gene Collection (MGC)" . Genome Res . 14 (10B): 2121–7. doi :10.1101/gr.2596504 . PMC 528928 . PMID 15489334 .
Velayos-Baeza A, Vettori A, Copley RR, et al. (2005). "Analysis of the human VPS13 gene family". Genomics . 84 (3): 536–49. doi :10.1016/j.ygeno.2004.04.012 . PMID 15498460 .
Gregory SG, Barlow KF, McLay KE, et al. (2006). "The DNA sequence and biological annotation of human chromosome 1" . Nature . 441 (7091): 315–21. Bibcode :2006Natur.441..315G . doi :10.1038/nature04727 . PMID 16710414 .