BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000108.1_g3020.1
(743 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits) Value
BAA22288.1 polyprotein [Oryza australiensis]                          820   0.0
AAC26250.1 contains similarity to reverse transcriptase (Pfam: r...   781   0.0
ABA98367.2 retrotransposon protein, putative, Ty1-copia subclass...   717   0.0
>BAA22288.1 polyprotein [Oryza australiensis]
Length = 1317
Score = 820 bits (2118), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/698 (58%), Positives = 501/698 (71%), Gaps = 52/698 (7%)
Query: 19 WEEDQRRIL-DASKKESVIEILEREVEVGAYVIEVNLADMADSPSWVYDTGCGAHIISEL 77
W+ + ++ + D KK+S G VI++NLA + + SWV+DTG AH L
Sbjct: 264 WKRNCKKYMEDLKKKQSTTS------ASGINVIDINLA-TSPTDSWVFDTGSVAHSCKSL 316
Query: 78 DELKEVKYLIKSEMVLRFANKARVAAVAVGTYELSLETGLILVLHNCYYVPTISRNIISA 137
++ + L + E+ LR N A VA VAVGT L L +GL+L L+NCY VPT+ +N+ISA
Sbjct: 317 QGMRRSRGLRRGEVNLRVGNGASVATVAVGTVPLHLPSGLVLELNNCYCVPTLCQNVISA 376
Query: 138 SVLDKEGYHAIIKDKSCSLYKGEMFYTSAKLCNGLYVVNIE-DEILNIDTKRLRTHDSNQ 196
S L EGY + CS+Y +MFY A L NGLYV+N+E I NI+T+R ++D N
Sbjct: 377 SCLQAEGYDFRSMNNGCSIYLRDMFYFHAPLVNGLYVLNLEASPIYNINTERQLSNDINP 436
Query: 197 SYLWHCRLGHINMKRMQKLHSDGLLGSCDLESYDTCEPCLVGKMTRSSFKGKGDRVSEPL 256
+++WHCRLGHIN KRM+KLH DGLL S D ES++TCE CL+GKMT++ F G +R S+ L
Sbjct: 437 TFIWHCRLGHINKKRMEKLHKDGLLHSFDFESFETCESCLLGKMTKAPFTGHSERASDLL 496
Query: 257 GLIHTDVCGPMSTLARGNYGYFITFTDDFTRYGYVYLMRHKSESFEVFKQFQNEVENQLG 316
L+HTDVCGPMS+ ARG Y YFITFTDDF+RYGY+YLMRHKSESFE FK+FQNEV+N LG
Sbjct: 497 ALVHTDVCGPMSSTARGGYQYFITFTDDFSRYGYIYLMRHKSESFEKFKEFQNEVQNHLG 556
Query: 317 KKIKAIRSDRGGKYLSQEFEDHLRSCEIVSQLTPPGTPQMNGVSERRNRTLMDMVRSMMN 376
K IK +RSDRGG+Y+SQEF +HL+ C IV QLTPPGTPQ NGVSERRNRTL+DMVRSMM+
Sbjct: 557 KTIKFLRSDRGGEYVSQEFGNHLKDCGIVPQLTPPGTPQWNGVSERRNRTLLDMVRSMMS 616
Query: 377 LADLPNTFWGYALETAAFTLNRAPSKAVEKTPYEIWTRKVPKLSFLKVWGCDVYVKRLQG 436
+DLP +FWGYALETAA TLNR PSK+VEKTPYEIWT + P LSFLK+WGC+ YVKRLQ
Sbjct: 617 QSDLPLSFWGYALETAALTLNRVPSKSVEKTPYEIWTGQPPSLSFLKIWGCEAYVKRLQS 676
Query: 437 DKLAPRSDKCLFVGYPKETKGYYFYNPSENKVFVARDDVFLEKEFLSKKTSGRNIEIEEV 496
DKL P+SDKC VGYPKETKGYYFYN + KVFVAR VFLEKEFLS++ SG + +EEV
Sbjct: 677 DKLTPKSDKCFVVGYPKETKGYYFYNREQAKVFVARHGVFLEKEFLSRRVSGIRVHLEEV 736
Query: 497 RSEQQTDTVVPMSEVAIPNYYEPTEKEREDEMVDLGDPPAQSECDSQENPQGVESVSPVV 556
+ +T + TE ++ED+ V PVV
Sbjct: 737 QETPETVSAT-------------TEPQQEDQSV----------------------APPVV 761
Query: 557 Q--APRRSARLIEL------SQQQELLLVEDNEPSTYTEAMTSPDSEKWLGAMKSEMESM 608
APRRS R ++Q+++LL++++EP TY EAM DS KWLGAMKSE+ESM
Sbjct: 762 DTPAPRRSERSRRAPDRYTGAEQRDILLLDNDEPKTYEEAMVGHDSNKWLGAMKSEIESM 821
Query: 609 SENQVWNLVDLPDGVKPIGCKWIFKMKTDKDGNVSVFKARLVAKGYRQVQGIDYEETFSP 668
+NQVWNLVD PDGVK I CKW+FK K D DGNV ++KARLVAKG++Q+QG+DY+ETFSP
Sbjct: 822 YDNQVWNLVDPPDGVKTIECKWLFKKKADMDGNVHIYKARLVAKGFKQIQGVDYDETFSP 881
Query: 669 VVMLKSIRIILAIAAHYDYEIWQMDVKTAFLNGKAKMD 706
V MLKSIRIILAIAA++DYEIWQMDVKTAFLNG D
Sbjct: 882 VAMLKSIRIILAIAAYFDYEIWQMDVKTAFLNGNLSED 919
>AAC26250.1 contains similarity to reverse transcriptase (Pfam: rvt.hmm, score
19.29) [Arabidopsis thaliana] CAB80804.1 putative
retrotransposon protein [Arabidopsis thaliana]
Length = 964
Score = 781 bits (2017), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/595 (64%), Positives = 448/595 (75%), Gaps = 44/595 (7%)
Query: 117 LILVLHNCYYVPTISRNIISASVLDKEGYHAIIKDKSCSLYKGEMFYTSAKLCNGLYVVN 176
++L L NCYYVP I++NIIS S LD EG+H IK+K CS + +MFY SA L NGL+V+N
Sbjct: 1 MVLELKNCYYVPAINKNIISVSCLDMEGFHFSIKNKCCSFDRDDMFYGSAPLDNGLHVLN 60
Query: 177 IEDEILNIDTKRLRTHDSNQSYLWHCRLGHINMKRMQKLHSDGLLGSCDLESYDTCEPCL 236
I NI TK+ +++D N ++LWHCRLGHIN K +QKLHSDGLL S D ESY+TCE CL
Sbjct: 61 QSMPIYNIRTKKFKSNDLNPTFLWHCRLGHINEKHIQKLHSDGLLNSFDYESYETCESCL 120
Query: 237 VGKMTRSSFKGKGDRVSEPLGLIHTDVCGPMSTLARGNYGYFITFTDDFTRYGYVYLMRH 296
+GKMT++ F G +R S+ LGLIHTDVCGPMST ARGNY YFITFTDDF+RYGYVYLM+H
Sbjct: 121 LGKMTKAPFTGHSERASDLLGLIHTDVCGPMSTSARGNYQYFITFTDDFSRYGYVYLMKH 180
Query: 297 KSESFEVFKQFQNEVENQLGKKIKAIRSDRGGKYLSQEFEDHLRSCEIVSQLTPPGTPQM 356
KS+SFE FK+FQNEV+NQ GK IKA+RSDRGG+YLSQ F DHLR C IVSQLTPPGTPQ
Sbjct: 181 KSKSFENFKEFQNEVQNQFGKSIKALRSDRGGEYLSQVFSDHLRECGIVSQLTPPGTPQW 240
Query: 357 NGVSERRNRTLMDMVRSMMNLADLPNTFWGYALETAAFTLNRAPSKAVEKTPYEIWTRKV 416
NGVSERRNRTL+DMVRSMM+ DLP+ FWGYALET+AF LNR PSK+VEKTPYEIWT KV
Sbjct: 241 NGVSERRNRTLLDMVRSMMSHTDLPSPFWGYALETSAFMLNRCPSKSVEKTPYEIWTGKV 300
Query: 417 PKLSFLKVWGCDVYVKRLQGDKLAPRSDKCLFVGYPKETKGYYFYNPSENKVFVARDDVF 476
P LSFLK+WGC+ Y KRL DKL P+SDKC FVGYPKETKGYYFY+P++NKVFV R+ F
Sbjct: 301 PNLSFLKIWGCESYAKRLITDKLGPKSDKCYFVGYPKETKGYYFYHPTDNKVFVVRNGAF 360
Query: 477 LEKEFLSKKTSGRNIEIEEVRSEQQTDTVVPMSEVAIPNYYEPTEKEREDEMVDLGDPPA 536
LE+EFLSK TSG + +EEVR E Q D VP S+ E+ +DL
Sbjct: 361 LEREFLSKGTSGSKVLLEEVR-EPQGD--VPTSQ--------------EEHQLDLR---- 399
Query: 537 QSECDSQENPQGVESVSPVVQAP--RRSARLIE--------LSQQQELLLVEDNEPSTYT 586
V P++ P RRS R + L ++E +EP++Y
Sbjct: 400 -------------RVVEPILVEPEVRRSERSRHEPDRFRDWVMDDHALFMIESDEPTSYE 446
Query: 587 EAMTSPDSEKWLGAMKSEMESMSENQVWNLVDLPDGVKPIGCKWIFKMKTDKDGNVSVFK 646
EA+ PDS+KWL A KSEMESMS+N+VW LVDLPDGVKPI CKWIFK K D DGN+ ++K
Sbjct: 447 EALMGPDSDKWLEAAKSEMESMSQNKVWTLVDLPDGVKPIECKWIFKKKIDMDGNIQIYK 506
Query: 647 ARLVAKGYRQVQGIDYEETFSPVVMLKSIRIILAIAAHYDYEIWQMDVKTAFLNG 701
A LVAKGY+QV GIDY+ET+SPV MLKSIRI+LA AAHYDYEIWQMDVKTAFLNG
Sbjct: 507 AGLVAKGYKQVHGIDYDETYSPVAMLKSIRILLATAAHYDYEIWQMDVKTAFLNG 561
>ABA98367.2 retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa
Japonica Group]
Length = 1745
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/579 (60%), Positives = 431/579 (74%), Gaps = 18/579 (3%)
Query: 129 TISRNIISASVLDKEGYHAIIKDKSCSLYKGEMFYTSAKLCNGLYVVNIED-EILNIDTK 187
+ +N+ISAS L EGY D CS+Y ++FY A + +GLY+VN++ I NI+ K
Sbjct: 844 ALCKNVISASCLQAEGYGFRSVDNGCSVYYNDIFYFHAPMMSGLYIVNLDGCSIYNINAK 903
Query: 188 RLRTHDSNQSYLWHCRLGHINMKRMQKLHSDGLLGSCDLESYDTCEPCLVGKMTRSSFKG 247
R +D N +++WHCRLGHIN KRM+KLH DGLL S D ES++TCE CL+GKMT++ F G
Sbjct: 904 RQGPNDLNPTFIWHCRLGHINEKRMEKLHRDGLLHSLDFESFETCESCLLGKMTKAPFTG 963
Query: 248 KGDRVSEPLGLIHTDVCGPMSTLARGNYGYFITFTDDFTRYGYVYLMRHKSESFEVFKQF 307
+ +R SE L L+HTDVCGPMS+ AR +GYFITFTDDF+RYGYVYLMRHKSESFE FK+F
Sbjct: 964 QSERTSELLVLVHTDVCGPMSSTARCGFGYFITFTDDFSRYGYVYLMRHKSESFEKFKEF 1023
Query: 308 QNEVENQLGKKIKAIRSDRGGKYLSQEFEDHLRSCEIVSQLTPPGTPQMNGVSERRNRTL 367
QNEV+N L K IK +RSD GG+YLS EF +HL+ C IV QLTPPGTPQ NGVSERRNRTL
Sbjct: 1024 QNEVQNHLRKTIKYLRSDHGGEYLSLEFGNHLKGCGIVPQLTPPGTPQWNGVSERRNRTL 1083
Query: 368 MDMVRSMMNLADLPNTFWGYALETAAFTLNRAPSKAVEKTPYEIWTRKVPKLSFLKVWGC 427
++MVRSMM+ +L +FWGYALETAAFTLNR PSK+V+KTPYEIWT K P LSFLK+WGC
Sbjct: 1084 LNMVRSMMSQTNLLLSFWGYALETAAFTLNRVPSKSVDKTPYEIWTGKRPSLSFLKIWGC 1143
Query: 428 DVYVKRLQGDKLAPRSDKCLFVGYPKETKGYYFYNPSENKVFVARDDVFLEKEFLSKKTS 487
+VYVKRLQ DKL P+S+KC FVGYPKETKGYY YN E KVFVAR VFL+KEF+S+K S
Sbjct: 1144 EVYVKRLQSDKLTPKSNKCFFVGYPKETKGYYLYNREEGKVFVARHGVFLKKEFISRKDS 1203
Query: 488 GRNIEIEEVRSEQQTDTVVPMSEVAIPNYYEPTEKEREDEMVDLGDPPAQSECDSQENPQ 547
G + +EE+ Q+T S + ++ + + + + PA +
Sbjct: 1204 GSIVRLEEI---QETPENASTSTQPQQAEQDVVQQVEQVVVEPVVEAPASRRSER----- 1255
Query: 548 GVESVSPVVQAPRRSARLIELSQQQELLLVEDNEPSTYTEAMTSPDSEKWLGAMKSEMES 607
+ + P R A L Q+++LL++++EP+TY EAM PDSEKW GAMKSE+ES
Sbjct: 1256 -------IRRTPARYALLT--IGQRDILLLDNDEPTTYEEAMVGPDSEKWPGAMKSEIES 1306
Query: 608 MSENQVWNLVDLPDGVKPIGCKWIFKMKTDKDGNVSVFKARLVAKGYRQVQGIDYEETFS 667
M NQVWNLVD PDGVK I CKW+FK KT DGNV ++KARLVAKG+RQ+QG+DY+ETFS
Sbjct: 1307 MHVNQVWNLVDPPDGVKAIECKWVFKKKTYVDGNVHIYKARLVAKGFRQIQGVDYDETFS 1366
Query: 668 PVVMLKSIRIILAIAAHYDYEIWQMDVKTAFLNGKAKMD 706
PV MLKSI+I+LAIAA++DYEIWQMDVKTAFLNG D
Sbjct: 1367 PVAMLKSIQIVLAIAAYFDYEIWQMDVKTAFLNGNLDED 1405