BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000058.1_g0520.1
(663 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value

KYP61936.1 Retrovirus-related Pol polyprotein from transposon TN...   500   e-169
CAN79309.1 hypothetical protein VITISV_020559 [Vitis vinifera]        520   e-162
BAG72096.1 Gag-protease-integrase-RT-RNaseH polyprotein [Glycine...   501   e-159
>KYP61936.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94
[Cajanus cajan]
Length = 428
Score = 500 bits (1287), Expect = e-169, Method: Compositional matrix adjust.
Identities = 249/389 (64%), Positives = 289/389 (74%), Gaps = 3/389 (0%)
Query: 8 KRVKCFFRRKVGHEKKDCLKYKNWLEKKGNLISCVCHESFFVDAPTNTWWIDSGSTIHIV 67
K KCFF +K GH KKDCLK+KNWLEKKGNL + VC+ES NTWWIDSGSTIH+
Sbjct: 34 KESKCFFCKKKGHMKKDCLKFKNWLEKKGNLFAFVCYESNMTSTNHNTWWIDSGSTIHVS 93
Query: 68 NTMHGFLDLRKPKGNEVGIYSGNRMRSQVEAVGTFRLILKTGFVLDLNNVFYVPSFSKNL 127
NT+ G +LRKP G+E IYSG++M S VEAVGT L+L +GF+L+L FYVPSFSKNL
Sbjct: 94 NTLQGMQNLRKPMGSEQCIYSGSKMSSHVEAVGTCNLVLSSGFILNLEKTFYVPSFSKNL 153
Query: 128 ISVSKLVDVDYDFLFKKPCFPILKNNFSVGGGTLIDGLYKIELDPTFEHNYLNMHADVGI 187
IS+S+L + + F F F +L + +G G L DGLY I L + Y +MH G+
Sbjct: 154 ISISQLAPLGFSFKFMDSGFTLLNKSKVIGFGELRDGLYSINLQNN-DAAYNSMHVSSGL 212
Query: 188 KRNRIDENSSTLWHRRLGHISIERVKRLVKDGVLKTLDFTDFGTCVDCIKGKQTNKTSKG 247
KR ++E+SS LWHRRLGHISI+R+KRLV DGVL TLDF DF TCVDCIKGKQTNK+ KG
Sbjct: 213 KRCVMNEDSSMLWHRRLGHISIDRIKRLVNDGVLSTLDFADFETCVDCIKGKQTNKSKKG 272
Query: 248 DKRSSQILEIIHTDICGPFPTPCLNGQRYFISFIDDHTRFMYLYLLLEKAEAFDAFKSFK 307
KRSS ILEIIHTDIC P N RYFI+FIDD++R+MYLYLL K EA DAFK FK
Sbjct: 273 AKRSSNILEIIHTDICCPDMDA--NSPRYFITFIDDYSRYMYLYLLRSKDEALDAFKVFK 330
Query: 308 AEVEKQKDMKIKIVRSDRGGEYYGRYTEKGQMPGPFAKFLEEEGIVAQYTMSDTPQQNGV 367
AEVE Q +IKIVRSDRGGEYYG+YTE GQ PG FA+FL+E G VAQYTM +P QNGV
Sbjct: 331 AEVELQCGKQIKIVRSDRGGEYYGKYTENGQAPGLFARFLQEHGFVAQYTMLSSPDQNGV 390
Query: 368 AERRNRTLMDMVRSMISNTNLPLSLRSEA 396
AERRNRTLMDMVRSM SN LP L +EA
Sbjct: 391 AERRNRTLMDMVRSMRSNAKLPQFLWAEA 419
>CAN79309.1 hypothetical protein VITISV_020559 [Vitis vinifera]
Length = 1851
Score = 520 bits (1339), Expect = e-162, Method: Compositional matrix adjust.
Identities = 304/674 (45%), Positives = 399/674 (59%), Gaps = 67/674 (9%)
Query: 11 KCFFRRKVGHEKKDCLKYKNWLEKKGNLISCVCHESFFVDAPTNTWWIDSGSTIHIVNTM 70
KCFF +K GH KK C K++ WLEKKG IS C+ES VD NTWWI SGSTIH+ NT+
Sbjct: 236 KCFFYKKKGHMKKGCTKFQKWLEKKGKPISFFCYESNMVDVIYNTWWIYSGSTIHVSNTL 295
Query: 71 HGFLDLRKPKGNEVGIYSGNRMRSQVEAVGTFRLILKTGFVLDLNNVFYVPSFSKNLISV 130
G +LRKP +E IYSGN+MRS VEAVGT L+L +GF+L+L FYVPSFS+NLISV
Sbjct: 296 QGMQNLRKPMPSEQCIYSGNKMRSHVEAVGTCNLVLSSGFILNLEKTFYVPSFSRNLISV 355
Query: 131 SKLVDVDYDFLFKKPCFPILKNNFSVGGGTLIDGLYKIELDPTFEHNYLNMHADVGIKRN 190
S++V + Y F F + F + + VG GTL +GL+ I+L +N MH +G KR
Sbjct: 356 SRIVPLGYSFSFYETSFSLFYKSNLVGNGTLSNGLFSIDLQNDTTNN--TMHVHIGTKRC 413
Query: 191 RIDENSSTLWHRRLGHISIERVKRLVKDGVLKTLDFTDFGTCVDCIKGKQTNKTSKGDKR 250
++E+S LWHRRLGHISI+R+KRLV DGVL TLDFTDF TC+DCIKGKQTNK+ KG KR
Sbjct: 414 VMNEDSFMLWHRRLGHISIQRIKRLVNDGVLNTLDFTDFHTCMDCIKGKQTNKSKKGAKR 473
Query: 251 SSQILEIIHTDICGPFPTPCLNGQRYFISFIDDHTRFMYLYLLLEKAEAFDAFKSFKAEV 310
S ILEIIH+DIC P +G +YFISFIDD++R+MY+YLL K E AFK FKAEV
Sbjct: 474 SIDILEIIHSDIC--CPDMDAHGPKYFISFIDDYSRYMYIYLLHNKNETLGAFKVFKAEV 531
Query: 311 EKQKDMKIKIVRSDRGGEYYGRYTEKGQMPGPFAKFLEEEGIVAQYTMS--DTPQQNGVA 368
EKQ +IKIVR+DRGGE+YGRYTE GQ PGPFAKFL+E GIVAQYTM P++ +
Sbjct: 532 EKQCGKQIKIVRTDRGGEHYGRYTEDGQAPGPFAKFLQEHGIVAQYTMPRFPRPERTRIV 591
Query: 369 ERRNRTLMDMVRSMISNTNLPLSLRSEAKGYTQKEGIDCHETFSPVSKKDSLRIIMALVA 428
E RN +D +IS ++ + S +K+ ID + S + K +A+ +
Sbjct: 592 ELRNAKFLD--NDLISGSDRFQDIVS------KKDHIDAQPSTSTIKKIYKSEKSLAIPS 643
Query: 429 HFDLELHQMDVNTAFLNGHLGEEVYMVQPEGFRDENDHHLVCKLRKSIYGLKQASRQWYL 488
+ + L ++D + N P+ F + CK S WY
Sbjct: 644 DYVVYLQKLDYDIRVEN----------DPKTFL----QAISCK----------ESNLWYD 679
Query: 489 KFHNVITSFGFTE--NIVDQCIYLKVCGSKYIFLVL--YVDDILLATSDL---GLLHETK 541
+ + S E N+V+ K K++F + +I + L GL
Sbjct: 680 AMKDEMNSMASNEVWNLVELPDGSKAIECKWVFKTKKDSLGNIERYKARLVAKGLTQNKG 739
Query: 542 KFLSQTFEMKDLGEASNVI-------GIEIHRDRSKRSL---GLSQKAYIKR-------- 583
+TF ++ VI +E+ + K + L +K Y+K+
Sbjct: 740 IDYKETFSPVSKKDSLRVIMTLVAHFDLELQQMDVKTAFLNGNLEEKIYMKQLEGFSSSG 799
Query: 584 ----ILERFRMHKCASLVAPIAKGEKLTQSQCPHNALEQGEMKDIPYASAVGSLLYAQVC 639
+LERFRM C+ +API KG++ QCP N LE+ +MK+IPYAS VGSL+Y QVC
Sbjct: 800 GEHLVLERFRMKDCSPSIAPIVKGDRFNLDQCPKNDLEREQMKNIPYASDVGSLMYVQVC 859
Query: 640 TRPDLAFVVGLLGR 653
TRPD+AF VG+LGR
Sbjct: 860 TRPDIAFTVGMLGR 873
>BAG72096.1 Gag-protease-integrase-RT-RNaseH polyprotein [Glycine max]
Length = 1321
Score = 501 bits (1291), Expect = e-159, Method: Compositional matrix adjust.
Identities = 250/389 (64%), Positives = 289/389 (74%), Gaps = 4/389 (1%)
Query: 8 KRVKCFFRRKVGHEKKDCLKYKNWLEKKGNLISCVCHESFFVDAPTNTWWIDSGSTIHIV 67
K KCFF +K GH KK+C ++ WLEKKG IS VC+ES V NTWWIDSGSTIHI
Sbjct: 204 KVAKCFFCKKKGHMKKNCPGFQKWLEKKGKSISLVCYESNMVSVNINTWWIDSGSTIHIA 263
Query: 68 NTMHGFLDLRKPKGNEVGIYSGNRMRSQVEAVGTFRLILKTGFVLDLNNVFYVPSFSKNL 127
N++ G +LRKP G+E I SGN++ S VEA+GT L L +GF+L L FYVPSFS+NL
Sbjct: 264 NSLQGMQNLRKPVGSEQSILSGNKLGSHVEAIGTCILTLSSGFILKLERTFYVPSFSRNL 323
Query: 128 ISVSKLVDVDYDFLFKKPCFPILKNNFSVGGGTLIDGLYKIELDPTFEHNYLNMHADVGI 187
IS+S+LV Y F FK F + N+ VG G L DGLY + L Y +MH GI
Sbjct: 324 ISISRLVPFGYSFNFKDTSFELFYNSECVGNGILSDGLYLLGLQNN--ATYSSMHVQTGI 381
Query: 188 KRNRIDENSSTLWHRRLGHISIERVKRLVKDGVLKTLDFTDFGTCVDCIKGKQTNKTSKG 247
KR I+ENSS LWHRRLGHISIER+KRLVKDGVL TLDF DF TC+DCIKGKQTN + KG
Sbjct: 382 KRCNINENSSMLWHRRLGHISIERIKRLVKDGVLNTLDFADFKTCMDCIKGKQTNMSKKG 441
Query: 248 DKRSSQILEIIHTDICGPFPTPCLNGQRYFISFIDDHTRFMYLYLLLEKAEAFDAFKSFK 307
RSS ILEIIHTDIC P +GQ+YFI+FIDD++R+M +YLL K EA DAFK FK
Sbjct: 442 ANRSSSILEIIHTDICCPDMDA--HGQKYFITFIDDYSRYMNVYLLHNKYEALDAFKVFK 499
Query: 308 AEVEKQKDMKIKIVRSDRGGEYYGRYTEKGQMPGPFAKFLEEEGIVAQYTMSDTPQQNGV 367
AEVE Q +IKIVRSDRGGEYYGRYTE GQ PGPFAKFL+E GIVAQYTM +P QNGV
Sbjct: 500 AEVENQCGKQIKIVRSDRGGEYYGRYTENGQAPGPFAKFLQEHGIVAQYTMPGSPNQNGV 559
Query: 368 AERRNRTLMDMVRSMISNTNLPLSLRSEA 396
AERRNRTL+DMVRSM+SN+NLP SL +EA
Sbjct: 560 AERRNRTLLDMVRSMLSNSNLPKSLWAEA 588
Score = 396 bits (1018), Expect = e-120, Method: Compositional matrix adjust.
Identities = 192/268 (71%), Positives = 218/268 (81%)
Query: 393 RSEAKGYTQKEGIDCHETFSPVSKKDSLRIIMALVAHFDLELHQMDVNTAFLNGHLGEEV 452
R AKG+TQKEGID ETFSPVSKKDSLRII+ALVAHFDLEL QMDV TAFLNG L EEV
Sbjct: 866 RLVAKGFTQKEGIDYKETFSPVSKKDSLRIILALVAHFDLELQQMDVKTAFLNGDLEEEV 925
Query: 453 YMVQPEGFRDENDHHLVCKLRKSIYGLKQASRQWYLKFHNVITSFGFTENIVDQCIYLKV 512
YM QPEGF + HLVCKL KSIYGLKQASRQWYLKFH +I+SFGF EN +DQCIY KV
Sbjct: 926 YMKQPEGFSSNSGEHLVCKLNKSIYGLKQASRQWYLKFHGIISSFGFDENPMDQCIYHKV 985
Query: 513 CGSKYIFLVLYVDDILLATSDLGLLHETKKFLSQTFEMKDLGEASNVIGIEIHRDRSKRS 572
GSK FLVLYVDDILLA +D GLLHE K+FLS+ F+MKD+G+AS VIGI+IHRDRS+
Sbjct: 986 SGSKICFLVLYVDDILLAANDRGLLHEVKQFLSKNFDMKDMGDASYVIGIKIHRDRSRGI 1045
Query: 573 LGLSQKAYIKRILERFRMHKCASLVAPIAKGEKLTQSQCPHNALEQGEMKDIPYASAVGS 632
LGLSQ+ YI +ILERFRM C+ VAPI KG++ +QCP N E+ +MK+IPYAS VGS
Sbjct: 1046 LGLSQETYINKILERFRMKDCSPSVAPIVKGDRFNLNQCPKNDFEREQMKNIPYASVVGS 1105
Query: 633 LLYAQVCTRPDLAFVVGLLGRAISGKNI 660
L+YAQVCTRPD+AF VG+LGR S I
Sbjct: 1106 LMYAQVCTRPDIAFAVGMLGRYQSNPGI 1133