BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000010.1_g0530.1
(682 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
BAG72096.1 Gag-protease-integrase-RT-RNaseH polyprotein [Glycine...   974   0.0
CAN69396.1 hypothetical protein VITISV_021034 [Vitis vinifera]        934   0.0
CAN83951.1 hypothetical protein VITISV_043907 [Vitis vinifera]        928   0.0
>BAG72096.1 Gag-protease-integrase-RT-RNaseH polyprotein [Glycine max]
Length = 1321
Score = 974 bits (2519), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/697 (66%), Positives = 557/697 (79%), Gaps = 17/697 (2%)
Query: 1 LHIWGCPAEVRIYNPHIKKLDPRTTSGYFIGYAVNSKGFRFYCPSHSQRIVEARNAKFFE 60
+ +WGCP+EVRIYNP KKLDPRT SGYFIGYA SKG+RFYCP H RIVE+RNAKF E
Sbjct: 622 MRVWGCPSEVRIYNPQEKKLDPRTISGYFIGYAERSKGYRFYCPHHITRIVESRNAKFIE 681
Query: 61 DFDSSGSDIPRLIEFEETREEPELPVDTRDLVIVQRSQPDIIAQQPTLEQP--------- 111
+ SGSD R + E E + LV++ Q +Q + P
Sbjct: 682 NDLISGSDQLRDLGSEIDYIESQPSTSNERLVVIHTPQVQRDDEQHMIGIPQTVVDNLVD 741
Query: 112 -----VHE--EQIPHEPVPQIEHEDVVLRRSSRTRKPAISSDYLVYLQESDFDIGPKRDP 164
+HE EQ + PQ E+ D LRRS+R RK AI SDY+VYLQESD++IG + DP
Sbjct: 742 QVDHQIHENDEQPVEQHDPQ-ENVDATLRRSTRVRKSAIPSDYIVYLQESDYNIGAENDP 800
Query: 165 NSFSEAINGEKSALWYDAMKEEMESMAKNQVWDLVELPKGSKAIGCRWVYKTKRDSSGNV 224
+F +A++ ++S LWYDAMK+EM SM N+VW+LVELP G+KAIGC+WV+KTK+DS GN+
Sbjct: 801 ETFDQAMSCKESNLWYDAMKDEMSSMQSNKVWNLVELPNGAKAIGCKWVFKTKKDSLGNI 860
Query: 225 ERYKARLVAKGYTQKEGIDYHETFSPVSKKDSLRIIMALVAHFDLELHQMDVKTAFLNGH 284
ERYKARLVAKG+TQKEGIDY ETFSPVSKKDSLRII+ALVAHFDLEL QMDVKTAFLNG
Sbjct: 861 ERYKARLVAKGFTQKEGIDYKETFSPVSKKDSLRIILALVAHFDLELQQMDVKTAFLNGD 920
Query: 285 LGEEVYMVQPEGFRDENDRHLVCKLRKSIYGLKQASRQWYLKFHNVITSFGFTENIVDQC 344
L EEVYM QPEGF + HLVCKL KSIYGLKQASRQWYLKFH +I+SFGF EN +DQC
Sbjct: 921 LEEEVYMKQPEGFSSNSGEHLVCKLNKSIYGLKQASRQWYLKFHGIISSFGFDENPMDQC 980
Query: 345 IYLKVSGSKYIFLVLYVDDILLATSDLGLLHETKEFLSQNFEMKDLGEASYVIGIEIHRD 404
IY KVSGSK FLVLYVDDILLA +D GLLHE K+FLS+NF+MKD+G+ASYVIGI+IHRD
Sbjct: 981 IYHKVSGSKICFLVLYVDDILLAANDRGLLHEVKQFLSKNFDMKDMGDASYVIGIKIHRD 1040
Query: 405 RSKRSLGLSQKAYIERILERFRMHNCASLVAPIAKGEKLTLSQCPQNALEQGEMKDIPYA 464
RS+ LGLSQ+ YI +ILERFRM +C+ VAPI KG++ L+QCP+N E+ +MK+IPYA
Sbjct: 1041 RSRGILGLSQETYINKILERFRMKDCSPSVAPIVKGDRFNLNQCPKNDFEREQMKNIPYA 1100
Query: 465 SAVGSLLYAQVCTRPDLAFVVGLLGRYQSNPGKEHWKAVKRVMRYLQGTKDYRLTYGYTD 524
S VGSL+YAQVCTRPD+AF VG+LGRYQSNPG +HW+A K+V+RYLQGTKDY L Y TD
Sbjct: 1101 SVVGSLMYAQVCTRPDIAFAVGMLGRYQSNPGIDHWRAAKKVLRYLQGTKDYMLMYRQTD 1160
Query: 525 HLELVGYSDSDFAGCVDSRKSTSGYIFLLAGGAISWRSTKQTILATSTMEAEFIACYEAT 584
+L+ +GYSDSDFAGCVDSR+STSGYIF++AGGAISW S KQ++ ATSTMEAEF++C+EAT
Sbjct: 1161 NLDAIGYSDSDFAGCVDSRRSTSGYIFMMAGGAISWGSVKQSLAATSTMEAEFVSCFEAT 1220
Query: 585 TQAIWLRNFVSGLKIVDTIERPLKILCDNSAAVFFSKNNKSGSRSKHIDIKYLLVRDKVK 644
+ +WL++F+SGLKI+DTI RPL+I CDNSAAVF +KNNKSGSRSKHIDIKYL +R++VK
Sbjct: 1221 SHGVWLKSFISGLKIIDTISRPLRIFCDNSAAVFMAKNNKSGSRSKHIDIKYLAIRERVK 1280
Query: 645 EHVVAIEHISTKLMIADPMTKALPAQVFLDHVERMGL 681
+ V IEHIST+LMIADP+TK +P F DHVERMGL
Sbjct: 1281 DKKVVIEHISTELMIADPLTKGMPPFKFKDHVERMGL 1317
>CAN69396.1 hypothetical protein VITISV_021034 [Vitis vinifera]
Length = 2026
Score = 934 bits (2415), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/686 (63%), Positives = 549/686 (80%), Gaps = 11/686 (1%)
Query: 1 LHIWGCPAEVRIYNPHIKKLDPRTTSGYFIGYAVNSKGFRFYCPSHSQRIVEARNAKFFE 60
+HIWGCPAE RIYNPH KKLD RT SGYFIGY SKG+ FYCP+H RIVE NA+F E
Sbjct: 430 IHIWGCPAETRIYNPHEKKLDSRTISGYFIGYPDKSKGYXFYCPNHXVRIVETXNARFLE 489
Query: 61 DFDSSGSDIPRLIEFEETREEPELPVDTRDLVIVQRSQPDIIAQQPTLEQPVHEEQIPHE 120
+ + SGS+ PR + EE R + P +++++ Q Q Q EQ + +P E
Sbjct: 490 NGEISGSNEPRKXDIEEIRVDIXPPFLPQEIIVPQPXQ-----QVEXNEQHNRDGSLPXE 544
Query: 121 PVPQIEH-----EDVVLRRSSRTRKPAISSDYLVYLQESDFDIGPKRDPNSFSEAINGEK 175
+P IE+ + LRRS R R+PAI+ DY+VYL ESDFDIG ++DP SFS+A+ +
Sbjct: 545 NIP-IENXVEPPQPXPLRRSQRERRPAITDDYVVYLXESDFDIGIRKDPVSFSQAMESDD 603
Query: 176 SALWYDAMKEEMESMAKNQVWDLVELPKGSKAIGCRWVYKTKRDSSGNVERYKARLVAKG 235
S+ W +AM EE++SMA N VWDL+ELP K +GC+WV+KTKRD+ GN+ER+KARLVAKG
Sbjct: 604 SSKWMEAMNEELKSMAHNGVWDLIELPNNCKPVGCKWVFKTKRDAKGNIERFKARLVAKG 663
Query: 236 YTQKEGIDYHETFSPVSKKDSLRIIMALVAHFDLELHQMDVKTAFLNGHLGEEVYMVQPE 295
+TQKEGIDY +TFSPVSKKDSLRIIMALVAHFDLELHQMDVKTAFLNG+L E++YM Q E
Sbjct: 664 FTQKEGIDYKDTFSPVSKKDSLRIIMALVAHFDLELHQMDVKTAFLNGNLDEDIYMEQXE 723
Query: 296 GFRDENDRHLVCKLRKSIYGLKQASRQWYLKFHNVITSFGFTENIVDQCIYLKVSGSKYI 355
GF + + HLVCKL+KSIYGLKQASRQWY+KF+N IT FGF EN VDQCIYLKVSGSK+I
Sbjct: 724 GFAKKGNEHLVCKLKKSIYGLKQASRQWYIKFNNTITYFGFKENXVDQCIYLKVSGSKFI 783
Query: 356 FLVLYVDDILLATSDLGLLHETKEFLSQNFEMKDLGEASYVIGIEIHRDRSKRSLGLSQK 415
FL+LYVDDILLA+SDLGLL ETKE+LS+NF M D+GEA+YVIGIEI RDRS+ LGLSQK
Sbjct: 784 FLILYVDDILLASSDLGLLRETKEYLSKNFHMVDMGEANYVIGIEIFRDRSRGVLGLSQK 843
Query: 416 AYIERILERFRMHNCASLVAPIAKGEKLTLSQCPQNALEQGEMKDIPYASAVGSLLYAQV 475
YI+R+LERF M +C+S +API KG+KL+ QCP+N E+ +MK IPYAS VGSL+YAQ
Sbjct: 844 GYIDRVLERFNMQSCSSGIAPILKGDKLSKMQCPRNNXEREQMKKIPYASXVGSLMYAQT 903
Query: 476 CTRPDLAFVVGLLGRYQSNPGKEHWKAVKRVMRYLQGTKDYRLTYGYTDHLELVGYSDSD 535
CTRPD++F VG+LGRYQS+PG EHWK K+VMRYLQGTKDY LTY ++ LE+VGYSBSD
Sbjct: 904 CTRPDISFAVGMLGRYQSDPGFEHWKXAKKVMRYLQGTKDYMLTYKRSEQLEVVGYSBSD 963
Query: 536 FAGCVDSRKSTSGYIFLLAGGAISWRSTKQTILATSTMEAEFIACYEATTQAIWLRNFVS 595
+ GC+BS KSTSG++F+LA GAISW+S KQ+I A+STMEAEF+AC+EA++ +WL+NF+S
Sbjct: 964 YGGCLBSLKSTSGFVFMLANGAISWKSEKQSITASSTMEAEFVACFEASSHVLWLQNFIS 1023
Query: 596 GLKIVDTIERPLKILCDNSAAVFFSKNNKSGSRSKHIDIKYLLVRDKVKEHVVAIEHIST 655
GL +VD I +PL+I CDN+A VFFSKN K S SKH+D+KYL+V+++V++ ++IE+I T
Sbjct: 1024 GLGVVDPIAKPLRIYCDNTATVFFSKNGKFSSGSKHMDLKYLVVKERVQKQQMSIENIRT 1083
Query: 656 KLMIADPMTKALPAQVFLDHVERMGL 681
LM+ADP+TK LP + +L+HV RMGL
Sbjct: 1084 TLMVADPLTKGLPPKAYLEHVMRMGL 1109
>CAN83951.1 hypothetical protein VITISV_043907 [Vitis vinifera]
Length = 745
Score = 928 bits (2398), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 446/704 (63%), Positives = 540/704 (76%), Gaps = 25/704 (3%)
Query: 1 LHIWGCPAEVRIYNPHIKKLDPRTTSGYFIGYAVNSKGFRFYCPSHSQRIVEARNAKFFE 60
+ +WGC +EVRIYNP KKLDPRT SGYFIGYA SKG++FYCPSH+ RIVE+RNAKF +
Sbjct: 1 MRVWGCSSEVRIYNPQEKKLDPRTISGYFIGYAEKSKGYKFYCPSHNTRIVESRNAKFLK 60
Query: 61 DFDSSGSDIPRLI--EFEETREEPELPVD---------------------TRDLVIVQRS 97
SGSD R I + + T +P D + +V V ++
Sbjct: 61 YDLVSGSDQFRNIVSDIDHTESQPSTSSDRLFIVHNTPQVQSGVEQTIAEVQSIVEVPQA 120
Query: 98 QPDIIAQQPTLEQPVHEEQIPHEPVPQIEHEDVVLRRSSRTRKPAISSDYLVYLQESDFD 157
+I+ Q E P E+Q+ EP +E LRRS+RT++ I +DY+VYLQE D++
Sbjct: 121 VDNILVDQVDQELPDTEQQV--EPHTFLEDIGATLRRSTRTKRSTIPNDYVVYLQECDYN 178
Query: 158 IGPKRDPNSFSEAINGEKSALWYDAMKEEMESMAKNQVWDLVELPKGSKAIGCRWVYKTK 217
IG K DP F +A++ ++S LWY+AMK+EM SM N VWDLVEL G+K IGC+WV+KTK
Sbjct: 179 IGAKNDPEFFLQAMSCKESELWYNAMKDEMSSMKCNDVWDLVELLNGAKTIGCKWVFKTK 238
Query: 218 RDSSGNVERYKARLVAKGYTQKEGIDYHETFSPVSKKDSLRIIMALVAHFDLELHQMDVK 277
+DS N+ERYK RLVAKG+TQKEGIDY ETFS VSKKDSLRII+ALVAHFDLEL QMDVK
Sbjct: 239 KDSLDNIERYKVRLVAKGFTQKEGIDYTETFSLVSKKDSLRIILALVAHFDLELQQMDVK 298
Query: 278 TAFLNGHLGEEVYMVQPEGFRDENDRHLVCKLRKSIYGLKQASRQWYLKFHNVITSFGFT 337
T FLNG L EEVYM QPEGF + LVCKL+KSIY LKQASR+WYLKFHN+ +SFGF
Sbjct: 299 TTFLNGELEEEVYMKQPEGFPSSDGEQLVCKLKKSIYSLKQASRKWYLKFHNINSSFGFE 358
Query: 338 ENIVDQCIYLKVSGSKYIFLVLYVDDILLATSDLGLLHETKEFLSQNFEMKDLGEASYVI 397
EN++DQCIYLKVSGSK FLVLY+DDILLAT+D G L+E K+FLS+NF MKD+GEASYVI
Sbjct: 359 ENVMDQCIYLKVSGSKICFLVLYMDDILLATNDKGFLYEVKQFLSKNFNMKDMGEASYVI 418
Query: 398 GIEIHRDRSKRSLGLSQKAYIERILERFRMHNCASLVAPIAKGEKLTLSQCPQNALEQGE 457
GI+IHRDR + LGLSQ+ YI ++LERF M NC+ V+PI K + L QCP+N LE+ +
Sbjct: 419 GIKIHRDRFQGILGLSQETYINKVLERFWMKNCSLSVSPIVKSNRFNLDQCPKNDLEREQ 478
Query: 458 MKDIPYASAVGSLLYAQVCTRPDLAFVVGLLGRYQSNPGKEHWKAVKRVMRYLQGTKDYR 517
MK+IPYASAVGSL+YAQVCTRPD+AF VG+LGRYQSNPGK+HWKA K+VMRYLQGTKDY+
Sbjct: 479 MKNIPYASAVGSLMYAQVCTRPDIAFAVGMLGRYQSNPGKDHWKAAKKVMRYLQGTKDYK 538
Query: 518 LTYGYTDHLELVGYSDSDFAGCVDSRKSTSGYIFLLAGGAISWRSTKQTILATSTMEAEF 577
L Y T +LE+VGYSDSDFAGCVDSRKSTSGYIF+LAGGAISWRS KQT+ ATSTMEAEF
Sbjct: 539 LMYRRTSNLEVVGYSDSDFAGCVDSRKSTSGYIFILAGGAISWRSVKQTMTATSTMEAEF 598
Query: 578 IACYEATTQAIWLRNFVSGLKIVDTIERPLKILCDNSAAVFFSKNNKSGSRSKHIDIKYL 637
I+C+E T+ +WL +F+ GL+++D+I R L I CDNS VF +KNNK+GSRSKHIDIKYL
Sbjct: 599 ISCFETTSHGVWLTSFIFGLRVMDSISRSLSIYCDNSVVVFMAKNNKTGSRSKHIDIKYL 658
Query: 638 LVRDKVKEHVVAIEHISTKLMIADPMTKALPAQVFLDHVERMGL 681
+ ++VKE V IEHIST+LMI DP+TK +P F DHV MGL
Sbjct: 659 AISERVKEKKVVIEHISTELMIVDPLTKGMPPLKFKDHVVNMGL 702