BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000117.1_g0510.1
(738 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
KYP38661.1 Retrovirus-related Pol polyprotein from transposon TN...   741   0.0
KYP61781.1 Retrovirus-related Pol polyprotein from transposon TN...   697   0.0
GAU21017.1 hypothetical protein TSUD_201660 [Trifolium subterran...   692   0.0
>KYP38661.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94
[Cajanus cajan]
Length = 778
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/672 (57%), Positives = 466/672 (69%), Gaps = 79/672 (11%)
Query: 54 TRRIYISRDVTFFENRFPMS--STPTVKPSISRPAPFISIDDDDTFVFPANDKLRSISSP 111
+R+IY+SRDV F E FP S+P+ I P I+ DD ++R S
Sbjct: 178 SRKIYVSRDVQFHEAIFPFQDFSSPSFPDEICINTP---INADDLLDPITIVQIRPDISL 234
Query: 112 DNSRF--TDESAVAVPASLDPGTGVPSDTPGDPTTGPTSTGPPSPSRVPSANRDVDSGSG 169
N++ +DE+ ++ T P D ST S S S+N D+ S
Sbjct: 235 ANNQVENSDETTIS--------TSNPQDD--------NSTNSQSLSEDASSN-DISS--- 274
Query: 170 VPPEQSVPPPSNVSSLPNPSSDDRSSDSVVADSGRPVRSRRPPSHLDDYVCSNTNTI-KY 228
++ PP+N S P RRPP HL DYVC++ +I +
Sbjct: 275 ---TETSLPPTNHSQHP----------------------RRPPHHLKDYVCNHIKSITNF 309
Query: 229 LLENYFSINSFSNSHQQFLSSVLSTTEPTSFTQAMKSPHWKEAMAKEISALEENKTWTIT 288
L NY S+++ SNSHQ FL++++ EP SF+QAMKS W+EAMAKEI ALE N TW++
Sbjct: 310 PLANYLSLSNLSNSHQAFLTNIIDNQEPKSFSQAMKSAEWREAMAKEIQALESNNTWSLC 369
Query: 289 VLPSGKKPIGCKWVYKIKFKSDGSIERYKARLVAKGYTQIEGLDYNDTFAPVAKLVTVRV 348
LP GK IGCKWVYKIK++SDGSIERYKARLVAKGY QIEG+DY+DTFAPVAKLV VR+
Sbjct: 370 PLPQGKSSIGCKWVYKIKYRSDGSIERYKARLVAKGYAQIEGIDYHDTFAPVAKLVIVRL 429
Query: 349 LLSLAAIKGWSLHQLDVNNAFLQGDLNEEVYMKLPPGFSSQGESKVCRLHKSLYGLKQAS 408
LLS+AAIK WSLHQLDVNN FLQGDLNEEVYMKLPP FS +GE+ VC+LHK +YGLKQAS
Sbjct: 430 LLSIAAIKNWSLHQLDVNNDFLQGDLNEEVYMKLPPRFSRKGETYVCKLHKFIYGLKQAS 489
Query: 409 RQWFSKFSSVLIERGFSQSKSDYSFFTFRSHSTQIYVLVYVDDIIITGNNDHAISQLKIF 468
RQWFSKFS+ +I+RGF QS SDYS FT+ S T ++VLVYVDDIIIT NND AI +K F
Sbjct: 490 RQWFSKFSTTIIQRGFRQSISDYSLFTYISGQTSVFVLVYVDDIIITSNNDDAIFNIKQF 549
Query: 469 LNKAFSLKDLGRLQYFLGIEVSRSSQGIFLCQRKYALDILKDSGLTAARPSEFPMEQKLR 528
L ++FS+KD G L+YFLGIEVSRS +GIFLCQRKY LDIL D+G+T+ RPS+FPMEQ LR
Sbjct: 550 LAQSFSIKDHGNLRYFLGIEVSRSKKGIFLCQRKYTLDILSDTGMTSCRPSDFPMEQHLR 609
Query: 529 LSPTDGTPLPDPSVYRRLIGRLLYLTVTRPDITFAGTLNKGVFLSANSSLHITGYCDSDW 588
L P DGT LPDP+ YRRLIGRLLYLTVTRP DSDW
Sbjct: 610 LRPNDGTLLPDPTAYRRLIGRLLYLTVTRP--------------------------DSDW 643
Query: 589 AGCPSTRRSTTGYFTMLGSSPLPWKSKKQPTVAKSSAEAEYRALAILTCELQWLKYLLLD 648
AGCP+TRRSTTGYFTMLGSSP+ WK+KKQPTV++SSAEAEYR+LA LT ELQWL YLL D
Sbjct: 644 AGCPTTRRSTTGYFTMLGSSPISWKTKKQPTVSRSSAEAEYRSLAALTYELQWLTYLLSD 703
Query: 649 FGIDHSDPMTVYCDNRAALHIADNPVFHERTKHIEIDCHIVREKIKDNVIATRFTSTENQ 708
G+ H P+ ++CD++A +HIA+NPVFHERTKHIEIDCH VREKIK +I + + +Q
Sbjct: 704 LGLPHPQPIPIHCDSQAGIHIAENPVFHERTKHIEIDCHFVREKIKAGLITPSYLRSRDQ 763
Query: 709 LADIFTKPLGAE 720
LADIFTKPLG +
Sbjct: 764 LADIFTKPLGGD 775
>KYP61781.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
[Cajanus cajan]
Length = 1413
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/778 (46%), Positives = 474/778 (60%), Gaps = 100/778 (12%)
Query: 4 PPNLSHLRVFGCLCYARIPLVK-TKMDARSSAGIFLGYPNGQKGYRIYDISTRRIYISRD 62
PP+L+HLRVFG LCY K K +RS +FLGYP G+KG+R+YD+ SRD
Sbjct: 683 PPSLNHLRVFGSLCYVHNRDSKGDKFASRSRRCVFLGYPYGKKGWRVYDLELGVFLTSRD 742
Query: 63 VTFFENRFPMSSTPTVKPS---ISRP--APFISIDDDDTFVFPANDKLRSISSPDNSRFT 117
V F E+ FP + + P I + AP+ S+DDDD +D +N +
Sbjct: 743 VVFSESEFPCAESQNSTPKHIDIEKDNLAPWNSVDDDDEMEIAQDD-------FENRKGA 795
Query: 118 DESAVAVPASLDPGTGVPSDTPGDPTTGPTSTGPPSPSRVPSANRDVDSGSGVPPEQSVP 177
+ ++ SL TG +D+ D T G S
Sbjct: 796 NNGSIV---SLQEPTGPHTDSH-DIETSVIERGGLS------------------------ 827
Query: 178 PPSNVSSLPNPSSDDRSSDSVVADSGRPVRSRRPPSHLDDYVCSNTNTIK---------- 227
+ VS+ P GR R++ P + L D+V + +
Sbjct: 828 --AYVSTHP---------------LGRGHRAKMPSTRLRDFVANTICRLDPSPSSLATSR 870
Query: 228 -----YLLENYFSINSFSNSHQQFLSSVLSTTEPTSFTQAMKSPHWKEAMAKEISALEEN 282
YL+ +Y + N+FS ++ FL++V TEP SF QAMK W++AM EI ALE N
Sbjct: 871 PSGTPYLITHYVNYNNFSAHYRHFLAAVSIGTEPQSFAQAMKDEQWQQAMQAEIQALENN 930
Query: 283 KTWTITVLPSGKKPIGCKWVYKIKFKSDGSIERYKARLVAKGYTQIEGLDYNDTFAPVAK 342
TWT+ LP GKK IGCKWVY+IK+ SDGSIER+KARLV G Q+EGLDYN+TFAPVAK
Sbjct: 931 GTWTLEPLPPGKKAIGCKWVYRIKYNSDGSIERFKARLVILGNNQVEGLDYNETFAPVAK 990
Query: 343 LVTVRVLLSLAAIKGWSLHQLDVNNAFLQGDLNEEVYMKLPPGFSSQGESKVCRLHKSLY 402
+VT+R LL++AA + W LHQ+DV+NAFL GDL EEVYMKLPPGF SQ KVCRL KSLY
Sbjct: 991 MVTMRTLLAVAAARKWELHQMDVHNAFLHGDLKEEVYMKLPPGFRSQASGKVCRLRKSLY 1050
Query: 403 GLKQASRQWFSKFSSVLIERGFSQSKSDYSFFTFRSHSTQIYVLVYVDDIIITGNNDHAI 462
GLKQA R WF+K + L GF+QS SD+S FT + Q++VLVYVDD++I+GN++ AI
Sbjct: 1051 GLKQAPRCWFAKLAGALKRYGFAQSSSDHSLFTLQRERVQLHVLVYVDDLVISGNDNAAI 1110
Query: 463 SQLKIFLNKAFSLKDLGRLQYFLGIEVSRSSQGIFLCQRKYALDILKDSGLTAARPSEFP 522
K++LN F +KDLG L+YFLGIEV+R+S GIFLCQRKYALDI+ + GL A+P+ FP
Sbjct: 1111 KAFKLYLNACFHMKDLGMLKYFLGIEVARNSTGIFLCQRKYALDIISEVGLLGAKPAGFP 1170
Query: 523 MEQKLRLSPTDGTPLPDPSVYRRLIGRLLYLTVTRPDITFA------------------- 563
M+Q +L GT LPDP YRRL+GRL+YL+VTRP++++
Sbjct: 1171 MDQHHQLPLAKGTLLPDPERYRRLVGRLIYLSVTRPELSYCVHTLAQFMQHPRQEHWDAA 1230
Query: 564 --------GTLNKGVFLSANSSLHITGYCDSDWAGCPSTRRSTTGYFTMLGSSPLPWKSK 615
G +G+ L AN L + +CDSDWA CP TRRS TG+F +LG SP+ WK+K
Sbjct: 1231 LRVVRYLKGNPGQGILLRANCDLQLYAWCDSDWASCPLTRRSLTGWFILLGDSPISWKTK 1290
Query: 616 KQPTVAKSSAEAEYRALAILTCELQWLKYLLLDFGIDHSDPMTVYCDNRAALHIADNPVF 675
KQ TV++SS+EAEYR++A TCEL+WLK LL G HS PM +YCDN+AALHIA N VF
Sbjct: 1291 KQHTVSRSSSEAEYRSMATTTCELKWLKELLSCLGCAHSGPMKLYCDNQAALHIAANLVF 1350
Query: 676 HERTKHIEIDCHIVREKIKDNVIATRFTSTENQLADIFTKPLGAEPFKNIIGKLGVTS 733
HERTKHIE+DCH V +++ I+T + QLADIFTK LG F ++ KLG+ +
Sbjct: 1351 HERTKHIEVDCHFVHDELLQGTISTHHVPSHAQLADIFTKALGKTQFDCLLSKLGICN 1408
>GAU21017.1 hypothetical protein TSUD_201660 [Trifolium subterraneum]
Length = 1252
Score = 692 bits (1785), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/780 (45%), Positives = 472/780 (60%), Gaps = 84/780 (10%)
Query: 2 GSPPNLSHLRVFGCLCYARIPLVK-TKMDARSSAGIFLGYPNGQKGYRIYDISTRRIYIS 60
G P+L+HLRVFGCLCY + K ++RS IF+GYP G+KG+R+YD+ T IS
Sbjct: 514 GKSPSLTHLRVFGCLCYVHNQDRRGDKFESRSRKCIFVGYPYGKKGWRVYDLETSAFLIS 573
Query: 61 RDVTFFENRFPMSSTPTVKPSISRPAPFISIDDDDTFVFPANDKLRSISSPDNSRFTDES 120
RDV F E++FP + + S S A + D D+ ND L ++ N
Sbjct: 574 RDVVFCEDKFPFHKSLNRRMSQSPTADLWTYDCDN------NDHLMVGNNSSNE------ 621
Query: 121 AVAVPASLDPGTGVPSDTPGDPTTGPTSTGPPSPSRVPSANRDVDSGSGVPPEQSVPPPS 180
T + D GD T S + Q+ PS
Sbjct: 622 -----------THLVIDGVGDSVNTGTEL------------------SDLNDAQNSAAPS 652
Query: 181 NVSSLPNPSSDDRSSDSVVADSGRPVRSRRPPSHLDDYVCSNTNTIK------------- 227
S++ N + + + GR R + P + L D+V +
Sbjct: 653 YGSNINNEDNGGYENMDGEENLGRGHRIKLPSTKLKDFVTHAVRNLSPSTSSLPPSQSSG 712
Query: 228 --YLLENYFSINSFSNSHQQFLSSVLSTTEPTSFTQAMKSPHWKEAMAKEISALEENKTW 285
Y + ++ + ++FS+ +Q FL+++ + EP +F +A+K W+ AM EI ALE N TW
Sbjct: 713 TPYPIAHFVNSSNFSSKYQYFLAAITTGNEPRTFAEAVKYKQWRTAMQLEIQALENNNTW 772
Query: 286 TITVLPSGKKPIGCKWVYKIKFKSDGSIERYKARLVAKGYTQIEGLDYNDTFAPVAKLVT 345
TI LP GKK IGCKWVYKIK+ SDGSIERYKARLV G Q+EGLDYN+TFAPVAK+VT
Sbjct: 773 TIETLPHGKKSIGCKWVYKIKYHSDGSIERYKARLVILGNNQVEGLDYNETFAPVAKMVT 832
Query: 346 VRVLLSLAAIKGWSLHQLDVNNAFLQGDLNEEVYMKLPPGFSSQGESKVCRLHKSLYGLK 405
VR L++AA + W LHQ+DV+NAFL GDL EE+YMKLPPGF S ++ CRL KSLYGLK
Sbjct: 833 VRTFLAVAAARNWELHQMDVHNAFLHGDLEEEIYMKLPPGFYSSAPNQACRLRKSLYGLK 892
Query: 406 QASRQWFSKFSSVLIERGFSQSKSDYSFFTFRSHSTQIYVLVYVDDIIITGNNDHAISQL 465
QA R WF+K ++ L + GF QS SDYS FT Q+ VLVYVDD+I++GN+ AI
Sbjct: 893 QAPRCWFAKLAAALKKYGFKQSGSDYSLFTLHKDDVQLNVLVYVDDLIVSGNDTSAIQSF 952
Query: 466 KIFLNKAFSLKDLGRLQYFLGIEVSRSSQGIFLCQRKYALDILKDSGLTAARPSEFPMEQ 525
K +L+ F +KDLG L+YFLGIEV+R+S GIFLCQRKYALDI+ + GL A+P+ PM+Q
Sbjct: 953 KSYLSTCFYMKDLGLLRYFLGIEVARNSTGIFLCQRKYALDIISEVGLLGAKPAHIPMDQ 1012
Query: 526 KLRLSPTDGTPLPDPSVYRRLIGRLLYLTVTRPDITFA---------------------- 563
LS D L +P YRRL+GRL+YL+VTRP++++
Sbjct: 1013 NHHLSLVDEPLLSEPEKYRRLVGRLIYLSVTRPELSYCVHMLAQFMQQPRLPHWEAALRV 1072
Query: 564 -----GTLNKGVFLSANSSLHITGYCDSDWAGCPSTRRSTTGYFTMLGSSPLPWKSKKQP 618
G +G+FL A+ L + +CDSDWA CP TRRS TG+ +LG+SP+ WK+KKQ
Sbjct: 1073 VKYLKGNPGQGIFLRADCDLQLYAWCDSDWASCPLTRRSLTGWLVLLGNSPISWKTKKQH 1132
Query: 619 TVAKSSAEAEYRALAILTCELQWLKYLLLDFGIDHSDPMTVYCDNRAALHIADNPVFHER 678
TV++SSAEAEYR+LA TCEL+WLK LL G+ H PM VYCD++AA+HIA NPVFHER
Sbjct: 1133 TVSRSSAEAEYRSLATTTCELKWLKELLSSLGVSHPRPMKVYCDSQAAMHIAANPVFHER 1192
Query: 679 TKHIEIDCHIVREKIKDNVIATRFTSTENQLADIFTKPLGAEPFKNIIGKLGVTSIPALT 738
TKHIE+DCH VR+++ I+T + ST QLADI TK LG + F +GKLG+ ++ A T
Sbjct: 1193 TKHIEVDCHFVRDELLCGNISTHYVSTRTQLADILTKALGKQQFDYFLGKLGIRNLHAPT 1252