BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000118.1_g0400.1
(576 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
KYP36220.1 Copia protein [Cajanus cajan] KYP68721.1 Copia protei...   395   e-128
KYP42518.1 Retrovirus-related Pol polyprotein from transposon TN...   397   e-126
KZV21171.1 hypothetical protein F511_24735 [Dorcoceras hygrometr...   392   e-122
>KYP36220.1 Copia protein [Cajanus cajan] KYP68721.1 Copia protein [Cajanus
cajan]
Length = 585
Score = 395 bits (1015), Expect = e-128, Method: Compositional matrix adjust.
Identities = 213/555 (38%), Positives = 309/555 (55%), Gaps = 54/555 (9%)
Query: 27 KPNAQRPKRARPFCDYCQQHGHVRATCYQLNGYPNRSNSGARPN---RPPRPTGPLAATV 83
KP + R++ CD+C + GH++ C+++ GYP N+ N R + A
Sbjct: 75 KPKGRGGSRSK--CDHCGKTGHIKPNCFEIIGYPENWNTRRTQNDTRRQGHKSNAFLAFE 132
Query: 84 IETVDPPTGAPLATSLTQEEYQGLLSLLDHQRHSNGTSNPTQGSTDAGIDFTGIQFLSSP 143
E + P GA A +L H++H + + + I+
Sbjct: 133 DEEKEAPKGAKAAHTL-------------HEKHMTHNTRVHENMAGNNKNLREIE----- 174
Query: 144 WVIDSGAAKHI--CSSLSRFSHYVDAPPQCFVRLPDGKRIQVRHIGTIVFNSNLILKNVL 201
WV+DSGA+ H+ C SL + ++ P +V +P G + V +G I + N+ L+NVL
Sbjct: 175 WVLDSGASHHMTPCLSLLKAVRKIEKP--FYVTVPTGNVVLVESMGYIDLDKNIKLENVL 232
Query: 202 HVPEFKINLISVSQLTNSLRCLVTFDFCSCVFQDPVTKTRIGLGDFHEGLYFLRTDIRIC 261
VP+F NLISV +L +C++T+D CV QD K IGLGD HEG+Y LR +
Sbjct: 233 FVPQFSCNLISVHKLARDSKCILTYDENRCVLQDQTMKEMIGLGDMHEGVYILRRPTKSI 292
Query: 262 SFSISSNSLYNKDVSDIWHWRLGHPSSSSIFKSLISRTQCMKS--VSFNKNPCEICPLSK 319
F+ + K+++ WH RLGHP +F++L + +K ++ + C+IC SK
Sbjct: 293 YFTA-----FLKNMAGTWHSRLGHP----LFEALQKISNIVKCSFITNKEECCDICHKSK 343
Query: 320 FTRLSFPLSNSTSSRSFELIHADSWGPFSVPSNSGCRYFLTLVDDFTRCTWIFLMSHKSD 379
R F SN+ + F LIH D WG ++ S++G YFLT+VDD++R TW++LM HKS+
Sbjct: 344 QCRFPFNRSNNKAEAPFHLIHYDLWGKYNTASHNGSHYFLTIVDDYSRATWVYLMRHKSE 403
Query: 380 TTQFLKNFSNYISTQFDSSISNLQSGQGTIPIPKIQSIRSDNGMEFMNHELQSWFRKKGI 439
T LK F I QF+ + + IRSDNG EF N Q + R++GI
Sbjct: 404 TLDMLKKFCTIIKRQFNVDV---------------KKIRSDNGTEFTNSSFQRYIREEGI 448
Query: 440 IHQTSCPHTPQQNDIIERKHRHILEVARSLRLQGHLPIFFWGECVLTAVYLINKLPTPTL 499
+H+TSC TPQQN +ERKHRHIL VAR+LR + +LPI FWGECVLTA +LIN+ PT
Sbjct: 449 VHETSCVGTPQQNARVERKHRHILNVARALRFEANLPIHFWGECVLTATHLINRTPTIAN 508
Query: 500 QGISPHEKLLGSPPKFDHLRVFGCLCFTRR-PLVKTKLDSRASPGIFAGYPHNQKGYRVF 558
GI+P+E L G PP +DHLR+FGCLC+ + + K R+ +F GYP NQKG++V+
Sbjct: 509 SGITPYEMLYGKPPSYDHLRIFGCLCYVKNSSKQQDKFTPRSKKYMFIGYPQNQKGWKVY 568
Query: 559 DLNYRKIITSRDVLF 573
+L + SRDV+F
Sbjct: 569 NLETHEFFISRDVIF 583
>KYP42518.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94
[Cajanus cajan]
Length = 769
Score = 397 bits (1019), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/554 (39%), Positives = 316/554 (57%), Gaps = 55/554 (9%)
Query: 26 EKPNAQRPKRARPFCDYCQQHGHVRATCYQLNGYPNRSNSGARPNRPPRPTGPLAATVIE 85
+K N PK P CD+C + GH + C+++ GYP N P R R + +
Sbjct: 199 KKKNRDGPK---PRCDHCGKIGHDKTKCFEIVGYPANWN----PRRNTRNS-------TK 244
Query: 86 TVDPPTGAPLATSLTQEEYQGLLSLLDHQRHSNGTSNPTQGSTDAGIDF-TGIQFLSSPW 144
+ GA LA E QG H+ S + GS + D+ +G Q ++ W
Sbjct: 245 RTEHSGGANLA----WENNQGT------DGHALSGSQDSGGSHGSKNDYMSGNQMINDVW 294
Query: 145 VIDSGAAKHICSSLSRFSHYVDAPPQCFVRLPDGKRIQVRHIGTIVFNSNLILKNVLHVP 204
V+DSGA+ H+ S S+ D + +P G + V GT+ N N+ L NVL +P
Sbjct: 295 VLDSGASHHMTSLYSQLDEVQDFSIPLRITVPIGDVVLVHKKGTMKLNENIKLYNVLFIP 354
Query: 205 EFKINLISVSQLTNSLRCLVTFDFCSCVFQDPVTKTRIGLGDFHEGLYFLRTDIRICSFS 264
EF+ NLIS+ +LT+ L C+VT+ CV QD K IG G +G+Y + S
Sbjct: 355 EFRCNLISIHKLTHDLNCVVTYSVDECVIQDQTRKRMIGFGRLCDGIYIFTQQVGGYSLV 414
Query: 265 ISSNSLYNKDVSDIWHWRLGHPSSSSIFKSLISRTQCMKSVSFNKNP----CEICPLSKF 320
SS D++ +WH R+GHPS ++S+ + S SFN N C+IC SK
Sbjct: 415 ASSG-----DITTLWHARMGHPSDQ-----VLSKLSTIISFSFNANNKMECCDICHRSKQ 464
Query: 321 TRLSFPLSNSTSSRSFELIHADSWGPFSVPSNSGCRYFLTLVDDFTRCTWIFLMSHKSDT 380
RL F L+ + S+ F+LIH D WG + S++G YFLT+VDDFTR WI+L+ K++T
Sbjct: 465 CRLPFSLNYNKVSKVFDLIHCDLWGKYHTASHNGSHYFLTIVDDFTRAVWIYLLKDKTET 524
Query: 381 TQFLKNFSNYISTQFDSSISNLQSGQGTIPIPKIQSIRSDNGMEFMNHELQSWFRKKGII 440
T + N+ + TQFD+ K++ +RSDNG +F+N ++ S+F++ GI+
Sbjct: 525 TNVIINYYRMVQTQFDT---------------KVKVVRSDNGTKFVNSKIHSFFQEVGIL 569
Query: 441 HQTSCPHTPQQNDIIERKHRHILEVARSLRLQGHLPIFFWGECVLTAVYLINKLPTPTLQ 500
HQTSC +PQQN +ERKHRHIL VAR+LR Q +LP+ FWGECVLTA++LIN+ PT Q
Sbjct: 570 HQTSCVSSPQQNGRVERKHRHILNVARALRFQANLPLTFWGECVLTAIHLINRTPTVANQ 629
Query: 501 GISPHEKLLGSPPKFDHLRVFGCLCFTRRPLVKT-KLDSRASPGIFAGYPHNQKGYRVFD 559
G++P+E L G P + H+RVFGCLC+ + KT K +++A IF GYP QKG+R+++
Sbjct: 630 GLTPYEMLYGKQPSYAHIRVFGCLCYAKTLTKKTDKFEAQADRCIFIGYPQGQKGWRIYN 689
Query: 560 LNYRKIITSRDVLF 573
L ++ + SRDV+F
Sbjct: 690 LQKQQFMVSRDVIF 703
>KZV21171.1 hypothetical protein F511_24735 [Dorcoceras hygrometricum]
Length = 977
Score = 392 bits (1006), Expect = e-122, Method: Compositional matrix adjust.
Identities = 229/590 (38%), Positives = 316/590 (53%), Gaps = 57/590 (9%)
Query: 9 EAATLATYNNDEQNFRMEKPNA-----QRPKRARPFCDYCQQHGHVRATCYQLNGYP--- 60
E A NN ME P A K P C+ C GH + TCY+L GYP
Sbjct: 111 EEAHRTALNNQSM---MEVPTAVFYSSSVKKSDHPRCENCNIVGHTKETCYKLVGYPPGH 167
Query: 61 ---NRSNSGARPNRPPRPTGPLAA--TVIETVDPPTGAPLATSLTQEEYQGLLSLLDHQR 115
+ G + L+A T E P AP S T +Y+ ++ LL+
Sbjct: 168 KLHRKFPQGKSSKNQMKNHQQLSAHNTSQEYTQAPASAP---SFTPAQYEQIIKLLELAP 224
Query: 116 HSNGTSNPTQGSTDAGIDFTGIQFLSSPWVIDSGAAKHICSSLSRFSHYVDA-PPQCFVR 174
S+ + G++ + S PW++DSGA HI + + VR
Sbjct: 225 SSDKPAANFAGASQGPV---STPEHSIPWILDSGANAHITGTSKNLQNIQPCDSANGSVR 281
Query: 175 LPDGKRIQVRHIGTIVFNSNLILKNVLHVPEFKINLISVSQLTNSLRCLVTFDFCSCVFQ 234
LP+G + G++ S L NVLHVP+FK NL+S+S+ T C V F C FQ
Sbjct: 282 LPNGNLTHILSTGSLTIPSFCTLHNVLHVPDFKFNLLSISKFTKDHHCSVVFYPDFCFFQ 341
Query: 235 DPVTKTRIGLGDFHEGLYFL-------RTDIRICSFSISSNSLYNKDVS-DIWHWRLGHP 286
D T +G+G + GLY+L +TD + S ++ S KD+ +IWH R GH
Sbjct: 342 DLSTGKIMGIGKLYNGLYYLAETPQNIQTDRILSSPKLTCTSFACKDIDINIWHQRFGHM 401
Query: 287 SSSSIFKSLISRTQCMKSVSFN--KNPCEICPLSKFTRLSFPLSNSTSS-RSFELIHADS 343
S ISR Q + ++ N +PC ICP+SK TR FP + T + R F LIH D+
Sbjct: 402 S--------ISRLQHLPFITQNTLNSPCYICPISKQTRSMFPAKDHTPAPRPFSLIHMDT 453
Query: 344 WGPFSVPSNSGCRYFLTLVDDFTRCTWIFLMSHKSDTTQFLKNFSNYISTQFDSSISNLQ 403
WGP+ P+++G RYFLT+VDDF+RCTW+FLM KSD +KNF +++TQF++
Sbjct: 454 WGPYRTPTHNGARYFLTIVDDFSRCTWVFLMHLKSDVLTVIKNFLTFVTTQFNT------ 507
Query: 404 SGQGTIPIPKIQSIRSDNGMEFMNHELQSWFRKKGIIHQTSCPHTPQQNDIIERKHRHIL 463
K+Q+IR+DN ++F+N E + F GIIHQ+SCP+TPQQN ++ERKHRHIL
Sbjct: 508 ---------KVQTIRTDNALDFLNSECNNLFNSLGIIHQSSCPYTPQQNGLVERKHRHIL 558
Query: 464 EVARSLRLQGHLPIFFWGECVLTAVYLINKLPTPTLQGISPHEKLLGSPPKFDHLRVFGC 523
VAR+++ Q +P +WGECVL A Y+IN+ PTP L +P+E L PP + H+RVFGC
Sbjct: 559 NVARAVKFQASIPDIYWGECVLHAAYIINRTPTPLLSHKTPYEALFSKPPSYQHMRVFGC 618
Query: 524 LCFTRRPLVKTKLDSRASPGIFAGYPHNQKGYRVFDLNYRKIITSRDVLF 573
LCF K D RA +F GYP +QKGY++ D +I SRDV+F
Sbjct: 619 LCFASNLKPSHKFDVRARACVFLGYPPHQKGYKLLDTLTNRIFISRDVVF 668