BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000072.1_g0550.1
(381 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value
JAT57914.1 Retrovirus-related Pol polyprotein from transposon TN...   316   e-100
KYP71220.1 Retrovirus-related Pol polyprotein from transposon TN...   316   2e-98
KZV56298.1 hypothetical protein F511_00295 [Dorcoceras hygrometr...   307   4e-91
>JAT57914.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94,
partial [Anthurium amnicola]
Length = 529
Score = 316 bits (809), Expect = e-100, Method: Compositional matrix adjust.
Identities = 192/410 (46%), Positives = 251/410 (61%), Gaps = 33/410 (8%)
Query: 1 MGAILVQQGLLKTLNGKAALAESLSEAEKEDLMERAHGQILLCLSNEVLREVTTETTTDG 60
M AIL QQGL K L G ++ +LS+ EK+D+ ERA I LCLSNEVLREV E +
Sbjct: 23 MRAILTQQGLQKALLGIEKMSSTLSQEEKQDMDERALAVIQLCLSNEVLREVIHEKSAAA 82
Query: 61 LW--------------------------NVRWESLKDHMDELNRVFLDLANIDVKFEEED 94
LW V S+K H+DE N + +DL N++VK E+ED
Sbjct: 83 LWLKLESLYMTKSLTSKLHLKQRLFMLKMVEGTSVKTHLDEFNSILMDLENLEVKIEDED 142
Query: 95 KAVLLLASLPPAYESFVDSFMSGKTTTTFEDVKATLNSKELRRRSMKESSS---GDGLVV 151
+A+LLL SLPP+++ F ++ + G+ + + EDVK++L SKEL R + SS +GLV
Sbjct: 143 QALLLLCSLPPSFKHFRETLLYGRDSISVEDVKSSLFSKELMDRDLTGCSSEGVSEGLVA 202
Query: 152 RGRSQH--VVDGKGRSQSKSKPKKYRCFNCKKEGHFKRNCPELRKKPAEATIIEEDDSSF 209
RGRSQ KG +SKS+ K C CK +GH K C L+ K + E +
Sbjct: 203 RGRSQERSFEKKKGERRSKSRNKFKICNYCKMKGHIKSECYGLKNKQKKEEKNPEKAAEV 262
Query: 210 LAIIEECDEAMVLAINACFAEDRWILDTGCTFHVCPHKEWFSTYTVINGGKVLMRNDASC 269
+EC E VL++NA + D WILD+G ++H+CP+K+ F TY +GG VLM N+A
Sbjct: 263 GVAKDEC-EDYVLSVNAVRSNDEWILDSGYSYHMCPYKDRFHTYEHDDGGVVLMGNNAPR 321
Query: 270 KVVGIDTILIKMHDGVVRTITGINHIPDLKRNPISLGTLDDNGCKYVGEGGVLRVSKGGL 329
K +GI +I IKM+DG+VRT+T + H+ DLK+N ISLGTL+ NGCKY EGGVLRVS+G L
Sbjct: 322 KTIGIGSIRIKMYDGIVRTLTQVRHVLDLKKNLISLGTLEANGCKYSAEGGVLRVSRGAL 381
Query: 330 TNMKGLKKNGLYVLQGNTVTG-SVAVSSSDKEVETTKLWHMRLGHMSESG 378
MK + N LY L G TVTG + AVS S E E TKLWHMRLGHMSE G
Sbjct: 382 ILMKVKRTNSLYTLIGTTVTGAAAAVSPSMSESEVTKLWHMRLGHMSEKG 431
>KYP71220.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94
[Cajanus cajan]
Length = 690
Score = 316 bits (809), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 181/418 (43%), Positives = 244/418 (58%), Gaps = 50/418 (11%)
Query: 1 MGAILVQQGLLKTLNGKAALAESLSEAEKEDLMERAHGQILLCLSNEVLREVTTETTTDG 60
M A+L Q GL K L+GKA ++++ + ++L E+A I LCLS EVLREV ETT
Sbjct: 25 MKAVLTQNGLKKALDGKAKKPVNMTDEQWDELDEKALSAIQLCLSKEVLREVANETTAAA 84
Query: 61 LWNVRWESL---------------------------KDHMDELNRVFLDLANIDVKFEEE 93
LW ++ ESL + H++E N + +DL NI++K ++E
Sbjct: 85 LW-LKLESLYMTKSLANKLRLKERLYTIRMVEGTPIQSHLNEFNSIIMDLENIEIKIDDE 143
Query: 94 DKAVLLLASLPPAYESFVDSFM-SGKTTTTFEDVKATLNSKELRRRSMKESSSGDGLVVR 152
DKAVLL+ SLP Y+ F + + S T +FEDVK+ L SKE + G+GL VR
Sbjct: 144 DKAVLLIVSLPSTYKHFKEIMLYSNNDTLSFEDVKSNLLSKEKFDLDIHSEDKGEGLSVR 203
Query: 153 GRSQH---VVDGKGRSQSKSKPKKYRCFNCKKEGHFKRNCPELRKK---------PAEAT 200
GR+Q + K RS+S+ + C CKK GH +C L+KK PAEA
Sbjct: 204 GRTQEKGSTSNKKSRSKSRGRKSNKTCRYCKKFGHDISDCFILKKKQERQEKGKNPAEAA 263
Query: 201 IIEEDDSSFLAIIEECDEAMVLAINACFAEDRWILDTGCTFHVCPHKEWFSTYTVINGGK 260
+E D + I D+ ++ WILD+GCTFH+CP+K+ F+T ++ G
Sbjct: 264 NVETDSDGDVMISVSSDKR---------SKTEWILDSGCTFHMCPYKDLFTTLEPVDSGV 314
Query: 261 VLMRNDASCKVVGIDTILIKMHDGVVRTITGINHIPDLKRNPISLGTLDDNGCKYVGEGG 320
VLM ND CK+ GI TI IK HDG ++T++ + IPDLKRN ISLGTL+ GCKY EGG
Sbjct: 315 VLMGNDTQCKIAGIGTIQIKTHDGTIKTLSNVRFIPDLKRNLISLGTLESLGCKYSAEGG 374
Query: 321 VLRVSKGGLTNMKGLKKNGLYVLQGNTVTGSVAVSSSDKEVETTKLWHMRLGHMSESG 378
VL+VSKG + +K + LY+LQG+ VTGS AVSSS + + TKLWHMRLGHMSE G
Sbjct: 375 VLKVSKGAIVLLKANRIGSLYILQGSIVTGSAAVSSSMSDKDATKLWHMRLGHMSEKG 432
>KZV56298.1 hypothetical protein F511_00295 [Dorcoceras hygrometricum]
Length = 1309
Score = 307 bits (787), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 176/420 (41%), Positives = 250/420 (59%), Gaps = 59/420 (14%)
Query: 1 MGAILVQQGLLKTLNGKAALAESLSEAEKEDLMERAHGQILLCLSNEVLREVTTETTTDG 60
M A+LV GL LN + +++ + + + +A ILLCL +EVLREV E +
Sbjct: 24 MKALLVHTGLGGALNPEPQ-DDTIDKKKIVETDSKAFSAILLCLGDEVLREVAEEVSALS 82
Query: 61 LWNVRWESL---------------------------KDHMDELNRVFLDLANIDVKFEEE 93
LWN + ESL K HMDE N++ LDL N+D+K +E
Sbjct: 83 LWN-KLESLYLKRSLANRLYLKKSLYTIHLEEGKDLKKHMDEFNKIILDLKNVDIKITDE 141
Query: 94 DKAVLLLASLPPAYESFVDSFMSGKTTTTFEDVKATLNSKELRRRS-MKESSSGDGLVVR 152
D A+L+L+SLP +YE FVD+ + GK T T +VK+ LNSKEL +++ K S+G+GL VR
Sbjct: 142 DCAILMLSSLPRSYEHFVDTMLYGKETLTMAEVKSALNSKELHKKNETKMESTGEGLNVR 201
Query: 153 GR-----SQHVVDGKGRSQSKSKPKKYRCFNCKKEGHFKRNCP-------ELRKKPAEAT 200
GR S++ GK RSQS+++ K +CF C KEGHFK++CP E RK P +A
Sbjct: 202 GRTYKRESRNEKGGKHRSQSRTR-GKLKCFVCHKEGHFKKDCPDRRARNPERRKDPGDAA 260
Query: 201 IIEEDDSSFLAIIEECDEAMVLAINACFAEDRWILDTGCTFHVCPHKEWFSTYTVINGGK 260
++ + S A VL ++ +D W++D+GC+FH+CP K WF G
Sbjct: 261 VVSDGYES----------AEVLVVSRTNKQDCWVMDSGCSFHMCPIKSWFQNLVEEESGH 310
Query: 261 VLMRNDASCKVVGIDTILIKMHDGVVRTITGINHIPDLKRNPISLGTLDDNGCKYVGEGG 320
VL+ N+ CKV+GI ++L+KMHDG VRTIT + ++PDL+RN +S+G LD G EGG
Sbjct: 311 VLLGNNRECKVMGIGSVLLKMHDGCVRTITEVRYVPDLRRNLLSIGMLDSKGFNVKIEGG 370
Query: 321 VLRVSKGGLTNMKGLKKNGLYVLQGNTVTGS--VAVSSSDKEVETTKLWHMRLGHMSESG 378
++V KG LT M+G + NGLY+L+ +TVTGS AV ++K +LWH+RLGH+SE G
Sbjct: 371 TMKVIKGSLTVMRGSQDNGLYILEASTVTGSSNAAVGGANK----ARLWHLRLGHVSEKG 426