BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000058.1_g0150.1
(338 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value
KZV56298.1 hypothetical protein F511_00295 [Dorcoceras hygrometr...  248   2e-70
KHN13665.1 Retrovirus-related Pol polyprotein from transposon TN...  229   1e-69
KYP36635.1 Retrovirus-related Pol polyprotein from transposon TN...  226   3e-68
>KZV56298.1 hypothetical protein F511_00295 [Dorcoceras hygrometricum]
Length = 1309
Score = 248 bits (632), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/364 (39%), Positives = 210/364 (57%), Gaps = 66/364 (18%)
Query: 3 AKYEIETFDGSNDFDLWKIKMRTVLVQQGLLKMLNGKAALAESLSEVEKEDLMKRAHGQI 62
K+++E F GSNDF LW+IKM+ +LV GL LN + +++ + + + +A I
Sbjct: 4 TKFDLEKFTGSNDFSLWRIKMKALLVHTGLGGALNPEPQ-DDTIDKKKIVETDSKAFSAI 62
Query: 63 LLCLSNEMF-----------------------------------------EGKSLKDHMN 81
LLCL +E+ EGK LK HM+
Sbjct: 63 LLCLGDEVLREVAEEVSALSLWNKLESLYLKRSLANRLYLKKSLYTIHLEEGKDLKKHMD 122
Query: 82 ELNRVFLDLANISVKFEEEDKAVLLLESLLPSHESLVDSFMSGKTTTTFEDVKAALNSKE 141
E N++ LDL N+ +K +ED A+L+L SL S+E VD+ + GK T T +VK+ALNSKE
Sbjct: 123 EFNKIILDLKNVDIKITDEDCAILMLSSLPRSYEHFVDTMLYGKETLTMAEVKSALNSKE 182
Query: 142 LRRKS-IKESSLGDGLVVRRRT-----QHVVDGKGRSRSKSKPKNYRCFNGKKEGHFKKN 195
L +K+ K S G+GL VR RT ++ GK RS+S+++ K +CF KEGHFKK+
Sbjct: 183 LHKKNETKMESTGEGLNVRGRTYKRESRNEKGGKHRSQSRTRGK-LKCFVCHKEGHFKKD 241
Query: 196 CP-------ELRKKPGEAAVIEEDDGSVLAIIEECDEAMVLAINPCFAEDRWILDTGCTF 248
CP E RK PG+AAV+ DG + A VL ++ +D W++D+GC+F
Sbjct: 242 CPDRRARNPERRKDPGDAAVV--SDGY--------ESAEVLVVSRTNKQDCWVMDSGCSF 291
Query: 249 HVCPHKEWFSTYTVINNGKVLMGNDAPCKVVGIGTIQIKMHDGVVRTITDIRHVPDLKKN 308
H+CP K WF +G VL+GN+ CKV+GIG++ +KMHDG VRTIT++R+VPDL++N
Sbjct: 292 HMCPIKSWFQNLVEEESGHVLLGNNRECKVMGIGSVLLKMHDGCVRTITEVRYVPDLRRN 351
Query: 309 LISL 312
L+S+
Sbjct: 352 LLSI 355
>KHN13665.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94,
partial [Glycine soja]
Length = 337
Score = 229 bits (584), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 133/339 (39%), Positives = 198/339 (58%), Gaps = 49/339 (14%)
Query: 1 MMAKYEIETFDGSNDFDLWKIKMRTVLVQQGLLKMLNGKAALAESLSEVEKEDLMKRAHG 60
M ++++E F G NDF L +IKM+ +LV QGL L G + L +LS+ EK+DL+ +AH
Sbjct: 1 MGTRFDVEKFTGENDFSLRRIKMQALLVHQGLDDALQGASKLPSTLSDKEKKDLLSKAHS 60
Query: 61 QILLCLSNE-----------------------------------------MFEGKSLKDH 79
I+L L +E M EG S+K+H
Sbjct: 61 TIILSLGDEVLREVAEEKSAAGIWLKLESLYMTKSLTNKLYLKKRLHQLKMEEGSSIKEH 120
Query: 80 MNELNRVFLDLANISVKFEEEDKAVLLLESLLPSHESLVDSFMSGKTTTTFEDVKAALNS 139
++ + LDL ++ V+ +EED+AV+LL SL S E+LVD+ + G+ T T E+VKA LNS
Sbjct: 121 VSLFTKAVLDLKSVDVRIDEEDQAVMLLCSLPSSFENLVDTMLFGRDTLTLEEVKATLNS 180
Query: 140 KELRRKSIKESSLG---DGLVVRRRTQHVVDGKGRSRSKSKPKNYR-CFNGKKEGHFKKN 195
+EL++K + G + L+ R R + D K +++ +SK KN + C+ KKEGHF+K
Sbjct: 181 RELKKKITENKGEGGDPEALMARGRLEK-RDSKSKNKRRSKYKNEKACYYCKKEGHFRKE 239
Query: 196 CPELRKKPGEAAVIEEDDGSVLAIIEECDEAMVLAINPCFAEDRWILDTGCTFHVCPHKE 255
CPE RKK +E D +V+A + + A VL+I+ + WILD+GC+FH+ P+ E
Sbjct: 240 CPE-RKKKNNGKYNDESDIAVVA--DGYESAEVLSISTKKHSEEWILDSGCSFHMTPNLE 296
Query: 256 WFSTYTVINNGKVLMGNDAPCKVVGIGTIQIKMHDGVVR 294
WFS+Y I+ GKVLMGN+ C V+GIGTI++K+ DG V+
Sbjct: 297 WFSSYKEIDGGKVLMGNNMVCNVIGIGTIKLKVQDGFVK 335
>KYP36635.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94,
partial [Cajanus cajan]
Length = 364
Score = 226 bits (577), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 149/367 (40%), Positives = 212/367 (57%), Gaps = 63/367 (17%)
Query: 1 MMAKYEIETFDGSNDFDLWKIKMRTVLVQQGLLKMLNGKAALAESLSEVEK-------ED 53
M IE F+G N F+LW+IKMR +L +Q + L + E LSE ++ E+
Sbjct: 3 MTNTIRIEKFNGKNSFNLWRIKMRALLKEQRVWAPLATTSVKQEVLSEPKEKASASKIEE 62
Query: 54 LMK---RAHGQILLCLSNE----------------------------------------- 69
L + +AH ILL LS+E
Sbjct: 63 LAEQEEKAHSLILLSLSDEVLYEVADEETASGLWCKLEKLYMTKSICNKLLLKRRLFGLH 122
Query: 70 MFEGKSLKDHMNELNRVFLDLANISVKFEEEDKAVLLLESLLPSHESLVDSFMSGKTTTT 129
M EG LKDH++ELN V ++L +I VK E+ED A++LL SL PS+ES V+S GK T
Sbjct: 123 MKEGTPLKDHLDELNSVLMELRDIDVKIEDEDAAMILLASLPPSYESFVNSLSVGKECIT 182
Query: 130 FEDVKAALNSKELRRKSI--KESSLGDGLVVRRRTQHVVDGKGRSRSKSK--PKNYRCFN 185
E+VK++L+S+E R ++ E S G LVV +++ K +S+ K+ PK+ C
Sbjct: 183 MEEVKSSLHSREFRLRASGNSEESNGSSLVVSNSGKNMKKKKDKSKRKTNVNPKDI-CNY 241
Query: 186 GKKEGHFKKNCPELRKKPGEAAVIEEDDGSVLAIIEECDEAMVLAINPCFAEDRWILDTG 245
K+ GH+KK+CP+ + KP AAV +E+ S E + + +A P +ED+WILD+G
Sbjct: 242 CKEPGHWKKDCPKKKGKPS-AAVAKEESTS------ENELVLSIADQPQHSEDQWILDSG 294
Query: 246 CTFHVCPHKEWFSTYTVINNGKVLMGNDAPCKVVGIGTIQIKMHDGVVRTITDIRHVPDL 305
C+FH+CP++ WF TY + G V MGNDAPCK +GIGTI+IKMHDG+ RT+T++RHVP+L
Sbjct: 295 CSFHMCPNRTWFDTYEKKSGGNVFMGNDAPCKTIGIGTIKIKMHDGITRTLTEVRHVPEL 354
Query: 306 KKNLISL 312
KKNLIS+
Sbjct: 355 KKNLISV 361