BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000077.1_g0130.1
(235 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
KYP31892.1 Retrovirus-related Pol polyprotein from transposon TN...   216   4e-67
AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thal...   229   2e-65
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho...   226   2e-64
>KYP31892.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94,
partial [Cajanus cajan]
Length = 264
Score = 216 bits (551), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 105/226 (46%), Positives = 143/226 (63%), Gaps = 2/226 (0%)
Query: 1 LGHSHSAVIKQLHDDNKIVISNKIDKTFCKSCELGKSKCLPFANSSTTTNTPLYLIHCDV 60
LGH ++ + ++ + +S FC++C+ GK LPF SS+ L+H DV
Sbjct: 36 LGHPNNKFLDRVLKSCNVKLSPSDHFNFCEACQYGKMHFLPFKTSSSHAKEIFELVHTDV 95
Query: 61 WGPAPVTTPNGARYYILFLDDFSKYSWIYALKLRSDSPKCFCHFKATNENLLKEKIVYFQ 120
WGPAPVT+P+G +YY+ FLDDFS+++WIY LK +SD+ + F FK ENL ++I Q
Sbjct: 96 WGPAPVTSPSGFKYYVHFLDDFSRFTWIYPLKHKSDTAQAFTQFKNMAENLFNKRIKTIQ 155
Query: 121 SDGAPELQKGEFRAFLDKNGILFRCSCPYTPQQNGKAERKHRHITELGNTLSFHCLLPKQ 180
DG E + + A + GI FR SCPYT QQNG+AERKHRHITE TL +P
Sbjct: 156 CDGGGEYKTVQNHAI--EAGIQFRMSCPYTSQQNGRAERKHRHITEFSLTLLAQAKMPLH 213
Query: 181 LWFDAFSTVVYIINRLPTKTLQSSTPYDVLFHHSPDYSVLRVFGCA 226
W++AFST VY+INRLP+ Q+ +PY +LF PDY+ L+ FGCA
Sbjct: 214 YWWEAFSTAVYLINRLPSLVTQNESPYSLLFRKEPDYNSLKPFGCA 259
>AAF02855.1 Similar to retrotransposon proteins [Arabidopsis thaliana]
Length = 1522
Score = 229 bits (583), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 109/228 (47%), Positives = 148/228 (64%), Gaps = 1/228 (0%)
Query: 1 LGHSHSAVIKQLHDDNKIVISNKIDKTFCKSCELGKSKCLPFANSSTTTNTPLYLIHCDV 60
LGH+++ V+ QL I+I NK+ KT C++C LGKS LPF S+ + PL IHCD+
Sbjct: 464 LGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGKSTRLPFMLSTFNASRPLERIHCDL 523
Query: 61 WGPAPVTTPNGARYYILFLDDFSKYSWIYALKLRSDSPKCFCHFKATNENLLKEKIVYFQ 120
WGP+P ++ G RYY++F+D +S+++W Y LKL+SD F F+ EN L KI FQ
Sbjct: 524 WGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKSDFFSTFVMFQKLVENQLGHKIKIFQ 583
Query: 121 SDGAPELQKGEFRAFLDKNGILFRCSCPYTPQQNGKAERKHRHITELGNTLSFHCLLPKQ 180
DG E +F L +GI SCPYTPQQNG AERKHRHI ELG ++ F LP +
Sbjct: 584 CDGGGEFISSQFLKHLQDHGIQQNMSCPYTPQQNGMAERKHRHIVELGLSMIFQSKLPLK 643
Query: 181 LWFDAFSTVVYIINRLPTKTLQSS-TPYDVLFHHSPDYSVLRVFGCAC 227
W ++F T ++IN LPT +L ++ +PY L+ +P+YS LRVFGCAC
Sbjct: 644 YWLESFFTANFVINLLPTSSLDNNESPYQKLYGKAPEYSALRVFGCAC 691
>AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078 [Arabidopsis
thaliana]
Length = 1415
Score = 226 bits (576), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 106/227 (46%), Positives = 143/227 (62%)
Query: 1 LGHSHSAVIKQLHDDNKIVISNKIDKTFCKSCELGKSKCLPFANSSTTTNTPLYLIHCDV 60
LGH++S ++ L + I I+ C+ C++GKS LPF S + PL IHCD+
Sbjct: 460 LGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISDSRVLHPLDRIHCDL 519
Query: 61 WGPAPVTTPNGARYYILFLDDFSKYSWIYALKLRSDSPKCFCHFKATNENLLKEKIVYFQ 120
WGP+PV + G +YY +F+DD+S+YSW Y L +S+ F F+ EN L KI FQ
Sbjct: 520 WGPSPVVSNQGLKYYAIFVDDYSRYSWFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQ 579
Query: 121 SDGAPELQKGEFRAFLDKNGILFRCSCPYTPQQNGKAERKHRHITELGNTLSFHCLLPKQ 180
SDG E + + L ++GI R SCPYTPQQNG AERKHRH+ ELG ++ FH P++
Sbjct: 580 SDGGGEFVSNKLKTHLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQK 639
Query: 181 LWFDAFSTVVYIINRLPTKTLQSSTPYDVLFHHSPDYSVLRVFGCAC 227
W ++F T YIINRLP+ L++ +PY+ LF PDYS LRVFG AC
Sbjct: 640 FWVESFFTANYIINRLPSSVLKNLSPYEALFGEKPDYSSLRVFGSAC 686