BLAST Resut
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000118.1_g0880.1
(387 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AFP55574.1 non-ltr retroelement reverse transcriptase [Rosa rugosa] 293 1e-85
XP_016694234.1 PREDICTED: uncharacterized protein LOC107910809 [... 275 6e-84
XP_012448868.1 PREDICTED: uncharacterized protein LOC105772073 [... 275 2e-82
>AFP55574.1 non-ltr retroelement reverse transcriptase [Rosa rugosa]
Length = 1656
Score = 293 bits (750), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 145/366 (39%), Positives = 213/366 (58%), Gaps = 3/366 (0%)
Query: 1 MSKAFDRIEWKFLTEIMHKLGYSRHWIRMISQCISTSLMAVLVNGRPGPIFYPSRGIRQD 60
M+KA+DR+EW FL ++ K+G+ WI ++ C++TS ++VL+NG+PGP F PSRG+RQ
Sbjct: 1186 MNKAYDRVEWNFLEAVLRKMGFVDSWIGLVMSCVTTSSLSVLINGKPGPSFLPSRGLRQG 1245
Query: 61 DPLSPFLFTIAMEALSRLINANHNGNKFIGFPTNNPNLNISHLLFADDCIIFGRNSLENI 120
DPLSPFLF + LSR+IN + NL +SHL FADD + F R +L+N
Sbjct: 1246 DPLSPFLFLFVNDVLSRMINKMCQDSLLTPVTIGPNNLPVSHLFFADDSLFFLRATLQNC 1305
Query: 121 HTLKSILKMFCDSSGQMINYSKSNIFYSKNSHPKFKRLITRTLKVKYASTSEKYLGAQLF 180
TL +L +C +SGQ+IN KS+IF+S N+ P+ L++ +++ S YLG F
Sbjct: 1306 ETLSDLLHTYCIASGQLINVEKSSIFFSPNTPPEIAHLLSSIMQIPVVSDPGTYLGLPTF 1365
Query: 181 IGANKKKVFNDIIDKIKNKLDKWNYNFLSQAGRTIVISTIAAAVPRYQMQCFALPKGTSK 240
+KKK I D I K+ W LSQAG+ ++I +A A+P Y M CF P K
Sbjct: 1366 WHRSKKKALGFIKDSILRKVKGWKQATLSQAGKEVLIKAVATAIPAYPMGCFKFPSTLCK 1425
Query: 241 SISTLQKSFWWGR--SKGICTKSWASICTPKKMGGLGIHNTELDNQAMLSKLAWKIKSEP 298
++ + FWWG ++GI KSW + PKK GG+G N E N ++L+K AW++ P
Sbjct: 1426 ELNGILADFWWGNVDTRGIHWKSWDFLARPKKDGGMGFRNLEDFNNSLLAKQAWRLHQNP 1485
Query: 299 KAIWVQFLKAKYYNNSEHPNLAKS-HHSWHWKNISKHLHNIDKHSFWEVQNGKNIEIWKD 357
A+W + L+ YY S K + SW W ++ + I K + W + NG ++ I D
Sbjct: 1486 FALWARVLEQLYYPRSSFLEAPKGPNPSWIWNSLLIGRNFIHKEALWNIGNGFSVNIVGD 1545
Query: 358 NWIPTL 363
NWIP++
Sbjct: 1546 NWIPSI 1551
>XP_016694234.1 PREDICTED: uncharacterized protein LOC107910809 [Gossypium
hirsutum]
Length = 570
Score = 275 bits (703), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 151/394 (38%), Positives = 217/394 (55%), Gaps = 15/394 (3%)
Query: 1 MSKAFDRIEWKFLTEIMHKLGYSRHWIRMISQCISTSLMAVLVNGRPGPIFYPSRGIRQD 60
MSKA+DR+EW F+ ++M ++G+++ WI I +C++T V++NG G FYP RG+RQ
Sbjct: 99 MSKAYDRVEWIFIKKLMSRMGFAQSWIDNIMKCLTTVSYKVVINGTVGMKFYPERGLRQG 158
Query: 61 DPLSPFLFTIAMEALSRLINANHNGNKFIGFPTNNPNLNISHLLFADDCIIFGRNSLENI 120
DP SPF+F I LS L+ G + ISHLLFADDCI+FG +
Sbjct: 159 DPFSPFIFLIC-GGLSSLMRTAVEEGFLKGVKVSRRGPQISHLLFADDCILFGEATCRGA 217
Query: 121 HTLKSILKMFCDSSGQMINYSKSNIFYSKNSHPKFKRLITRTLKVKYASTSEKYLGAQLF 180
+ K+IL + SGQ +N+ KS IF+SKN+ + +R I L+V+ ++ SE+YLG
Sbjct: 218 TSFKAILSEYRRCSGQCVNFEKSTIFFSKNTIEEERRRIVILLRVRSSNESERYLGLPTM 277
Query: 181 IGANKKKVFNDIIDKIKNKLDKWNYNFLSQAGRTIVISTIAAAVPRYQMQCFALPKGTSK 240
+G KK F + DKIK + D W+ FLSQ G+ + I A+P Y M CF L K
Sbjct: 278 VGRQKKVSFQVLKDKIKQRTDNWSTRFLSQGGKEVFIKAALQAIPTYSMACFLLSKSLCD 337
Query: 241 SISTLQKSFWWGRSK---GICTKSWASICTPKKMGGLGIHNTELDNQAMLSKLAWKIKSE 297
+ + + FWW + + GI SW ++C+ K+ GGLG N N A+L+K W++ +
Sbjct: 338 EMEAIIERFWWKKGQGKGGIHWSSWKNLCSLKENGGLGFRNLSQFNIALLAKQGWRLFNY 397
Query: 298 PKAIWVQFLKAKYYNNS-----EHPNLAKSHHSWHWKNISKHLHNIDKHSFWEVQNGKNI 352
P ++ + LKAKYY NS E NL S WK+I + + W V G NI
Sbjct: 398 PNSLLAKVLKAKYYPNSNFLSAELGNLP----SLTWKSIWAAKGLLTQGLCWRVGKGNNI 453
Query: 353 EIWKDNWIPTLNNLLENI--YNSQLTKVSQLIEG 384
IW D WIP + + N +L VS LI+G
Sbjct: 454 SIWNDRWIPGIEPSIWQYSHQNGELENVSDLIDG 487
>XP_012448868.1 PREDICTED: uncharacterized protein LOC105772073 [Gossypium
raimondii]
Length = 748
Score = 275 bits (704), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 148/393 (37%), Positives = 218/393 (55%), Gaps = 7/393 (1%)
Query: 1 MSKAFDRIEWKFLTEIMHKLGYSRHWIRMISQCISTSLMAVLVNGRPGPIFYPSRGIRQD 60
MSKA+DR EW F+ E+M K+G+ W+ ++ +C++ +V++N G FYP+RG+RQ
Sbjct: 7 MSKAYDRAEWSFVEEVMKKMGFDSEWVTILLRCVTLVSYSVVINNHIGESFYPTRGLRQG 66
Query: 61 DPLSPFLFTIAMEALSRLINANHNGNKFIGFPTNNPNLNISHLLFADDCIIFGRNSLENI 120
DPLSPFLF E LS L+ N+ G + +SHLLFADDC++FG + +
Sbjct: 67 DPLSPFLFLFCGEGLSSLMKLGMMENRVRGVKASKSRPPVSHLLFADDCLLFGEATERSS 126
Query: 121 HTLKSILKMFCDSSGQMINYSKSNIFYSKNSHPKFKRLITRTLKVKYASTSEKYLGAQLF 180
LK IL + SGQ +N+SKS IF+S NS + +R ITR L V+ + E+YLG
Sbjct: 127 IYLKQILHNYEVCSGQKVNFSKSTIFFSSNSQEEERRTITRVLGVRRSDNMERYLGLPNL 186
Query: 181 IGANKKKVFNDIIDKIKNKLDKWNYNFLSQAGRTIVISTIAAAVPRYQMQCFALPKGTSK 240
+G KK+ F + DK K +++ W+ +LSQ G+ I I I ++P Y M CF LPK
Sbjct: 187 VGRRKKEAFQILKDKFKQRIENWSIKYLSQGGKEIFIKAILQSIPTYAMSCFLLPKSLCN 246
Query: 241 SISTLQKSFWWG---RSKGICTKSWASICTPKKMGGLGIHNTELDNQAMLSKLAWKIKSE 297
+ + +WW R KGI +W +C K+ GGLG + N A+L+K W++ +
Sbjct: 247 ELEGIIAKYWWQKNKRKKGIHWCAWKDVCLQKESGGLGFRSFNKFNVALLAKQGWRLFNY 306
Query: 298 PKAIWVQFLKAKYYNNSEHPNLAKSH-HSWHWKNISKHLHNIDKHSFWEVQNGKNIEIWK 356
P + + LKAKY+ + N + + S W++I + W V G +I IW
Sbjct: 307 PSYLLARVLKAKYFPKTNFLNASLGNLPSLTWRSIWASKKLLLDGLCWRVGIGNSISIWN 366
Query: 357 DNWIP--TLNNLLENIYNSQLTKVSQLI-EGNQ 386
D WIP T +NL N N + VS LI EG++
Sbjct: 367 DRWIPGVTEDNLQNNSRNDGIRLVSDLINEGSK 399