BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000066.1_g0520.1
(619 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

AFP55574.1 non-ltr retroelement reverse transcriptase [Rosa rugosa]   342  2e-99
CCA66036.1 hypothetical protein [Beta vulgaris subsp. vulgaris]       329  1e-95
XP_007226707.1 hypothetical protein PRUPE_ppa020120mg [Prunus pe...   316  1e-92
>AFP55574.1 non-ltr retroelement reverse transcriptase [Rosa rugosa]
Length = 1656
Score = 342 bits (876), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 191/554 (34%), Positives = 291/554 (52%), Gaps = 67/554 (12%)
Query: 132 EFNLINNQLEDLYRLQESLWKEKSRNNFLSVGDKNTKIFHSQAIQRNRTNKIIAIKDTNG 191
+ N++ +Q+ L+ E W ++SR N+L +GD+N+ FH IQR + NKI+ +KD +G
Sbjct: 926 QINILTDQVTKLWTQDEMYWHQRSRVNWLKLGDQNSSFFHQTTIQRRQYNKIVRLKDDHG 985
Query: 192 EWQEGLENIQAVFTSHLLNISSSQGPNNNHTILNLFHQTITLEQNIELSRIPDQEEINSA 251
W + ++ F + + S GP +L+ +T E N LS E+ A
Sbjct: 986 NWLDSEADVALQFLDYFTALYQSNGPQQWEEVLDFVDTAVTAEMNKILSSPVSLLEVKKA 1045
Query: 252 I----------------------------------------SSL-------------KKE 258
+ SSL K +
Sbjct: 1046 VFDLGATKAPGPDGFSGIFYQNQWEWVQSIIHESALQHQTSSSLLQVMNRTHLALIPKVK 1105
Query: 259 AAPGPDGYPPTSF---------NLTGTRFKSSLHNIISPSQAAYIPDRQISDNIIMGQEI 309
A P Y P + + +R + + +IS +Q+A++ +RQI DN+I+ EI
Sbjct: 1106 APTHPSHYRPIALCNFSYKILTKIIASRLQPFMSELISDNQSAFVSNRQIQDNVIIAHEI 1165
Query: 310 IHSLKTMKGAT-GYFGLKLDMSKAFDRIEWSFLADILAKLGYSDHWIKMIAQCMTTSSMA 368
H LK + G FGLKLDM+KA+DR+EW+FL +L K+G+ D WI ++ C+TTSS++
Sbjct: 1166 YHHLKLTRSCNNGAFGLKLDMNKAYDRVEWNFLEAVLRKMGFVDSWIGLVMSCVTTSSLS 1225
Query: 369 VLVNGRPGPIFYPSRGIRQGDPLSPFLFTLAMEGLSRLLKEEQDRDNFKGFPTNNPN-LT 427
VL+NG+PGP F PSRG+RQGDPLSPFLF + LSR++ + +D+ T PN L
Sbjct: 1226 VLINGKPGPSFLPSRGLRQGDPLSPFLFLFVNDVLSRMINK-MCQDSLLTPVTIGPNNLP 1284
Query: 428 ISHLLFADDCIIFGRNSLDNIHTLKTILDAFCNASGQMINYAKSNIFYSKNSHPKFKRII 487
+SHL FADD + F R +L N TL +L +C ASGQ+IN KS+IF+S N+ P+ ++
Sbjct: 1285 VSHLFFADDSLFFLRATLQNCETLSDLLHTYCIASGQLINVEKSSIFFSPNTPPEIAHLL 1344
Query: 488 MRSLKVKYASTSEKYLGAQLFIGANKKQIYNDILHKIKMKLEKWNHSFLSQAGRTIVIST 547
+++ S YLG F +KK+ I I K++ W + LSQAG+ ++I
Sbjct: 1345 SSIMQIPVVSDPGTYLGLPTFWHRSKKKALGFIKDSILRKVKGWKQATLSQAGKEVLIKA 1404
Query: 548 IAAVVPRYQMQCFALPKGICKSISTLQKSFWWG--KSKGICTKSWSSICLPKFLGGLGIH 605
+A +P Y M CF P +CK ++ + FWWG ++GI KSW + PK GG+G
Sbjct: 1405 VATAIPAYPMGCFKFPSTLCKELNGILADFWWGNVDTRGIHWKSWDFLARPKKDGGMGFR 1464
Query: 606 SPELDNHAMLSKLA 619
+ E N+++L+K A
Sbjct: 1465 NLEDFNNSLLAKQA 1478
>CCA66036.1 hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1369
Score = 329 bits (844), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 185/571 (32%), Positives = 295/571 (51%), Gaps = 66/571 (11%)
Query: 115 NNIKVLITKLNNTTNKKEFNLINNQLEDLYRLQESLWKEKSRNNFLSVGDKNTKIFHSQA 174
+ +KVL+ + N ++ ++++L + +E W ++SR +++ GDKNTK FH +A
Sbjct: 309 HQMKVLMESEPSEDNIMHMRALDARMDELEKREEVYWHQRSRQDWIKSGDKNTKFFHQKA 368
Query: 175 IQRNRTNKIIAIKDTNGEWQEGLENIQAVFTSHLLNISSSQGPNNNHTILNLFHQTITLE 234
R + N + I++ GEW E +++ F + N+ S ILN+ IT E
Sbjct: 369 SHREQRNNVRRIRNEAGEWFEDEDDVTECFAHYFENLFQSGNNCEMDPILNIVKPQITDE 428
Query: 235 QNIELSRIPDQEEINSAISSL--------------------------------------- 255
+L +EE+++A++ +
Sbjct: 429 LGTQLDAPFRREEVSAALAQMHPNKAPGPDGMNALFYQHFWDTIGEDVTTKVLNMLNNVD 488
Query: 256 --------------KKEAAPGPDGYPPTSF---------NLTGTRFKSSLHNIISPSQAA 292
KK+ P + P S + R K L +I SQ+
Sbjct: 489 NIGAVNQTHIVLIPKKKHCESPVDFRPISLCNVLYKIVAKVLANRMKMVLPMVIHESQSG 548
Query: 293 YIPDRQISDNIIMGQEIIHSLKTMK-GATGYFGLKLDMSKAFDRIEWSFLADILAKLGYS 351
++P R I+DN+++ E H L+ K G GY GLKLDMSKA+DR+EW FL +++ KLG+
Sbjct: 549 FVPGRLITDNVLVAYECFHFLRKKKTGKKGYLGLKLDMSKAYDRVEWCFLENMMLKLGFP 608
Query: 352 DHWIKMIAQCMTTSSMAVLVNGRPGPIFYPSRGIRQGDPLSPFLFTLAMEGLSRLLKEEQ 411
+ K++ C+T++ +VLVNG+P F+PSRG+RQGDPLSPFLF + EGLS LL++ +
Sbjct: 609 TRYTKLVMNCVTSARFSVLVNGQPSRNFFPSRGLRQGDPLSPFLFVVCAEGLSTLLRDAE 668
Query: 412 DRDNFKGFPTNNPNLTISHLLFADDCIIFGRNSLDNIHTLKTILDAFCNASGQMINYAKS 471
++ G + ISHL FADD ++F R + + + + IL + ASGQ +N KS
Sbjct: 669 EKKVIHGVKIGHRVSPISHLFFADDSLLFIRATEEEVENVMDILSTYEAASGQKLNMEKS 728
Query: 472 NIFYSKNSHPKFKRIIMRSLKVKYASTSEKYLGAQLFIGANKKQIYNDILHKIKMKLEKW 531
+ YS+N P + L K EKYLG FIG++KK+++ I ++ KL+ W
Sbjct: 729 EMSYSRNLEPDKINTLQMKLAFKTVEGHEKYLGLPTFIGSSKKRVFQAIQDRVWKKLKGW 788
Query: 532 NHSFLSQAGRTIVISTIAAVVPRYQMQCFALPKGICKSISTLQKSFWWGK---SKGICTK 588
+LSQAGR ++I +A +P Y MQCF +PK I I + ++F+WG+ + +
Sbjct: 789 KGKYLSQAGREVLIKAVAQAIPTYAMQCFVIPKSIIDGIEKMCRNFFWGQKEEERRVAWV 848
Query: 589 SWSSICLPKFLGGLGIHSPELDNHAMLSKLA 619
+W + LPK GGLGI + ++ N A+L+K A
Sbjct: 849 AWEKLFLPKKEGGLGIRNFDVFNRALLAKQA 879
Score = 64.3 bits (155), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 56/110 (50%), Gaps = 7/110 (6%)
Query: 9 VLSWNVRGITSFSTRKHLMSLINKHNPDIIFLAETKTKSYNINKYIKINKHYDHFFV--- 65
+LSWN RG+ S S L L+ NP I+FL+ETK KSY + +K ++H
Sbjct: 4 ILSWNCRGMGSPSALSALRRLLASENPQIVFLSETKLKSYEMES-VKKKLKWEHMVAVDC 62
Query: 66 --EPTNKSGGLAMYWKHNLTLKILYSDNNMIHVEIHNTNDNPDYVITGFY 113
E + GGLAM W+ + ++++ +N I + + ++ TG Y
Sbjct: 63 EGECRKRRGGLAMLWRSEIKVQVMSMSSNHIDIVVGEEAQG-EWRFTGIY 111
>XP_007226707.1 hypothetical protein PRUPE_ppa020120mg [Prunus persica] EMJ27906.1
hypothetical protein PRUPE_ppa020120mg [Prunus persica]
Length = 1011
Score = 316 bits (810), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 174/523 (33%), Positives = 272/523 (52%), Gaps = 48/523 (9%)
Query: 133 FNLINNQLEDLYRLQESLWKEKSRNNFLSVGDKNTKIFHSQAIQRNRTNKIIAIKDTNGE 192
++L+ +QL+ L +E+ WK++S+ ++L GD+NT+ FH +A R + N + ++D G
Sbjct: 249 YHLLMSQLDSLLSREEAFWKQRSKVSWLKEGDRNTRFFHQRASNRKQRNYVKGLRDNTGR 308
Query: 193 WQEGLENIQAVFTSHLLNISSSQGPNNNHTILNLFHQTITLEQNIELSRIPDQEEINSAI 252
W+E + +Q V + ++ +S + ++ +T + N L EI+ A+
Sbjct: 309 WREDEQGLQYVVLDYFTHLFTSSASGSEGESIDAVESRVTPDMNNLLLTDYCDAEIHEAV 368
Query: 253 SSLKKEAAPGPDGYPPTSFN----------------------------------LTGTRF 278
+ APGPDG PP F + R
Sbjct: 369 FQMYPTKAPGPDGMPPIFFQKYWHIVGSDVKHPKDMSQLRPISLCNVLFKIATKVLANRL 428
Query: 279 KSSLHNIISPSQAAYIPDRQISDNIIMGQEIIHSLKTMKGATGYFG-LKLDMSKAFDRIE 337
K LH IISPSQ+A+I R ISDN I+ EIIH L+ + F LK+DMSKA+DRIE
Sbjct: 429 KLILHKIISPSQSAFISGRLISDNTILAAEIIHYLRRRRRGKKGFMVLKMDMSKAYDRIE 488
Query: 338 WSFLADILAKLGYSDHWIKMIAQCMTTSSMAVLVNGRPGPIFYPSRGIRQGDPLSPFLFT 397
WSFL I+ KLG+++ WI+++ C++T S + ++NG P +PSRG+ QGDPLSP+LF
Sbjct: 489 WSFLEAIMRKLGFAEQWIQLMLTCISTVSYSFVINGTPHGFLHPSRGLHQGDPLSPYLFL 548
Query: 398 LAMEGLSRLLKEEQDRDNFKGFPTNNPNLTISHLLFADDCIIFGRNSLDNIHTLKTILDA 457
L EGL+ L+ +++ KG ISHL FADD ++F R ++ +
Sbjct: 549 LCAEGLTELIAQKEREGFLKGVSICRGAPAISHLFFADDSVLFARANMADCM-------- 600
Query: 458 FCNASGQMINYAKSNIFYSKNSHPKFKRIIMRSLKVKYASTSEKYLGAQLFIGANKKQIY 517
ASGQ +N+ KS + +SKN H + ++ + + + +YLG + + K +
Sbjct: 601 --RASGQQVNFQKSAVCFSKNVHRGDQLMLAQFMGIPCVDHHSQYLGLPMVLDKKKGASF 658
Query: 518 NDILHKIKMKLEKWNHSFLSQAGRTIVISTIAAVVPRYQMQCFALPKGICKSISTLQKSF 577
N + ++ KL+ W LS AG+ I+I +A +P Y M F LPK +C+ ++ L F
Sbjct: 659 NHLKERLWKKLQTWKGKLLSGAGKEILIKVVAQAIPIYTMSYFLLPKYVCEDLNKLVAQF 718
Query: 578 WWGKS---KGICTKSWSSICLPKFLGGLGIHSPELDNHAMLSK 617
WW S K I +W +C PK GGLG + N A+L+K
Sbjct: 719 WWNSSTENKKIHWMAWDRLCAPKEEGGLGFRNLHAFNLALLAK 761