BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000117.1_g1410.1
(372 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value
XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [...   133   2e-30
NP_194638.1 ribonuclease H-like protein [Arabidopsis thaliana] C...   131   2e-30
XP_010693383.1 PREDICTED: uncharacterized protein LOC104906342 [...   132   4e-30
>XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp.
vulgaris]
Length = 1712
Score = 133 bits (334), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 186/381 (48%), Gaps = 41/381 (10%)
Query: 5 RLLTVDQLIN--NNQWNGILIHQLFDSEVGKAICSI----NLRHDRDDTMKWSLTKSGNF 58
RL V LI+ + +W+ ++++LF+ + +AI ++ L HDR + W+ TK G +
Sbjct: 1316 RLKYVCDLIDFGSMEWDANVVNELFNEQDIQAILAVPLSERLPHDR---VAWAFTKDGRY 1372
Query: 59 TIKSMYFHLRTKNMDYHHPEWNYIWEIQTIPRIKLFLWKCCTNSLPVRGQIG-QYIGNVF 117
++K+ Y +++N+D H W IW +Q P+++ FLWK C+NSLPVR + ++I +
Sbjct: 1373 SVKTAYMVGKSRNLDLFHRAWVTIWGLQVSPKVRHFLWKICSNSLPVRAILKHRHITSDD 1432
Query: 118 NCGFC-DKVESLSHIMLHCTFTHTIWFFFNV--RVESIDN---LQDWILKWKSLEPELQS 171
C C + E++SH +LHC+ +W + ++ + D L W +W+ +E +
Sbjct: 1433 TCPLCLEGPETISHALLHCSKVREVWEMAGLTSKLPNGDGASWLDSWD-EWQEVEKDSLV 1491
Query: 172 LYANVLWFIWKARCEQCFSS----KEQTPV----------QVSNNIKGSGFALKKTKPAT 217
+ V +++W R + F EQ + S +I GS
Sbjct: 1492 ALSYVAYYVWHRRNKVVFEDWCRPNEQVAALAMRAAADYNEYSQHIYGS-------VAGQ 1544
Query: 218 NRKDSPVWQPPESPYLKCNVDASFKTLTEVAGYGGIIHNDKADFLFAFAGVIRG-QSAEE 276
N + S VWQPP + +K N DAS V G G + N+ + LFA + ++ E
Sbjct: 1545 NARSSKVWQPPPAGCVKLNADASIGDDGWV-GMGVVARNEVGEVLFAASRRVKAWWPVEV 1603
Query: 277 CEGLGILKILQWAHQLQITHLILESDNLNIIRYLNNCSSAIEWQTEKLMDNICEISGYFV 336
EG + ++ A + ++I E+D L I L+ + + ++++ S FV
Sbjct: 1604 AEGKALCLAIKLARSHDLQNVIFETDCLTITNRLSRGALFFS-DLDAVLEDALFFSRDFV 1662
Query: 337 SIRFVHIPRDLNNWADKLAKW 357
S+++ H+ RD N A LA++
Sbjct: 1663 SVKWSHVLRDGNFVAHHLARF 1683
>NP_194638.1 ribonuclease H-like protein [Arabidopsis thaliana] CAB43923.1
putative protein [Arabidopsis thaliana] CAB79667.1
putative protein [Arabidopsis thaliana] AAY78807.1
putative reverse transcriptase/RNA-dependent DNA
polymerase [Arabidopsis thaliana] AEE85585.1
Ribonuclease H-like superfamily protein [Arabidopsis
thaliana]
Length = 575
Score = 131 bits (330), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/389 (27%), Positives = 178/389 (45%), Gaps = 44/389 (11%)
Query: 6 LLTVDQLINNN--QWNGILIHQLFDSEVGKAICSINLRHDRD-DTMKWSLTKSGNFTIKS 62
+L V LI+ + +W +I LF K I + R D+ W T SG++T+KS
Sbjct: 168 ILKVSDLIDESGREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKS 227
Query: 63 MYFHL------RTKNMDYHHPEWN----YIWEIQTIPRIKLFLWKCCTNSLPVRGQIG-Q 111
Y+ L R+ + P N IW+ QT P+I+ FLWKC +NSLPV G + +
Sbjct: 228 GYWVLTQIINKRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYR 287
Query: 112 YIGNVFNCGFCDKV-ESLSHIMLHCTFTHTIWFFFNVRVESIDNLQD-------WILKWK 163
++ C C E+++H++ CTF W ++ + D W+
Sbjct: 288 HLSKESACIRCPSCKETVNHLLFKCTFARLTWAISSIPIPLGGEWADSIYVNLYWVFNLG 347
Query: 164 SLEPELQS---LYANVLWFIWKARCEQCFSSKEQTPVQVSNNIKGSGFALK--------K 212
+ P+ + L +LW +WK R E F +E +V + +
Sbjct: 348 NGNPQWEKASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCG 407
Query: 213 TKPATNRKDSPVWQPPESPYLKCNVDASFKTLTEVAGYGGIIHNDKADFLFAFAGVI-RG 271
TKP NR W+PP ++KCN DA++ E G G ++ N+K + + A + +
Sbjct: 408 TKPQVNRSSCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKL 467
Query: 272 QSAEECEGLGILKILQWA----HQLQITHLILESDNLNIIRYLNNCSSAIEWQTEKLMDN 327
+S E E L+ ++WA + Q ++I ESD+ +I LNN I + + +
Sbjct: 468 KSVLEAE----LEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNN--DEIWPSLKPTIQD 521
Query: 328 ICEISGYFVSIRFVHIPRDLNNWADKLAK 356
+ + F ++FV IPR+ N A+++A+
Sbjct: 522 LQRLLSQFTEVKFVFIPREGNTLAERVAR 550
>XP_010693383.1 PREDICTED: uncharacterized protein LOC104906342 [Beta vulgaris subsp.
vulgaris]
Length = 1157
Score = 132 bits (331), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 173/372 (46%), Gaps = 34/372 (9%)
Query: 9 VDQLINNN--QWNGILIHQLFDSEVGKAICSINL-RHDRDDTMKWSLTKSGNFTIKSMYF 65
V +LI+++ +WN L+ +LF + I +I L + D + W+ TKSG +++K+ Y
Sbjct: 766 VSELIHSDTGEWNLELLARLFTERDQECILAIPLSERSQRDIITWAFTKSGEYSVKTAYM 825
Query: 66 HLRTKNMDYHHPEWNYIWEIQTIPRIKLFLWKCCTNSLPVRGQIG-QYIGNVFNCGFCDK 124
+ +D H W IW I+ P+++ FLW+ CT +LP + + +++ +C +C
Sbjct: 826 VGKGFELDNFHNAWVTIWNIEASPKVRFFLWRLCTGTLPTKALLHYRHLIEEEHCPWCGA 885
Query: 125 VESLSHIMLHCTFTHTIWFFFN----VRVESIDNLQDWILKWKSLEPELQSLYANVLWFI 180
VE+ H + C+ +W ++ + D++ KSLE + Q A + W I
Sbjct: 886 VETDRHAIFECSRVAELWEGSGSSHLIQSVGTTTMLDFVASRKSLEKKEQQKLAMLAWCI 945
Query: 181 WKARCEQCFSSKEQTPVQV---------------SNNIKGSGFALKKTKPATNRKDSPVW 225
W R E+ F++ TP V S I GS + +R + +W
Sbjct: 946 WSERNEKVFNNT-FTPNTVLLARLHRLTTEHDKYSQRIYGS-------RREGSRGSAKIW 997
Query: 226 QPPESPYLKCNVDASFKTLTEVAGYGGIIHNDKADFLFAFAGVIRGQ-SAEECEGLGILK 284
Q P ++K N DAS + G G + ++ LFA +R E EG +L
Sbjct: 998 QSPAVGHVKLNCDASL-AVDGWRGLGVVARDNAGRVLFAACRRVRANWPVEIAEGKALLM 1056
Query: 285 ILQWAHQLQITHLILESDNLNIIRYLNNCSSAIEWQTEKLMDNICEISGYFVSIRFVHIP 344
L+ A + + + LESD+ +I L+ + + ++D+I S F+S+ + H+
Sbjct: 1057 ALRLAERFGLRQVTLESDSQVLITRLSKAMTYFS-DLDSVLDDILAKSCNFLSVDWSHVK 1115
Query: 345 RDLNNWADKLAK 356
RD N A LAK
Sbjct: 1116 RDGNVVAHHLAK 1127