BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000058.1_g0450.1
(710 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis tha... 343 1e-98
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho... 339 7e-98
CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 pu... 328 8e-94
>OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis thaliana]
Length = 2099
Score = 343 bits (881), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 167/375 (44%), Positives = 242/375 (64%), Gaps = 7/375 (1%)
Query: 234 WLADTGASSHMTHSADNLKGSQPYSGTEAVMVGSGNFLPISSTGTSTLLTSTHVFDLGNV 293
W+ D+GA++H+T+S NL+ SQPY G+++VMVG+G+FLPI+ TG++TL +S+ + L +V
Sbjct: 446 WVGDSGATAHVTNSTHNLQQSQPYGGSDSVMVGNGDFLPITHTGSTTLPSSSGILSLKDV 505
Query: 294 LCVPNLKRNLLSISKFTKDNSCSVEFMPWGYKIKDIWSKQTIAEGPMRNNLYPIEAQCVR 353
L PN+ ++L+S+SK T+D CSV+F ++ D +K+ +A+G N LY ++ V
Sbjct: 506 LVCPNIGKSLVSVSKLTRDYPCSVDFDCDYVRVTDKATKKLLAQGNNFNGLYVLKDSSVH 565
Query: 354 SSQDITAAHVTKITTNYDIWHARLGHSHSGTIRQLHQEHKIIVSNINDKPFCKSCELGKR 413
+ + TT+ D+WH RLGH + ++ LH+ + +S + K C++C+ GK
Sbjct: 566 AFYS-----SRQQTTSEDVWHMRLGHPNQQILQLLHKNKAVNISK-SSKGICEACQYGKS 619
Query: 414 RCLPFTSSSSTTSHPLYMIHCDIWGPTPIQTPHGARYYILFLDDYSKYAWIYAMKVRSDS 473
LPF+SS ST S PL IHCD+WGP PI++ G YY +F+D+YS++ W Y +K +SD
Sbjct: 620 SRLPFSSSCSTISKPLQKIHCDLWGPAPIKSVQGFSYYAIFVDNYSRFCWFYPLKFKSDF 679
Query: 474 IKCFTHFKATTENLLKEKIVYFQSDGAPELQKGDFRAFLDKHGITFRSSCPYTPQQNGKA 533
K FT F+A EN + KI FQ DG E F L +HGI SCPYTPQQNG A
Sbjct: 680 FKIFTIFQALVENQFQNKIGSFQCDGGGEFTSARFLNHLQQHGIQQLISCPYTPQQNGLA 739
Query: 534 ERKHRHITELGNTLSFHCSLPKKLWFDAFSTAVFIINRLPTPTL-QGLTPFEILFHIPPD 592
ERKHRH+ EL + H +P K W + F TA F+IN LPT L + +P+E+L+ P+
Sbjct: 740 ERKHRHLIELALAMMCHSRMPLKYWVEGFFTANFLINLLPTTALTESKSPYEVLYVHKPN 799
Query: 593 YTNLRIFGCACYPHL 607
YT+LR+FGCACYP L
Sbjct: 800 YTSLRVFGCACYPTL 814
>AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078 [Arabidopsis
thaliana]
Length = 1415
Score = 339 bits (869), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 163/387 (42%), Positives = 236/387 (60%), Gaps = 9/387 (2%)
Query: 224 ISPIQGKDSSWLADTGASSHMTHSADNLKGSQPYSGTEAVMVGSGNFLPISSTGTSTLLT 283
+S GK+ W D+ A++H+T S + L+ + Y G +AV+VG G +LPI+ TG++T+ +
Sbjct: 314 VSDDTGKE--WHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKS 371
Query: 284 STHVFDLGNVLCVPNLKRNLLSISKFTKDNSCSVEFMPWGYKIKDIWSKQTIAEGPMRNN 343
S L VL VPN++++LLS+SK D C V F I D+ +++ + GP RN
Sbjct: 372 SNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNG 431
Query: 344 LYPIEAQCVRSSQDITAAHVTK-ITTNYDIWHARLGHSHSGTIRQLHQEHKIIVSNINDK 402
LY +E +Q+ A + + ++WH RLGH++S ++ L I ++
Sbjct: 432 LYVLE------NQEFVALYSNRQCAATEEVWHHRLGHANSKALQHLQNSKAIQINKSRTS 485
Query: 403 PFCKSCELGKRRCLPFTSSSSTTSHPLYMIHCDIWGPTPIQTPHGARYYILFLDDYSKYA 462
P C+ C++GK LPF S S HPL IHCD+WGP+P+ + G +YY +F+DDYS+Y+
Sbjct: 486 PVCEPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYS 545
Query: 463 WIYAMKVRSDSIKCFTHFKATTENLLKEKIVYFQSDGAPELQKGDFRAFLDKHGITFRSS 522
W Y + +S+ + F F+ EN L KI FQSDG E + L +HGI R S
Sbjct: 546 WFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLKTHLSEHGIHHRIS 605
Query: 523 CPYTPQQNGKAERKHRHITELGNTLSFHCSLPKKLWFDAFSTAVFIINRLPTPTLQGLTP 582
CPYTPQQNG AERKHRH+ ELG ++ FH P+K W ++F TA +IINRLP+ L+ L+P
Sbjct: 606 CPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINRLPSSVLKNLSP 665
Query: 583 FEILFHIPPDYTNLRIFGCACYPHLDP 609
+E LF PDY++LR+FG ACYP L P
Sbjct: 666 YEALFGEKPDYSSLRVFGSACYPCLRP 692
>CAB43904.1 putative protein [Arabidopsis thaliana] CAB81478.1 putative protein
[Arabidopsis thaliana]
Length = 1415
Score = 328 bits (840), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 159/385 (41%), Positives = 238/385 (61%), Gaps = 15/385 (3%)
Query: 230 KDSSWLADTGASSHMTHSADNLKGSQPYSGTEAVMVGSGNFLPISSTGTSTLLTSTHVFD 289
K + W+ D+GA+SH+T+S L+ +QPYSG ++V+VG+ +FLPI+ G++ L ++
Sbjct: 289 KSNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTSNQGNLP 348
Query: 290 LGNVLCVPNLKRNLLSISKFTKDNSCSVEFMPWGYKIKDIWSKQTIAEGPMRNNLYPIE- 348
L +VL PN+ ++LLS+SK T D C +EF G +KD +KQ + +G N+LY +E
Sbjct: 349 LRDVLVCPNITKSLLSVSKLTSDYPCVIEFDSDGVIVKDKLTKQLLTKGTRHNDLYLLEN 408
Query: 349 ---AQCVRSSQDITAAHVTKITTNYDIWHARLGHSHSGTIRQLHQEHKIIVSNINDKPFC 405
C S Q T+ ++WH RLGH + ++QL + I++S + C
Sbjct: 409 PKFMACYSSRQQATSD---------EVWHMRLGHPNQDVLQQLLRNKAIVISKTSHS-LC 458
Query: 406 KSCELGKRRCLPFTSSSSTTSHPLYMIHCDIWGPTPIQTPHGARYYILFLDDYSKYAWIY 465
+C++GK LPF SS +S L +HCD+WGP P+ + G RYY++F+D+YS++ W Y
Sbjct: 459 DACQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQGFRYYVIFIDNYSRFTWFY 518
Query: 466 AMKVRSDSIKCFTHFKATTENLLKEKIVYFQSDGAPELQKGDFRAFLDKHGITFRSSCPY 525
++++SD F F+ EN ++KI FQ DG E F + L + GI SCPY
Sbjct: 519 PLRLKSDFFSVFLTFQKMVENQCQQKIASFQCDGGGEFISNQFVSHLAECGIRQLISCPY 578
Query: 526 TPQQNGKAERKHRHITELGNTLSFHCSLPKKLWFDAFSTAVFIINRLPTPTLQGL-TPFE 584
TPQQNG AERKHRHITELG+++ F +P+ LW +AF T+ F+ N LP+ L+ +P+E
Sbjct: 579 TPQQNGIAERKHRHITELGSSMMFQGKVPQFLWVEAFYTSNFLCNLLPSSVLKDQKSPYE 638
Query: 585 ILFHIPPDYTNLRIFGCACYPHLDP 609
+L P YT+LR+FGCACYP+L P
Sbjct: 639 VLMGKAPVYTSLRVFGCACYPNLRP 663