BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000071.1_g0190.1
(1304 letters)
Database: Araport11_genes.201606.pep
48,359 sequences; 20,855,782 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT4G29090.1 | Ribonuclease H-like superfamily protein | Chr4:143...  198  1e-53
AT1G43760.1 | DNAse I-like superfamily protein | Chr1:16528880-1...  159  5e-40
AT3G24255.5 | RNA-directed DNA polymerase (reverse transcriptase...  134  2e-32
AT3G24255.4 | RNA-directed DNA polymerase (reverse transcriptase...  134  2e-32
AT3G24255.7 | RNA-directed DNA polymerase (reverse transcriptase...  128  1e-29
>AT4G29090.1 | Ribonuclease H-like superfamily protein |
Chr4:14333528-14335255 FORWARD LENGTH=575 | 201606
Length = 575
Score = 198 bits (504), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 160/580 (27%), Positives = 261/580 (45%), Gaps = 31/580 (5%)
Query: 738 SIPIYFLSTNLVPKGIINKIEKCQRNFWWGHNLEKSKMHYINWERLQATQKDGGLGVRNL 797
++P Y ++ L+PK + +I +FWW + E MH+ W+ L + +GG+G +++
Sbjct: 2 ALPTYTMACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDI 61
Query: 798 GIVNQALVGKLVWRFQKERDALWVRLLTAKYLKHIDFWKFQPKPSTLSTSTWKSMLKLRE 857
N AL+GK +WR ++L ++ ++Y D + + WKS+ +E
Sbjct: 62 EAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSD--PLNAPLGSRPSFVWKSIHASQE 119
Query: 858 AIQKGMCYVVGNGKTIRIWEDPWV---PNLRGFKPGR--RWEDQTETGPIYVSELIDIDG 912
+++G VVGNG+ I IW W+ P + R E + + + VS+LID G
Sbjct: 120 ILRQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESG 179
Query: 913 -SWKKRFIEEMFDPLATSAILMINLPGRDIDDKLVWIHTEKGSFTTKSLY---TSIVKDM 968
W+K IE +F + I + GR I D W +T G +T KS Y T I+
Sbjct: 180 REWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKR 239
Query: 969 PTTFRILELPFGLTWKKIWSRLNLAPRIKTFVWRVLHNALPTKKKSLKFNPSASKLCEFC 1028
+ + E ++KIW + +P+I+ F+W+ L N+LP + S C C
Sbjct: 240 SSPQEVSEPSLNPIYQKIW-KSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACIRC 298
Query: 1029 KEEEEDVDHIFRKCRMARETWICPPLELRLEGE------RNFHDILCNWLAGKEDDKVIA 1082
+E V+H+ KC AR TW + + L GE N + + + +K
Sbjct: 299 PSCKETVNHLLFKCTFARLTWAISSIPIPLGGEWADSIYVNLYWVFNLGNGNPQWEKASQ 358
Query: 1083 LRICVLWYLWKARNKLVFGEQPHGSRQIINAAMELNDEFEVQ-------NIPDNNSPGEG 1135
L +LW LWK RN+LVF + +++++ A + +E+ ++ P N G
Sbjct: 359 LVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRSSCG 418
Query: 1136 KSWTPPGPNELKINFDAGYV--DRRADIAAVCRNSKGEFV-MALVHSTSATSALEAEAKA 1192
+ W PP +K N DA + + R I V RN KGE M S LEAE +A
Sbjct: 419 R-WRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLKSVLEAELEA 477
Query: 1193 ALLAMKIATDLHQYICIIEGDSINVVKACSSSEEDIPWKIRSTIIDIKNSKRNLTECYFS 1252
A+ + I E DS +++ ++ E I ++ TI D++ TE F
Sbjct: 478 MRWAVLSLSRFQYNYVIFESDSQVLIEILNNDE--IWPSLKPTIQDLQRLLSQFTEVKFV 535
Query: 1253 FCPRNANKVAHLLAHSNRRNNLNTPSEMVIVPECIVSQLE 1292
F PR N +A +A + P IVP S ++
Sbjct: 536 FIPREGNTLAERVARESLSFLNYDPKLYSIVPSWARSSMD 575
>AT1G43760.1 | DNAse I-like superfamily protein |
Chr1:16528880-16531065 REVERSE LENGTH=626 | 201606
Length = 626
Score = 159 bits (401), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 113/401 (28%), Positives = 192/401 (47%), Gaps = 20/401 (4%)
Query: 75 MGDFNAIMTDMEKWG--GSGMNRRSAEQFRSMISNCELIDIGYSGPAYTWVNGREVNSHI 132
+GDF+ I + + + + R E+F++ + + +L+DI G YTW N ++ N I
Sbjct: 224 VGDFDQIAATSDHYSVLQTSIPMRGLEEFQNCLRDSDLVDIPSRGVHYTWSNHQDDNPII 283
Query: 133 RERLDRVLANPLWKIQFPDSLVKHLPRYNSDHAP--IILNTSKPKIKGCMPFRFEAHWTV 190
R +LDR +AN W FP ++ SDH+P IIL + K C FR+ + +
Sbjct: 284 R-KLDRAIANGDWFSSFPSAIAVFELSGVSDHSPCIIILENLPKRSKKC--FRYFSFLST 340
Query: 191 HDEFDNMMMETW------GAVRGGFIQKLPQLAKKLKKWSREKVGQLFNKIKEAEQELLL 244
H F + W G+ + L K K +R+ G + +K KEA L
Sbjct: 341 HPTFLVSLTVAWEEQIPVGSHMFSLGEHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLES 400
Query: 245 IQNQ---PPSQAAIHDEVRVVQRLKDLRKMEEIYWLQRAKRHWVQDIDRNTRFFHLSVLN 301
IQ+Q PS + E ++ E ++ Q+++ W+QD D NTRFFH +L
Sbjct: 401 IQSQLLTNPSDSLFRVEHVARKKWNFFAAALESFYRQKSRIKWLQDGDANTRFFHKVILA 460
Query: 302 RRRKNNILTVKLDDHSWSDDPTKITNVFLEHFFRASRAEPISDFPEALTMTD--RPLLAT 359
+ KN I +++DD ++ T++ + + ++ ++ P+++ P
Sbjct: 461 NQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLGSDSDILTPDSVQRIKDIHPFRCN 520
Query: 360 ENMA--ISIPPSKEEIWQTIKEMKTCKAPGPDGFSPIFYKKCWGVIGDEVTTQIRTIFES 417
+ +A +S PS +EI + M KAPGPD F+ F+ + W V+ D ++ F +
Sbjct: 521 DTLASRLSALPSDKEITAAVFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRT 580
Query: 418 GQLPGNLNHTNIALIPKKQNPERPIDYRPIALCNVLYRVIT 458
G L N T I LIPK ++ +RP++ C V+Y++IT
Sbjct: 581 GHLLKRFNATAITLIPKVTGVDQLSMFRPVSCCTVVYKIIT 621
>AT3G24255.5 | RNA-directed DNA polymerase (reverse
transcriptase)-related family protein |
Chr3:8789309-8790907 FORWARD LENGTH=532 | 201606
Length = 532
Score = 134 bits (337), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 132/523 (25%), Positives = 207/523 (39%), Gaps = 49/523 (9%)
Query: 611 YAPSVSHLMFADDLFFFGKASTENISSLKNILDIYANCSGQKINYTKSAIHLSANCDAGK 670
Y ++HL FADDL F +I + I +A+ SG +I+ KS I+++ D K
Sbjct: 4 YWMLITHLCFADDLLVFTDGKKSSIEGILQIFGKFADFSGLQISLEKSTIYMAGVKDNDK 63
Query: 671 REMILQTLGVKEMESEDVYLGNFLLKPKHKISSYDFILQKVEKKLTGWKRSSLSHAGRTI 730
+ IL + YLG LL K S Y +++K+ ++ W LS AGR
Sbjct: 64 AD-ILHSFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQ 122
Query: 731 LLKSELSSIPIYFLSTNLVPKGIINKIEKCQRNFWWGHNLEKSKMHYINWERLQATQKDG 790
L+ S + S+ +++S +P I +I+ +F W +K + W + + +G
Sbjct: 123 LISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEG 182
Query: 791 GLGVRNLGIVNQALVGKLVWRFQKERDALWVRLLTAKYLKHIDFWKFQPKPSTLSTSTWK 850
GLG+R+L N+ + KL+WR +LWV+ L L+ FW +TL + WK
Sbjct: 183 GLGIRSLKEANKVSLLKLIWRMLSS-TSLWVQWLRLYLLRKGSFWSISGN-TTLGSWMWK 240
Query: 851 SMLKLREAIQKGMCYVVGNGKTIRIWEDPWVPNLRGFKPGRRWEDQTETGPIYVSELIDI 910
+LK R + + + NG W D W K GR LID+
Sbjct: 241 KILKHRALASGFVKHDIHNGSNTSFWFDNWS------KIGR---------------LIDV 279
Query: 911 DGSWKKRFIEEMFDPLATSAILMIN-LPGRDIDDKLVWIH----------TEKGSFTTK- 958
G + I+ A+ A ++N P R D L+ I G T +
Sbjct: 280 TG--HRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDVIAEVRHQGLTSGEDTVRW 337
Query: 959 ----SLYTSIVKDMPTTFRILELPFGLTW-KKIWSRLNLAPRIKTFVWRVLHNALPTKKK 1013
++ T E + W K +W + P+ W + N L T +
Sbjct: 338 KGNGDIFKPCFNTKETWAATREPKLKVNWYKGVWFS-HATPKYSVLAWIAIKNRLTTGDR 396
Query: 1014 SLKFNPSASKLCEFCKEEEEDVDHIFRKCRMARETWICPPLELRLEGERNFHDILCNWLA 1073
L +N A C C E DH+F C + E W +L + N + + L
Sbjct: 397 MLSWNAGADSSCVLCHHLVETRDHLFFTCPYSAEVWSTLTRKLLSQHFTNRWEAILKLLT 456
Query: 1074 GKEDDKVIALRI-----CVLWYLWKARNKLVFGEQPHGSRQII 1111
K + L LWK RN GE P + Q++
Sbjct: 457 NKSLGHEVPFLTRYTFQLTLHSLWKERNGRRHGEVPQAAAQMV 499
>AT3G24255.4 | RNA-directed DNA polymerase (reverse
transcriptase)-related family protein |
Chr3:8789309-8790907 FORWARD LENGTH=532 | 201606
Length = 532
Score = 134 bits (337), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 132/523 (25%), Positives = 207/523 (39%), Gaps = 49/523 (9%)
Query: 611 YAPSVSHLMFADDLFFFGKASTENISSLKNILDIYANCSGQKINYTKSAIHLSANCDAGK 670
Y ++HL FADDL F +I + I +A+ SG +I+ KS I+++ D K
Sbjct: 4 YWMLITHLCFADDLLVFTDGKKSSIEGILQIFGKFADFSGLQISLEKSTIYMAGVKDNDK 63
Query: 671 REMILQTLGVKEMESEDVYLGNFLLKPKHKISSYDFILQKVEKKLTGWKRSSLSHAGRTI 730
+ IL + YLG LL K S Y +++K+ ++ W LS AGR
Sbjct: 64 AD-ILHSFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQ 122
Query: 731 LLKSELSSIPIYFLSTNLVPKGIINKIEKCQRNFWWGHNLEKSKMHYINWERLQATQKDG 790
L+ S + S+ +++S +P I +I+ +F W +K + W + + +G
Sbjct: 123 LISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEG 182
Query: 791 GLGVRNLGIVNQALVGKLVWRFQKERDALWVRLLTAKYLKHIDFWKFQPKPSTLSTSTWK 850
GLG+R+L N+ + KL+WR +LWV+ L L+ FW +TL + WK
Sbjct: 183 GLGIRSLKEANKVSLLKLIWRMLSS-TSLWVQWLRLYLLRKGSFWSISGN-TTLGSWMWK 240
Query: 851 SMLKLREAIQKGMCYVVGNGKTIRIWEDPWVPNLRGFKPGRRWEDQTETGPIYVSELIDI 910
+LK R + + + NG W D W K GR LID+
Sbjct: 241 KILKHRALASGFVKHDIHNGSNTSFWFDNWS------KIGR---------------LIDV 279
Query: 911 DGSWKKRFIEEMFDPLATSAILMIN-LPGRDIDDKLVWIH----------TEKGSFTTK- 958
G + I+ A+ A ++N P R D L+ I G T +
Sbjct: 280 TG--HRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDVIAEVRHQGLTSGEDTVRW 337
Query: 959 ----SLYTSIVKDMPTTFRILELPFGLTW-KKIWSRLNLAPRIKTFVWRVLHNALPTKKK 1013
++ T E + W K +W + P+ W + N L T +
Sbjct: 338 KGNGDIFKPCFNTKETWAATREPKLKVNWYKGVWFS-HATPKYSVLAWIAIKNRLTTGDR 396
Query: 1014 SLKFNPSASKLCEFCKEEEEDVDHIFRKCRMARETWICPPLELRLEGERNFHDILCNWLA 1073
L +N A C C E DH+F C + E W +L + N + + L
Sbjct: 397 MLSWNAGADSSCVLCHHLVETRDHLFFTCPYSAEVWSTLTRKLLSQHFTNRWEAILKLLT 456
Query: 1074 GKEDDKVIALRI-----CVLWYLWKARNKLVFGEQPHGSRQII 1111
K + L LWK RN GE P + Q++
Sbjct: 457 NKSLGHEVPFLTRYTFQLTLHSLWKERNGRRHGEVPQAAAQMV 499
>AT3G24255.7 | RNA-directed DNA polymerase (reverse
transcriptase)-related family protein |
Chr3:8789309-8793208 FORWARD LENGTH=828 | 201606
Length = 828
Score = 128 bits (321), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 130/531 (24%), Positives = 205/531 (38%), Gaps = 72/531 (13%)
Query: 615 VSHLMFADDLFFFGKASTENISSLKNILDIYANCSGQKINYTKSAIHLSANCDAGKREMI 674
++HL FADDL F +I + I +A+ SG +I+ KS I+++ D K + I
Sbjct: 8 ITHLCFADDLLVFTDGKKSSIEGILQIFGKFADFSGLQISLEKSTIYMAGVKDNDKAD-I 66
Query: 675 LQTLGVKEMESEDVYLGNFLLKPKHKISSYDFILQKVEKKLTGWKRSSLSHAGRTILLKS 734
L + YLG LL K S Y +++K+ ++ W LS AGR L+ S
Sbjct: 67 LHSFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISS 126
Query: 735 ELSSIPIYFLSTNLVPKGIINKIEKCQRNFWWGHNLEKSKMHYINWERLQATQKDGGLGV 794
+ S+ +++S +P I +I+ +F W +K + W + + +GGLG+
Sbjct: 127 VIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGI 186
Query: 795 RNLGIVNQALVGKLVWRFQKERDALWVRLLTAKYLKHIDFWKFQPKPSTLSTSTWKSMLK 854
R+L N+ + KL+WR +LWV+ L L+ FW +TL + WK +LK
Sbjct: 187 RSLKEANKVSLLKLIWRMLSST-SLWVQWLRLYLLRKGSFWSISGN-TTLGSWMWKKILK 244
Query: 855 LREAIQKGMCYVVGNGKTIRIWEDPWVPNLRGFKPGRRWEDQTETGPIYVSELIDIDGSW 914
R + + + NG W D W K GR LID+ G
Sbjct: 245 HRALASGFVKHDIHNGSNTSFWFDNWS------KIGR---------------LIDVTG-- 281
Query: 915 KKRFIEEMFDPLATSAILMIN-LPGRDIDDKLVWIHT----------EKGSFTTK----- 958
+ I+ A+ A ++N P R D L+ I G T +
Sbjct: 282 HRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDVIAEVRHQGLTSGEDTVRWKGNG 341
Query: 959 SLYTSIVKDMPTTFRILELPFGLTW-KKIWSRLNLAPRIKTFVWRVLHNALPTKKKSLKF 1017
++ T E + W K +W + P+ W + N L T + L +
Sbjct: 342 DIFKPCFNTKETWAATREPKLKVNWYKGVWFS-HATPKYSVLAWIAIKNRLTTGDRMLSW 400
Query: 1018 NPSASKLCEFCKEEEEDVDHIFRKCRMARETWICPPLELRLEGERNFHDILCNWLAGKED 1077
N A C C E DH+F C + E P R + H
Sbjct: 401 NAGADSSCVLCHHLVETRDHLFFTCPYSAEV----PFLTRYTFQLTLHS----------- 445
Query: 1078 DKVIALRICVLWYLWKARNKLVFGEQPHGSRQIINAAMELNDEFEVQNIPD 1128
LWK RN GE P + Q++ ++ + N+ D
Sbjct: 446 -------------LWKERNGRRHGEVPQAAAQMMKYESQIMVKLSDGNLTD 483