BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000118.1_g1320.1
(1435 letters)
Database: Araport11_genes.201606.pep
48,359 sequences; 20,855,782 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
AT4G23160.3 | cysteine-rich RECEPTOR-like kinase | Chr4:12129485...   400   e-119
AT4G23160.2 | cysteine-rich RECEPTOR-like kinase | Chr4:12129485...   400   e-119
ATMG00810.1 | DNA/RNA polymerases superfamily protein | ChrM:227...   218   6e-64
ATMG00820.1 | Reverse transcriptase (RNA-dependent DNA polymeras...   127   3e-33
AT1G34070.1 | Copia-like polyprotein/retrotransposon | Chr1:1240...    94   3e-20
>AT4G23160.3 | cysteine-rich RECEPTOR-like kinase |
Chr4:12129485-12133157 FORWARD LENGTH=1043 | 201606
Length = 1043
Score = 400 bits (1029), Expect = e-119, Method: Compositional matrix adjust.
Identities = 202/498 (40%), Positives = 299/498 (60%), Gaps = 7/498 (1%)
Query: 924 EEPSTFLQASKHEHWRAAMAEEINALHKNKTWKLVPKKSNMNLVDCKWVFRVKQNSDGSI 983
+EPST+ +A + W AM +EI A+ TW++ N + CKWV+++K NSDG+I
Sbjct: 84 KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTI 143
Query: 984 ARHKARLVARGFTQQQGIDYTETFSPVVRPATIRTILSFAVTGNWEIRQLDVKSAFLNGD 1043
R+KARLVA+G+TQQ+GID+ ETFSPV + +++ IL+ + N+ + QLD+ +AFLNGD
Sbjct: 144 ERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGD 203
Query: 1044 LQETVFMSQPKGFE----DQHHPDYVCQLQKAIYGLKQAPRAWHHRFSSFLFELGFNQSI 1099
L E ++M P G+ D P+ VC L+K+IYGLKQA R W +FS L GF QS
Sbjct: 204 LDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSH 263
Query: 1100 SDPSMFIIRSSSGITILLLYVDDIIVTGSSSENLQSLISKLKSQFDITDLGSLSYFLGME 1159
SD + F+ +++ +L+YVDDII+ ++ + L S+LKS F + DLG L YFLG+E
Sbjct: 264 SDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLE 323
Query: 1160 AIRESSSLLLTQQKYTTDLITKFGLLHSKPVSTPSITGKKLSKLDGEPLSNPQEYRSLVG 1219
R ++ + + Q+KY DL+ + GLL KP S P S G + + YR L+G
Sbjct: 324 IARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIG 383
Query: 1220 ALQYLTLTRPDIAYSVNQVSKFRHEPTTIHWKAAKRILRFLKGSITTGIILRSSLSLPLL 1279
L YL +TR DI+++VN++S+F P H +A +IL ++KG++ G+ S + L
Sbjct: 384 RLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQ 443
Query: 1280 GFSDADWAGSPDDRRSVTGSCIFLGPNLIMWTSKTQPTVARSSTEAEYRAVAHTAADIIW 1339
FSDA + D RRS G C+FLG +LI W SK Q V++SS EAEYRA++ +++W
Sbjct: 444 VFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMW 503
Query: 1340 LKNLLGELGYVSSNSPIIYCDNLSTTYLAVNPILHSKTRHSAIDFHFVREQ-VNDGKLRV 1398
L EL S +++CDN + ++A N + H +T+H D H VRE+ V L
Sbjct: 504 LAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVYQATLSY 563
Query: 1399 QYLPSESQLADIFTKGLS 1416
+ + Q D FT+ LS
Sbjct: 564 SFQAYDEQ--DGFTEYLS 579
>AT4G23160.2 | cysteine-rich RECEPTOR-like kinase |
Chr4:12129485-12133157 FORWARD LENGTH=1043 | 201606
Length = 1043
Score = 400 bits (1029), Expect = e-119, Method: Compositional matrix adjust.
Identities = 202/498 (40%), Positives = 299/498 (60%), Gaps = 7/498 (1%)
Query: 924 EEPSTFLQASKHEHWRAAMAEEINALHKNKTWKLVPKKSNMNLVDCKWVFRVKQNSDGSI 983
+EPST+ +A + W AM +EI A+ TW++ N + CKWV+++K NSDG+I
Sbjct: 84 KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTI 143
Query: 984 ARHKARLVARGFTQQQGIDYTETFSPVVRPATIRTILSFAVTGNWEIRQLDVKSAFLNGD 1043
R+KARLVA+G+TQQ+GID+ ETFSPV + +++ IL+ + N+ + QLD+ +AFLNGD
Sbjct: 144 ERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGD 203
Query: 1044 LQETVFMSQPKGFE----DQHHPDYVCQLQKAIYGLKQAPRAWHHRFSSFLFELGFNQSI 1099
L E ++M P G+ D P+ VC L+K+IYGLKQA R W +FS L GF QS
Sbjct: 204 LDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSH 263
Query: 1100 SDPSMFIIRSSSGITILLLYVDDIIVTGSSSENLQSLISKLKSQFDITDLGSLSYFLGME 1159
SD + F+ +++ +L+YVDDII+ ++ + L S+LKS F + DLG L YFLG+E
Sbjct: 264 SDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLE 323
Query: 1160 AIRESSSLLLTQQKYTTDLITKFGLLHSKPVSTPSITGKKLSKLDGEPLSNPQEYRSLVG 1219
R ++ + + Q+KY DL+ + GLL KP S P S G + + YR L+G
Sbjct: 324 IARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIG 383
Query: 1220 ALQYLTLTRPDIAYSVNQVSKFRHEPTTIHWKAAKRILRFLKGSITTGIILRSSLSLPLL 1279
L YL +TR DI+++VN++S+F P H +A +IL ++KG++ G+ S + L
Sbjct: 384 RLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQ 443
Query: 1280 GFSDADWAGSPDDRRSVTGSCIFLGPNLIMWTSKTQPTVARSSTEAEYRAVAHTAADIIW 1339
FSDA + D RRS G C+FLG +LI W SK Q V++SS EAEYRA++ +++W
Sbjct: 444 VFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMW 503
Query: 1340 LKNLLGELGYVSSNSPIIYCDNLSTTYLAVNPILHSKTRHSAIDFHFVREQ-VNDGKLRV 1398
L EL S +++CDN + ++A N + H +T+H D H VRE+ V L
Sbjct: 504 LAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSVYQATLSY 563
Query: 1399 QYLPSESQLADIFTKGLS 1416
+ + Q D FT+ LS
Sbjct: 564 SFQAYDEQ--DGFTEYLS 579
>ATMG00810.1 | DNA/RNA polymerases superfamily protein |
ChrM:227709-228431 REVERSE LENGTH=240 | 201606
Length = 240
Score = 218 bits (554), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 111/224 (49%), Positives = 150/224 (66%), Gaps = 1/224 (0%)
Query: 1116 LLLYVDDIIVTGSSSENLQSLISKLKSQFDITDLGSLSYFLGMEAIRESSSLLLTQQKYT 1175
LLLYVDDI++TGSS+ L LI +L S F + DLG + YFLG++ S L L+Q KY
Sbjct: 3 LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62
Query: 1176 TDLITKFGLLHSKPVSTPSITGKKLSKLDGEPLSNPQEYRSLVGALQYLTLTRPDIAYSV 1235
++ G+L KP+STP + K S + +P ++RS+VGALQYLTLTRPDI+Y+V
Sbjct: 63 EQILNNAGMLDCKPMSTP-LPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAV 121
Query: 1236 NQVSKFRHEPTTIHWKAAKRILRFLKGSITTGIILRSSLSLPLLGFSDADWAGSPDDRRS 1295
N V + HEPT + KR+LR++KG+I G+ + + L + F D+DWAG RRS
Sbjct: 122 NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 181
Query: 1296 VTGSCIFLGPNLIMWTSKTQPTVARSSTEAEYRAVAHTAADIIW 1339
TG C FLG N+I W++K QPTV+RSSTE EYRA+A TAA++ W
Sbjct: 182 TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225
>ATMG00820.1 | Reverse transcriptase (RNA-dependent DNA polymerase) |
ChrM:228573-229085 REVERSE LENGTH=170 | 201606
Length = 170
Score = 127 bits (318), Expect = 3e-33, Method: Composition-based stats.
Identities = 64/132 (48%), Positives = 88/132 (66%), Gaps = 7/132 (5%)
Query: 892 MITRSRDGTRKAKVLFTDSLSKFPLLENSSLLEEPSTFLQASKHEHWRAAMAEEINALHK 951
M+TRS+ G K K+ L +++ +EP + + A K W AM EE++AL +
Sbjct: 1 MLTRSKAGINKLN-------PKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSR 53
Query: 952 NKTWKLVPKKSNMNLVDCKWVFRVKQNSDGSIARHKARLVARGFTQQQGIDYTETFSPVV 1011
NKTW LVP N N++ CKWVF+ K +SDG++ R KARLVA+GF Q++GI + ET+SPVV
Sbjct: 54 NKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVV 113
Query: 1012 RPATIRTILSFA 1023
R ATIRTIL+ A
Sbjct: 114 RTATIRTILNVA 125
>AT1G34070.1 | Copia-like polyprotein/retrotransposon |
Chr1:12402283-12403209 FORWARD LENGTH=308 | 201606
Length = 308
Score = 94.0 bits (232), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/267 (29%), Positives = 121/267 (45%), Gaps = 30/267 (11%)
Query: 35 SLTNISNLISIRLD--ASNYLLWRSQLLPILHSQDLYKFVNGSFPPPNEFLPASAEGSSE 92
++NI + I + LD SNY WR L S D+ ++G+ LP +A
Sbjct: 12 GVSNIKSHIPVMLDIEESNYDAWRELFLTHCLSFDVMGHIDGTL------LPTNAND--- 62
Query: 93 LTPNPDFVYWYRVDQMLLSWINATLTEPILLQVLGLTSS--RAVWESLEHTFSSLNSARL 150
V W + D ++ + TLT P Q +TSS R +W +++ F + AR
Sbjct: 63 -------VNWQKRDGIVKLSLYGTLT-PKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARA 114
Query: 151 MNLKLQLQTTKKGSLSVSEYLLRLKTLSDSLAAIGHPVADDELVLITLSGLGPSYEGFVT 210
+ L +L+T G + V++Y ++K L+DSL + PV D LV+ L+GL P ++ +
Sbjct: 115 LRLDSELRTKDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIIN 174
Query: 211 SMTTRLTPPTSSELHSHLLTHECRLQLLQPPTP-----QPSALFTQMSSPPPRFSNNFSR 265
+ R P+ + + L E RL+ P P S+ S PP + S
Sbjct: 175 VIKHRQPFPSFDDAATMLQEEEDRLKRAIKPNPTHVDHSSSSTVLACSEAPPVTNFQRSG 234
Query: 266 GGRGSYSQRNRGGH----RGGRGSYSN 288
G + Y R RG + RGGR SY N
Sbjct: 235 GNQMGYRGRGRGNNIFRGRGGRFSYYN 261