BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000066.1_g1780.1
(696 letters)
Database: Araport11_genes.201606.pep
48,359 sequences; 20,855,782 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G39950.1 | flocculation protein | Chr2:16676758-16680204 REVE... 347 e-110
AT2G39950.9 | flocculation protein | Chr2:16676758-16679961 REVE... 332 e-105
AT2G39950.2 | flocculation protein | Chr2:16676758-16679961 REVE... 332 e-105
AT2G39950.8 | flocculation protein | Chr2:16676758-16678262 REVE... 213 2e-61
AT2G39950.7 | flocculation protein | Chr2:16676758-16678262 REVE... 213 2e-61
>AT2G39950.1 | flocculation protein | Chr2:16676758-16680204 REVERSE
LENGTH=636 | 201606
Length = 636
Score = 347 bits (891), Expect = e-110, Method: Compositional matrix adjust.
Identities = 243/592 (41%), Positives = 335/592 (56%), Gaps = 63/592 (10%)
Query: 86 QWLQALDLQLIGACRADERMKPLFKLNVSSGVAEDRLLAQLSQHFDAAEVGILGRCLFVP 145
+WLQALD+Q++GACR DER+KPL KLNVS+G+AEDRLLA LSQHF+ AE+G+L RC +P
Sbjct: 75 RWLQALDMQVMGACRGDERLKPLLKLNVSNGMAEDRLLAHLSQHFEPAEIGMLARCFCIP 134
Query: 146 LVSIRVGKVIKRGSLLCPTAERGNLNLTMLPSSNLAISFVGDDGCTEKLAMLRSKSESSA 205
LVS+RVGK+IK G L+ PT RGNL+L +LP+S+L +SF+GD+G +E+L SKS+ SA
Sbjct: 135 LVSVRVGKIIKEGILMRPTPIRGNLSLMVLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSA 194
Query: 206 VVIEDIQSDSSGRSFRLQVPDGQVSYFWCSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGI 265
V IE+I DSSGRSF +++ +G Y+WCSEKS+ G +L K+K L++KKPS+++L+GI
Sbjct: 195 VSIEEITVDSSGRSFVIRIANGNAFYYWCSEKSKLLGTELRRKMKDLIKKKPSISELTGI 254
Query: 266 SESRLDCFAIHLHAYLHGSTSGTKASTPDDSPSLLGSGLENSALQLDSSNHLSSSLSEPS 325
ESRL A HL YL GS P++ G + + S ++ S +
Sbjct: 255 EESRLGSVASHLRLYLMGSV----------VPNIKGCQVPSPDSSSSSGFSETADSSSSA 304
Query: 326 DSKG-SCSVVVSPYSTSQGSLCPSLNSLADFVHGNEFMERFGQDRDS--SICSLSVIDNL 382
SK + + +QGSL P +S + N + +D+ S S+ DN
Sbjct: 305 SSKSLRARHCGTQQTKTQGSLSPRASSFKENTLRNASLRISSRDKSKGHSEGHFSIFDNS 364
Query: 383 PVVSTSTVDVVNLSQAETKLAEAVLPIPILPRGFLEAPEKSEFLPSVHLFPSSQVVPTGP 442
+ S T +V Q+E ++ EA + + + A E++E PS P + GP
Sbjct: 365 SITSIPT-NVEGFIQSEGEVEEATENYNGIRQ--IIAFEEAESTPSTMTGPPPFPLKMGP 421
Query: 443 LLFSPYYNWAPPSTSTLQYTVTPPHLPTTLSDSLSLPPLS---------SLLSAVRSSSP 493
+FSPYY W PP+TS+L H P S S PPLS S L S
Sbjct: 422 PVFSPYYCWCPPTTSSL-------HAP---SASYQFPPLSIELPSLPPLSSLLPASGSDG 471
Query: 494 SIPPRQSLDLTNFPPLDLPSFMPDPLVMPVSSFLGGSSSQQIPTFTPFICDPIVHIPVMD 553
+ P LDL++ PPL L +P P SSS Q P +CDPIVHIPV+D
Sbjct: 472 FLIPSSPLDLSDIPPLPLVHHIPIPGSS--------SSSSQQQMMIPIMCDPIVHIPVID 523
Query: 554 VCSSG-GYLVSAGPT--VSTTITPLHPNPLITQNESVLEKNARETLRLLLASAHQVPFST 610
+ SSG YLVSAGPT +ST I PL +N+S++EK ARETLRLL++ A+ +
Sbjct: 524 IFSSGQSYLVSAGPTGIISTGIPPLP-----VENDSLVEKGARETLRLLISGANATTSTP 578
Query: 611 SEEQPGAAYIGGSRGLYTVTRDIGLSPNSISPFSMVSF-PPCLVSTGAAGKE 661
GSRGLY+V+RD+ + +S F+ + P V G G E
Sbjct: 579 LNHH-------GSRGLYSVSRDV----SGVSLFAPIGLQQPSSVEGGDGGGE 619
>AT2G39950.9 | flocculation protein | Chr2:16676758-16679961 REVERSE
LENGTH=555 | 201606
Length = 555
Score = 332 bits (852), Expect = e-105, Method: Compositional matrix adjust.
Identities = 233/555 (41%), Positives = 318/555 (57%), Gaps = 56/555 (10%)
Query: 93 LQLIGACRADERMKPLFKLNVSSGVAEDRLLAQLSQHFDAAEVGILGRCLFVPLVSIRVG 152
+Q++GACR DER+KPL KLNVS+G+AEDRLLA LSQHF+ AE+G+L RC +PLVS+RVG
Sbjct: 1 MQVMGACRGDERLKPLLKLNVSNGMAEDRLLAHLSQHFEPAEIGMLARCFCIPLVSVRVG 60
Query: 153 KVIKRGSLLCPTAERGNLNLTMLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQ 212
K+IK G L+ PT RGNL+L +LP+S+L +SF+GD+G +E+L SKS+ SAV IE+I
Sbjct: 61 KIIKEGILMRPTPIRGNLSLMVLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEIT 120
Query: 213 SDSSGRSFRLQVPDGQVSYFWCSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDC 272
DSSGRSF +++ +G Y+WCSEKS+ G +L K+K L++KKPS+++L+GI ESRL
Sbjct: 121 VDSSGRSFVIRIANGNAFYYWCSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGS 180
Query: 273 FAIHLHAYLHGSTSGTKASTPDDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCS 332
A HL YL GS P+ + S +S+ + SSS S S C
Sbjct: 181 VASHLRLYLMGSV------VPNIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG 234
Query: 333 VVVSPYSTSQGSLCPSLNSLADFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTV 390
+ + +QGSL P +S + N + +D+ S S+ DN + S T
Sbjct: 235 ---TQQTKTQGSLSPRASSFKENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT- 290
Query: 391 DVVNLSQAETKLAEAVLPIPILPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYN 450
+V Q+E ++ EA + + + A E++E PS P + GP +FSPYY
Sbjct: 291 NVEGFIQSEGEVEEATENYNGIRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYC 348
Query: 451 WAPPSTSTLQYTVTPPHLPTTLSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSL 501
W PP+TS+L H P S S PPLS S L S + P L
Sbjct: 349 WCPPTTSSL-------HAP---SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPL 398
Query: 502 DLTNFPPLDLPSFMPDPLVMPVSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GY 560
DL++ PPL L +P P SSS Q P +CDPIVHIPV+D+ SSG Y
Sbjct: 399 DLSDIPPLPLVHHIPIPGSS--------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSY 450
Query: 561 LVSAGPT--VSTTITPLHPNPLITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAA 618
LVSAGPT +ST I PL +N+S++EK ARETLRLL++ A+ +
Sbjct: 451 LVSAGPTGIISTGIPPLP-----VENDSLVEKGARETLRLLISGANATTSTPLNHH---- 501
Query: 619 YIGGSRGLYTVTRDI 633
GSRGLY+V+RD+
Sbjct: 502 ---GSRGLYSVSRDV 513
>AT2G39950.2 | flocculation protein | Chr2:16676758-16679961 REVERSE
LENGTH=555 | 201606
Length = 555
Score = 332 bits (852), Expect = e-105, Method: Compositional matrix adjust.
Identities = 233/555 (41%), Positives = 318/555 (57%), Gaps = 56/555 (10%)
Query: 93 LQLIGACRADERMKPLFKLNVSSGVAEDRLLAQLSQHFDAAEVGILGRCLFVPLVSIRVG 152
+Q++GACR DER+KPL KLNVS+G+AEDRLLA LSQHF+ AE+G+L RC +PLVS+RVG
Sbjct: 1 MQVMGACRGDERLKPLLKLNVSNGMAEDRLLAHLSQHFEPAEIGMLARCFCIPLVSVRVG 60
Query: 153 KVIKRGSLLCPTAERGNLNLTMLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQ 212
K+IK G L+ PT RGNL+L +LP+S+L +SF+GD+G +E+L SKS+ SAV IE+I
Sbjct: 61 KIIKEGILMRPTPIRGNLSLMVLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEIT 120
Query: 213 SDSSGRSFRLQVPDGQVSYFWCSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDC 272
DSSGRSF +++ +G Y+WCSEKS+ G +L K+K L++KKPS+++L+GI ESRL
Sbjct: 121 VDSSGRSFVIRIANGNAFYYWCSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGS 180
Query: 273 FAIHLHAYLHGSTSGTKASTPDDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCS 332
A HL YL GS P+ + S +S+ + SSS S S C
Sbjct: 181 VASHLRLYLMGSV------VPNIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG 234
Query: 333 VVVSPYSTSQGSLCPSLNSLADFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTV 390
+ + +QGSL P +S + N + +D+ S S+ DN + S T
Sbjct: 235 ---TQQTKTQGSLSPRASSFKENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT- 290
Query: 391 DVVNLSQAETKLAEAVLPIPILPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYN 450
+V Q+E ++ EA + + + A E++E PS P + GP +FSPYY
Sbjct: 291 NVEGFIQSEGEVEEATENYNGIRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYC 348
Query: 451 WAPPSTSTLQYTVTPPHLPTTLSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSL 501
W PP+TS+L H P S S PPLS S L S + P L
Sbjct: 349 WCPPTTSSL-------HAP---SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPL 398
Query: 502 DLTNFPPLDLPSFMPDPLVMPVSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GY 560
DL++ PPL L +P P SSS Q P +CDPIVHIPV+D+ SSG Y
Sbjct: 399 DLSDIPPLPLVHHIPIPGSS--------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSY 450
Query: 561 LVSAGPT--VSTTITPLHPNPLITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAA 618
LVSAGPT +ST I PL +N+S++EK ARETLRLL++ A+ +
Sbjct: 451 LVSAGPTGIISTGIPPLP-----VENDSLVEKGARETLRLLISGANATTSTPLNHH---- 501
Query: 619 YIGGSRGLYTVTRDI 633
GSRGLY+V+RD+
Sbjct: 502 ---GSRGLYSVSRDV 513
>AT2G39950.8 | flocculation protein | Chr2:16676758-16678262 REVERSE
LENGTH=475 | 201606
Length = 475
Score = 213 bits (542), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 179/474 (37%), Positives = 250/474 (52%), Gaps = 56/474 (11%)
Query: 174 MLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQSDSSGRSFRLQVPDGQVSYFW 233
+LP+S+L +SF+GD+G +E+L SKS+ SAV IE+I DSSGRSF +++ +G Y+W
Sbjct: 2 VLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEITVDSSGRSFVIRIANGNAFYYW 61
Query: 234 CSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDCFAIHLHAYLHGSTSGTKASTP 293
CSEKS+ G +L K+K L++KKPS+++L+GI ESRL A HL YL GS P
Sbjct: 62 CSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGSVASHLRLYLMGSV------VP 115
Query: 294 DDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCSVVVSPYSTSQGSLCPSLNSLA 353
+ + S +S+ + SSS S S C + + +QGSL P +S
Sbjct: 116 NIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG---TQQTKTQGSLSPRASSFK 172
Query: 354 DFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTVDVVNLSQAETKLAEAVLPIPI 411
+ N + +D+ S S+ DN + S T +V Q+E ++ EA
Sbjct: 173 ENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT-NVEGFIQSEGEVEEATENYNG 231
Query: 412 LPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYNWAPPSTSTLQYTVTPPHLPTT 471
+ + + A E++E PS P + GP +FSPYY W PP+TS+L H P
Sbjct: 232 IRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYCWCPPTTSSL-------HAP-- 280
Query: 472 LSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSLDLTNFPPLDLPSFMPDPLVMP 522
S S PPLS S L S + P LDL++ PPL L +P P
Sbjct: 281 -SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPLDLSDIPPLPLVHHIPIPGSS- 338
Query: 523 VSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GYLVSAGPT--VSTTITPLHPNP 579
SSS Q P +CDPIVHIPV+D+ SSG YLVSAGPT +ST I PL
Sbjct: 339 -------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSYLVSAGPTGIISTGIPPLP--- 388
Query: 580 LITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAAYIGGSRGLYTVTRDI 633
+N+S++EK ARETLRLL++ A+ + GSRGLY+V+RD+
Sbjct: 389 --VENDSLVEKGARETLRLLISGANATTSTPLNHH-------GSRGLYSVSRDV 433
>AT2G39950.7 | flocculation protein | Chr2:16676758-16678262 REVERSE
LENGTH=475 | 201606
Length = 475
Score = 213 bits (542), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 179/474 (37%), Positives = 250/474 (52%), Gaps = 56/474 (11%)
Query: 174 MLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQSDSSGRSFRLQVPDGQVSYFW 233
+LP+S+L +SF+GD+G +E+L SKS+ SAV IE+I DSSGRSF +++ +G Y+W
Sbjct: 2 VLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEITVDSSGRSFVIRIANGNAFYYW 61
Query: 234 CSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDCFAIHLHAYLHGSTSGTKASTP 293
CSEKS+ G +L K+K L++KKPS+++L+GI ESRL A HL YL GS P
Sbjct: 62 CSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGSVASHLRLYLMGSV------VP 115
Query: 294 DDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCSVVVSPYSTSQGSLCPSLNSLA 353
+ + S +S+ + SSS S S C + + +QGSL P +S
Sbjct: 116 NIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG---TQQTKTQGSLSPRASSFK 172
Query: 354 DFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTVDVVNLSQAETKLAEAVLPIPI 411
+ N + +D+ S S+ DN + S T +V Q+E ++ EA
Sbjct: 173 ENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT-NVEGFIQSEGEVEEATENYNG 231
Query: 412 LPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYNWAPPSTSTLQYTVTPPHLPTT 471
+ + + A E++E PS P + GP +FSPYY W PP+TS+L H P
Sbjct: 232 IRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYCWCPPTTSSL-------HAP-- 280
Query: 472 LSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSLDLTNFPPLDLPSFMPDPLVMP 522
S S PPLS S L S + P LDL++ PPL L +P P
Sbjct: 281 -SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPLDLSDIPPLPLVHHIPIPGSS- 338
Query: 523 VSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GYLVSAGPT--VSTTITPLHPNP 579
SSS Q P +CDPIVHIPV+D+ SSG YLVSAGPT +ST I PL
Sbjct: 339 -------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSYLVSAGPTGIISTGIPPLP--- 388
Query: 580 LITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAAYIGGSRGLYTVTRDI 633
+N+S++EK ARETLRLL++ A+ + GSRGLY+V+RD+
Sbjct: 389 --VENDSLVEKGARETLRLLISGANATTSTPLNHH-------GSRGLYSVSRDV 433