BLAST Result
BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Eca_sc000066.1_g1780.1
         (696 letters)

Database: Araport11_genes.201606.pep 
           48,359 sequences; 20,855,782 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G39950.1 | flocculation protein | Chr2:16676758-16680204 REVE...   347   e-110
AT2G39950.9 | flocculation protein | Chr2:16676758-16679961 REVE...   332   e-105
AT2G39950.2 | flocculation protein | Chr2:16676758-16679961 REVE...   332   e-105
AT2G39950.8 | flocculation protein | Chr2:16676758-16678262 REVE...   213   2e-61
AT2G39950.7 | flocculation protein | Chr2:16676758-16678262 REVE...   213   2e-61

>AT2G39950.1 | flocculation protein | Chr2:16676758-16680204 REVERSE
           LENGTH=636 | 201606
          Length = 636

 Score =  347 bits (891), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 243/592 (41%), Positives = 335/592 (56%), Gaps = 63/592 (10%)

Query: 86  QWLQALDLQLIGACRADERMKPLFKLNVSSGVAEDRLLAQLSQHFDAAEVGILGRCLFVP 145
           +WLQALD+Q++GACR DER+KPL KLNVS+G+AEDRLLA LSQHF+ AE+G+L RC  +P
Sbjct: 75  RWLQALDMQVMGACRGDERLKPLLKLNVSNGMAEDRLLAHLSQHFEPAEIGMLARCFCIP 134

Query: 146 LVSIRVGKVIKRGSLLCPTAERGNLNLTMLPSSNLAISFVGDDGCTEKLAMLRSKSESSA 205
           LVS+RVGK+IK G L+ PT  RGNL+L +LP+S+L +SF+GD+G +E+L    SKS+ SA
Sbjct: 135 LVSVRVGKIIKEGILMRPTPIRGNLSLMVLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSA 194

Query: 206 VVIEDIQSDSSGRSFRLQVPDGQVSYFWCSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGI 265
           V IE+I  DSSGRSF +++ +G   Y+WCSEKS+  G +L  K+K L++KKPS+++L+GI
Sbjct: 195 VSIEEITVDSSGRSFVIRIANGNAFYYWCSEKSKLLGTELRRKMKDLIKKKPSISELTGI 254

Query: 266 SESRLDCFAIHLHAYLHGSTSGTKASTPDDSPSLLGSGLENSALQLDSSNHLSSSLSEPS 325
            ESRL   A HL  YL GS            P++ G  + +      S    ++  S  +
Sbjct: 255 EESRLGSVASHLRLYLMGSV----------VPNIKGCQVPSPDSSSSSGFSETADSSSSA 304

Query: 326 DSKG-SCSVVVSPYSTSQGSLCPSLNSLADFVHGNEFMERFGQDRDS--SICSLSVIDNL 382
            SK        +  + +QGSL P  +S  +    N  +    +D+    S    S+ DN 
Sbjct: 305 SSKSLRARHCGTQQTKTQGSLSPRASSFKENTLRNASLRISSRDKSKGHSEGHFSIFDNS 364

Query: 383 PVVSTSTVDVVNLSQAETKLAEAVLPIPILPRGFLEAPEKSEFLPSVHLFPSSQVVPTGP 442
            + S  T +V    Q+E ++ EA      + +  + A E++E  PS    P    +  GP
Sbjct: 365 SITSIPT-NVEGFIQSEGEVEEATENYNGIRQ--IIAFEEAESTPSTMTGPPPFPLKMGP 421

Query: 443 LLFSPYYNWAPPSTSTLQYTVTPPHLPTTLSDSLSLPPLS---------SLLSAVRSSSP 493
            +FSPYY W PP+TS+L       H P   S S   PPLS         S L     S  
Sbjct: 422 PVFSPYYCWCPPTTSSL-------HAP---SASYQFPPLSIELPSLPPLSSLLPASGSDG 471

Query: 494 SIPPRQSLDLTNFPPLDLPSFMPDPLVMPVSSFLGGSSSQQIPTFTPFICDPIVHIPVMD 553
            + P   LDL++ PPL L   +P P           SSS Q     P +CDPIVHIPV+D
Sbjct: 472 FLIPSSPLDLSDIPPLPLVHHIPIPGSS--------SSSSQQQMMIPIMCDPIVHIPVID 523

Query: 554 VCSSG-GYLVSAGPT--VSTTITPLHPNPLITQNESVLEKNARETLRLLLASAHQVPFST 610
           + SSG  YLVSAGPT  +ST I PL       +N+S++EK ARETLRLL++ A+    + 
Sbjct: 524 IFSSGQSYLVSAGPTGIISTGIPPLP-----VENDSLVEKGARETLRLLISGANATTSTP 578

Query: 611 SEEQPGAAYIGGSRGLYTVTRDIGLSPNSISPFSMVSF-PPCLVSTGAAGKE 661
                      GSRGLY+V+RD+    + +S F+ +    P  V  G  G E
Sbjct: 579 LNHH-------GSRGLYSVSRDV----SGVSLFAPIGLQQPSSVEGGDGGGE 619


>AT2G39950.9 | flocculation protein | Chr2:16676758-16679961 REVERSE
           LENGTH=555 | 201606
          Length = 555

 Score =  332 bits (852), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 233/555 (41%), Positives = 318/555 (57%), Gaps = 56/555 (10%)

Query: 93  LQLIGACRADERMKPLFKLNVSSGVAEDRLLAQLSQHFDAAEVGILGRCLFVPLVSIRVG 152
           +Q++GACR DER+KPL KLNVS+G+AEDRLLA LSQHF+ AE+G+L RC  +PLVS+RVG
Sbjct: 1   MQVMGACRGDERLKPLLKLNVSNGMAEDRLLAHLSQHFEPAEIGMLARCFCIPLVSVRVG 60

Query: 153 KVIKRGSLLCPTAERGNLNLTMLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQ 212
           K+IK G L+ PT  RGNL+L +LP+S+L +SF+GD+G +E+L    SKS+ SAV IE+I 
Sbjct: 61  KIIKEGILMRPTPIRGNLSLMVLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEIT 120

Query: 213 SDSSGRSFRLQVPDGQVSYFWCSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDC 272
            DSSGRSF +++ +G   Y+WCSEKS+  G +L  K+K L++KKPS+++L+GI ESRL  
Sbjct: 121 VDSSGRSFVIRIANGNAFYYWCSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGS 180

Query: 273 FAIHLHAYLHGSTSGTKASTPDDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCS 332
            A HL  YL GS        P+     + S   +S+     +   SSS S  S     C 
Sbjct: 181 VASHLRLYLMGSV------VPNIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG 234

Query: 333 VVVSPYSTSQGSLCPSLNSLADFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTV 390
              +  + +QGSL P  +S  +    N  +    +D+    S    S+ DN  + S  T 
Sbjct: 235 ---TQQTKTQGSLSPRASSFKENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT- 290

Query: 391 DVVNLSQAETKLAEAVLPIPILPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYN 450
           +V    Q+E ++ EA      + +  + A E++E  PS    P    +  GP +FSPYY 
Sbjct: 291 NVEGFIQSEGEVEEATENYNGIRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYC 348

Query: 451 WAPPSTSTLQYTVTPPHLPTTLSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSL 501
           W PP+TS+L       H P   S S   PPLS         S L     S   + P   L
Sbjct: 349 WCPPTTSSL-------HAP---SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPL 398

Query: 502 DLTNFPPLDLPSFMPDPLVMPVSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GY 560
           DL++ PPL L   +P P           SSS Q     P +CDPIVHIPV+D+ SSG  Y
Sbjct: 399 DLSDIPPLPLVHHIPIPGSS--------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSY 450

Query: 561 LVSAGPT--VSTTITPLHPNPLITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAA 618
           LVSAGPT  +ST I PL       +N+S++EK ARETLRLL++ A+    +         
Sbjct: 451 LVSAGPTGIISTGIPPLP-----VENDSLVEKGARETLRLLISGANATTSTPLNHH---- 501

Query: 619 YIGGSRGLYTVTRDI 633
              GSRGLY+V+RD+
Sbjct: 502 ---GSRGLYSVSRDV 513


>AT2G39950.2 | flocculation protein | Chr2:16676758-16679961 REVERSE
           LENGTH=555 | 201606
          Length = 555

 Score =  332 bits (852), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 233/555 (41%), Positives = 318/555 (57%), Gaps = 56/555 (10%)

Query: 93  LQLIGACRADERMKPLFKLNVSSGVAEDRLLAQLSQHFDAAEVGILGRCLFVPLVSIRVG 152
           +Q++GACR DER+KPL KLNVS+G+AEDRLLA LSQHF+ AE+G+L RC  +PLVS+RVG
Sbjct: 1   MQVMGACRGDERLKPLLKLNVSNGMAEDRLLAHLSQHFEPAEIGMLARCFCIPLVSVRVG 60

Query: 153 KVIKRGSLLCPTAERGNLNLTMLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQ 212
           K+IK G L+ PT  RGNL+L +LP+S+L +SF+GD+G +E+L    SKS+ SAV IE+I 
Sbjct: 61  KIIKEGILMRPTPIRGNLSLMVLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEIT 120

Query: 213 SDSSGRSFRLQVPDGQVSYFWCSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDC 272
            DSSGRSF +++ +G   Y+WCSEKS+  G +L  K+K L++KKPS+++L+GI ESRL  
Sbjct: 121 VDSSGRSFVIRIANGNAFYYWCSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGS 180

Query: 273 FAIHLHAYLHGSTSGTKASTPDDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCS 332
            A HL  YL GS        P+     + S   +S+     +   SSS S  S     C 
Sbjct: 181 VASHLRLYLMGSV------VPNIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG 234

Query: 333 VVVSPYSTSQGSLCPSLNSLADFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTV 390
              +  + +QGSL P  +S  +    N  +    +D+    S    S+ DN  + S  T 
Sbjct: 235 ---TQQTKTQGSLSPRASSFKENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT- 290

Query: 391 DVVNLSQAETKLAEAVLPIPILPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYN 450
           +V    Q+E ++ EA      + +  + A E++E  PS    P    +  GP +FSPYY 
Sbjct: 291 NVEGFIQSEGEVEEATENYNGIRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYC 348

Query: 451 WAPPSTSTLQYTVTPPHLPTTLSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSL 501
           W PP+TS+L       H P   S S   PPLS         S L     S   + P   L
Sbjct: 349 WCPPTTSSL-------HAP---SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPL 398

Query: 502 DLTNFPPLDLPSFMPDPLVMPVSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GY 560
           DL++ PPL L   +P P           SSS Q     P +CDPIVHIPV+D+ SSG  Y
Sbjct: 399 DLSDIPPLPLVHHIPIPGSS--------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSY 450

Query: 561 LVSAGPT--VSTTITPLHPNPLITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAA 618
           LVSAGPT  +ST I PL       +N+S++EK ARETLRLL++ A+    +         
Sbjct: 451 LVSAGPTGIISTGIPPLP-----VENDSLVEKGARETLRLLISGANATTSTPLNHH---- 501

Query: 619 YIGGSRGLYTVTRDI 633
              GSRGLY+V+RD+
Sbjct: 502 ---GSRGLYSVSRDV 513


>AT2G39950.8 | flocculation protein | Chr2:16676758-16678262 REVERSE
           LENGTH=475 | 201606
          Length = 475

 Score =  213 bits (542), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 179/474 (37%), Positives = 250/474 (52%), Gaps = 56/474 (11%)

Query: 174 MLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQSDSSGRSFRLQVPDGQVSYFW 233
           +LP+S+L +SF+GD+G +E+L    SKS+ SAV IE+I  DSSGRSF +++ +G   Y+W
Sbjct: 2   VLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEITVDSSGRSFVIRIANGNAFYYW 61

Query: 234 CSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDCFAIHLHAYLHGSTSGTKASTP 293
           CSEKS+  G +L  K+K L++KKPS+++L+GI ESRL   A HL  YL GS        P
Sbjct: 62  CSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGSVASHLRLYLMGSV------VP 115

Query: 294 DDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCSVVVSPYSTSQGSLCPSLNSLA 353
           +     + S   +S+     +   SSS S  S     C    +  + +QGSL P  +S  
Sbjct: 116 NIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG---TQQTKTQGSLSPRASSFK 172

Query: 354 DFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTVDVVNLSQAETKLAEAVLPIPI 411
           +    N  +    +D+    S    S+ DN  + S  T +V    Q+E ++ EA      
Sbjct: 173 ENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT-NVEGFIQSEGEVEEATENYNG 231

Query: 412 LPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYNWAPPSTSTLQYTVTPPHLPTT 471
           + +  + A E++E  PS    P    +  GP +FSPYY W PP+TS+L       H P  
Sbjct: 232 IRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYCWCPPTTSSL-------HAP-- 280

Query: 472 LSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSLDLTNFPPLDLPSFMPDPLVMP 522
            S S   PPLS         S L     S   + P   LDL++ PPL L   +P P    
Sbjct: 281 -SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPLDLSDIPPLPLVHHIPIPGSS- 338

Query: 523 VSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GYLVSAGPT--VSTTITPLHPNP 579
                  SSS Q     P +CDPIVHIPV+D+ SSG  YLVSAGPT  +ST I PL    
Sbjct: 339 -------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSYLVSAGPTGIISTGIPPLP--- 388

Query: 580 LITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAAYIGGSRGLYTVTRDI 633
              +N+S++EK ARETLRLL++ A+    +            GSRGLY+V+RD+
Sbjct: 389 --VENDSLVEKGARETLRLLISGANATTSTPLNHH-------GSRGLYSVSRDV 433


>AT2G39950.7 | flocculation protein | Chr2:16676758-16678262 REVERSE
           LENGTH=475 | 201606
          Length = 475

 Score =  213 bits (542), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 179/474 (37%), Positives = 250/474 (52%), Gaps = 56/474 (11%)

Query: 174 MLPSSNLAISFVGDDGCTEKLAMLRSKSESSAVVIEDIQSDSSGRSFRLQVPDGQVSYFW 233
           +LP+S+L +SF+GD+G +E+L    SKS+ SAV IE+I  DSSGRSF +++ +G   Y+W
Sbjct: 2   VLPTSDLRLSFIGDNGHSEQLFTYTSKSQCSAVSIEEITVDSSGRSFVIRIANGNAFYYW 61

Query: 234 CSEKSQAHGAKLLAKVKSLLRKKPSLAKLSGISESRLDCFAIHLHAYLHGSTSGTKASTP 293
           CSEKS+  G +L  K+K L++KKPS+++L+GI ESRL   A HL  YL GS        P
Sbjct: 62  CSEKSKLLGTELRRKMKDLIKKKPSISELTGIEESRLGSVASHLRLYLMGSV------VP 115

Query: 294 DDSPSLLGSGLENSALQLDSSNHLSSSLSEPSDSKGSCSVVVSPYSTSQGSLCPSLNSLA 353
           +     + S   +S+     +   SSS S  S     C    +  + +QGSL P  +S  
Sbjct: 116 NIKGCQVPSPDSSSSSGFSETADSSSSASSKSLRARHCG---TQQTKTQGSLSPRASSFK 172

Query: 354 DFVHGNEFMERFGQDRDS--SICSLSVIDNLPVVSTSTVDVVNLSQAETKLAEAVLPIPI 411
           +    N  +    +D+    S    S+ DN  + S  T +V    Q+E ++ EA      
Sbjct: 173 ENTLRNASLRISSRDKSKGHSEGHFSIFDNSSITSIPT-NVEGFIQSEGEVEEATENYNG 231

Query: 412 LPRGFLEAPEKSEFLPSVHLFPSSQVVPTGPLLFSPYYNWAPPSTSTLQYTVTPPHLPTT 471
           + +  + A E++E  PS    P    +  GP +FSPYY W PP+TS+L       H P  
Sbjct: 232 IRQ--IIAFEEAESTPSTMTGPPPFPLKMGPPVFSPYYCWCPPTTSSL-------HAP-- 280

Query: 472 LSDSLSLPPLS---------SLLSAVRSSSPSIPPRQSLDLTNFPPLDLPSFMPDPLVMP 522
            S S   PPLS         S L     S   + P   LDL++ PPL L   +P P    
Sbjct: 281 -SASYQFPPLSIELPSLPPLSSLLPASGSDGFLIPSSPLDLSDIPPLPLVHHIPIPGSS- 338

Query: 523 VSSFLGGSSSQQIPTFTPFICDPIVHIPVMDVCSSG-GYLVSAGPT--VSTTITPLHPNP 579
                  SSS Q     P +CDPIVHIPV+D+ SSG  YLVSAGPT  +ST I PL    
Sbjct: 339 -------SSSSQQQMMIPIMCDPIVHIPVIDIFSSGQSYLVSAGPTGIISTGIPPLP--- 388

Query: 580 LITQNESVLEKNARETLRLLLASAHQVPFSTSEEQPGAAYIGGSRGLYTVTRDI 633
              +N+S++EK ARETLRLL++ A+    +            GSRGLY+V+RD+
Sbjct: 389 --VENDSLVEKGARETLRLLISGANATTSTPLNHH-------GSRGLYSVSRDV 433
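

Note on reading the statistics above: the following is a minimal Python sketch, not part of the BLAST output, for parsing an "Identities / Positives / Gaps" line and normalizing legacy E-values such as "e-110" (printed without a leading mantissa). The function names parse_stats, parse_evalue and query_coverage are hypothetical helpers introduced here for illustration. The check that the printed percentages equal the floored ratio is an observation that holds for the numbers in this report, not a documented guarantee of the program.

```python
import re

# Matches the statistics line printed under each alignment header.
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def parse_stats(line):
    """Return counts, alignment length and reported percent for each field."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not a BLAST statistics line: %r" % line)
    nums = [int(g) for g in match.groups()]
    keys = ("identities", "positives", "gaps")
    return {
        key: {
            "count": nums[3 * i],
            "length": nums[3 * i + 1],
            "percent": nums[3 * i + 2],
        }
        for i, key in enumerate(keys)
    }


def parse_evalue(text):
    """Convert an E-value string to float; 'e-110' is read as 1e-110."""
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def query_coverage(q_start, q_end, q_len):
    """Fraction of the query spanned by the alignment (inclusive coordinates)."""
    return (q_end - q_start + 1) / q_len


# Values copied from the first hit (AT2G39950.1) in this report.
line = " Identities = 243/592 (41%), Positives = 335/592 (56%), Gaps = 63/592 (10%)"
stats = parse_stats(line)
for name, s in stats.items():
    floored = s["count"] * 100 // s["length"]
    print(name, s["percent"], floored)

print(parse_evalue("e-110"))
print(round(query_coverage(86, 661, 696), 3))
```

For the first hit, the alignment covers query positions 86 to 661 of the 696-letter query, so the sketch reports the spanned fraction of the query rather than the identity over the full query length.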

