BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000102.1_g1240.1
(506 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
GAU44080.1 hypothetical protein TSUD_399620 [Trifolium subterran...   256   3e-74
AFP55574.1 non-ltr retroelement reverse transcriptase [Rosa rugosa]   250   5e-69
AAS55787.1 hypothetical protein [Oryza sativa Japonica Group] AA...   246   1e-67
>GAU44080.1 hypothetical protein TSUD_399620 [Trifolium subterraneum]
Length = 675
Score = 256 bits (654), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 150/469 (31%), Positives = 239/469 (50%), Gaps = 15/469 (3%)
Query: 9 STLAWNVRGCGNYLNRKHLWTLIREHKPEVIFLSETKCMNTNINKL-LNLGMDYTSYTVE 67
S + WN RG G+ L L+R +KP+V+FLSET M+ + +L LG D + ++V+
Sbjct: 2 SMIGWNCRGIGHPRTVPSLKYLVRVYKPDVLFLSETLSMSNKMEELRYMLGFD-SFFSVD 60
Query: 68 PNNKAGGLGLFWTKEINMEIMYADRNIIQARASFTPDNSNFILSCFYGSPYKHNRMDPWQ 127
+ GGL L W + ++ N + N N+ L+ FYG P R D W
Sbjct: 61 REGRGGGLALMWRFSFHCSVINFSANHVDVEVK-DNMNGNWRLTGFYGFPGSGRRRDSWN 119
Query: 128 TLVSLAPNNSLPWVIIGDLNTILHPDEKTGGSRKVPTLVRGINSLINNLGVRDVGFSGYP 187
L L+ +++LPW I+GD N IL P EK G + + P L+ G S++ + G+ D+ GYP
Sbjct: 120 FLKQLSHSSNLPWCILGDFNDILLPYEKKGKNDRAPWLINGFRSVVLDSGLVDIHMEGYP 179
Query: 188 YTWSNRQFNGNLIQERLDRALTNDNWLIEFPDTNLIHLPGVGSDHNPILLTTNPRPKLG- 246
+TW ++ERLD+AL ND W FP+ L +LP SDH PILL P ++
Sbjct: 180 FTWFKSLGTFRAVEERLDKALANDAWFQNFPNAILENLPAPASDHYPILLVREPENRISR 239
Query: 247 -PKPFKFIRTWMSHPECSQFIQEKW-TFNPAQIHMSLNRLAGHLSRWNKEVFGHLDSKIK 304
FKF W+ PE S F+ +W ++ Q+ L+ A L+ WN+ F L I
Sbjct: 240 IRSRFKFENAWLVDPEFSDFVSNRWLSYGDQQVLNKLDMCASDLTIWNRNHFQRLRRDID 299
Query: 305 SLTNQLQRLRDK---DSVHRIN---KELEEAYNQQESLWREKSRIDNIQLGDRNTKYFHS 358
+ +++ +R K ++VH N + + + Q+++ WR++++ ++ GD NTK+FH+
Sbjct: 300 TCRKKIECVRSKVNYENVHYFNSLRQRMSQLLVQEDAYWRQRAKTHWLRDGDLNTKFFHA 359
Query: 359 KAIHRGRKNQIMAIKKEDNSWTNEQSEIASIFQTNLKQISTTNGQEAEIT-FLNLFSTQI 417
A R + N+I ++ + +++ + + + N Q + I N+ I
Sbjct: 360 AATSRRKVNRINSLLDSSGNLITNNADLCEVARDYF--VDIFNKQHSTIEPVANIIDQSI 417
Query: 418 TEAQNTTITATPNKEEIKNAVFSLKPHAAPGPDGYPPYFYQANWATTNQ 466
NT +TA EE K A+FS+ P PGPDG+ P F+Q W Q
Sbjct: 418 MSEDNTLLTAPFTLEEFKEAMFSMHPDKCPGPDGFNPGFFQHFWHICGQ 466
>AFP55574.1 non-ltr retroelement reverse transcriptase [Rosa rugosa]
Length = 1656
Score = 250 bits (638), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 143/461 (31%), Positives = 238/461 (51%), Gaps = 20/461 (4%)
Query: 14 NVRGCGNYLNRKHLWTLIREHKPEVIFLSETKCMNTNINKLLNLGMDYTSY-TVEPNNKA 72
N G G L R + ++H PE++FL ET+ I K + +T + V+P
Sbjct: 616 NQDGGGKTLRR-----ICKKHNPEILFLMETR-QQEGIIKEWKRNLKFTDHHVVDPIATG 669
Query: 73 GGLGLFWTKEINMEIMYADRNIIQARASFTPDNSNFILSCFYGSPYKHNRMDPWQTLVSL 132
GL LFW + + I+ + N + SF D ++ YG+P+ + + W+ + S
Sbjct: 670 RGLALFWGDAVQVSILDSSPNYVDTVVSFLSDAFVCKITWMYGNPHDNEKRAFWRLMYSR 729
Query: 133 APNNSLPWVIIGDLNTILHPDEKTGGSRKVPTLVRGINSLINNLGVRDVGFSGYPYTWSN 192
P SLPW+++GD N +L P EK GG +P ++ +NN +RD+ F G ++W
Sbjct: 730 FPVQSLPWLVLGDFNEVLDPSEKWGGGPPLPWRIKLFRDFLNNGHLRDLHFKGPGFSWFA 789
Query: 193 RQFNGNLIQERLDRALTNDNWLIEFPDTNLIHLPGVGSDHNPILLTTNPRPKLGPKPFKF 252
+ I+ERLDRAL N W P+T ++HLP +GSDH P+LL +NP+ + F+F
Sbjct: 790 MRHGRVFIKERLDRALGNIAWSSSQPNTQILHLPKIGSDHRPLLLDSNPKMLNKTRLFRF 849
Query: 253 IRTWMSHPECSQFIQEKW--TFNPAQIHMSLNR----LAGHLSRWNKEVFGHLDSKIKSL 306
+ W +H E S IQ W F + + S NR L W+KE F + ++ L
Sbjct: 850 EQMWTTHEEYSDVIQRSWPPAFGGSAMR-SWNRNLLSCGKALKMWSKEKFSNPSVQVADL 908
Query: 307 TNQLQRLRDK---DSVHRIN---KELEEAYNQQESLWREKSRIDNIQLGDRNTKYFHSKA 360
+ +++L D+ H+IN ++ + + Q E W ++SR++ ++LGD+N+ +FH
Sbjct: 909 LSDIEKLHQSNPPDAHHQINILTDQVTKLWTQDEMYWHQRSRVNWLKLGDQNSSFFHQTT 968
Query: 361 IHRGRKNQIMAIKKEDNSWTNEQSEIASIFQTNLKQISTTNGQEAEITFLNLFSTQITEA 420
I R + N+I+ +K + +W + ++++A F + +NG + L+ T +T
Sbjct: 969 IQRRQYNKIVRLKDDHGNWLDSEADVALQFLDYFTALYQSNGPQQWEEVLDFVDTAVTAE 1028
Query: 421 QNTTITATPNKEEIKNAVFSLKPHAAPGPDGYPPYFYQANW 461
N +++ + E+K AVF L APGPDG+ FYQ W
Sbjct: 1029 MNKILSSPVSLLEVKKAVFDLGATKAPGPDGFSGIFYQNQW 1069
>AAS55787.1 hypothetical protein [Oryza sativa Japonica Group] AAV32224.1
hypothetical protein [Oryza sativa Japonica Group]
Length = 1936
Score = 246 bits (628), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 142/496 (28%), Positives = 247/496 (49%), Gaps = 33/496 (6%)
Query: 9 STLAWNVRGCGNYLNRKHLWTLIREHKPEVIFLSETKCMNTNINKLLNLGMDYTSYTVEP 68
S LAWN RG GN + L LI++ +++FL ET+ +++L V
Sbjct: 637 SCLAWNCRGLGNTATVQDLRALIQKAGSQLVFLCETRQSVEKMSRLRRKLAFRGFVGVSS 696
Query: 69 NNKAGGLGLFWTKEINMEIMYADRNIIQARASFTPDNSNFILSCFYGSPYKHNRMDPWQT 128
K+GGL L+W + +++++ ++ I A +PD + ++ YG P NR W
Sbjct: 697 EGKSGGLALYWDESVSVDVKDINKRYIDAYVRLSPDEPQWHITFVYGEPRVENRHRMWSL 756
Query: 129 LVSLAPNNSLPWVIIGDLNTILHPDEKTGGSRKVPTLVRGINSLINNLGVRDVGFSGYPY 188
L ++ +++LPW++IGD N L E + + T ++ + + ++D+GF G P+
Sbjct: 757 LRTIRQSSALPWMVIGDFNETLWQFEHFSKNPRCETQMQNFRDALYDCDLQDLGFKGVPH 816
Query: 189 TWSNRQFNGNLIQERLDRALTNDNWLIEFPDTNLIHLPGVGSDHNPILL------TTNPR 242
T+ NR+ ++ RLDRA+ +D W FP+ + HL SDH+PILL TT PR
Sbjct: 817 TYDNRRDGWRNVKVRLDRAVADDKWRDLFPEAQVSHLVSPCSDHSPILLEFIVKDTTRPR 876
Query: 243 PKLGPKPFKFIRTWMSHPECSQFIQEKWT-----FNPAQIHMSLNRLAGHLSRWNKEVFG 297
K + W PE Q I+E W + I+++L R+ L W+K
Sbjct: 877 QKC----LHYEIVWEREPESVQVIEEAWINAGVKTDLGDINIALGRVMSALRSWSK---- 928
Query: 298 HLDSKIKSLTNQLQRLRDK-----------DSVHRINKELEEAYNQQESLWREKSRIDNI 346
+K+K++ +L++ R K S+ + + E ++E LW ++SR++ +
Sbjct: 929 ---TKVKNVGKELEKARKKLEDLIASNAARSSIRQATDHMNEMLYREEMLWLQRSRVNWL 985
Query: 347 QLGDRNTKYFHSKAIHRGRKNQIMAIKKEDNSWTNEQSEIASIFQTNLKQISTTNGQEAE 406
+ GDRNT++FHS+A+ R +KN+I ++ E+ + + S + ++ + + +
Sbjct: 986 KEGDRNTRFFHSRAVWRAKKNKISKLRDENGAIHSTTSVLETMATEYFQGVYKADPSLNP 1045
Query: 407 ITFLNLFSTQITEAQNTTITATPNKEEIKNAVFSLKPHAAPGPDGYPPYFYQANWATTNQ 466
+ LF ++T+A N + +EEI A+F + P +P PDG+P FYQ NW T
Sbjct: 1046 ESVTRLFQEKVTDAMNEKLCQEFKEEEIAQAIFQIGPLKSPRPDGFPARFYQRNWGTLKS 1105
Query: 467 PAEYHLQNNCQSPCQP 482
++N QS P
Sbjct: 1106 DIILAVRNFFQSGLMP 1121