BLAST Result
BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Eca_sc000117.1_g0500.1
(627 letters)
Database: ./nr
95,329,361 sequences; 35,143,497,570 total letters
Searching..................................................done
                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value
XP_012849842.1 PREDICTED: uncharacterized protein LOC105969618 [...     449   e-147
XP_012856897.1 PREDICTED: uncharacterized protein LOC105976150 [...     374   e-117
KZV48565.1 hypothetical protein F511_32101 [Dorcoceras hygrometr...     367   e-116
>XP_012849842.1 PREDICTED: uncharacterized protein LOC105969618 [Erythranthe
guttata]
Length = 650
Score = 449 bits (1155), Expect = e-147, Method: Compositional matrix adjust.
Identities = 231/465 (49%), Positives = 314/465 (67%), Gaps = 42/465 (9%)
Query: 21 FDVHHSDSPTTILITPLLTGDNYSSWSRGMSKALRAKGKLGFVDGSIDKPTDPNELDQWG 80
+ +HHSDSP+TIL+TPLLTGDNY SWSR ++ ALRAK KLGFVDGS+ PT+ +++ W
Sbjct: 24 YTIHHSDSPSTILVTPLLTGDNYGSWSRAVTMALRAKNKLGFVDGSLPIPTEKSDISNWE 83
Query: 81 RCNDLVSSWLINSVDPPLRSSILYDETASDIWKNLFDRYHQTNAPKIFQLKRAISCLKQE 140
RCNDLV SW++NSV P +R SILY ETA+ IW +L DR+ Q+NAPKI+QLK++IS LKQE
Sbjct: 84 RCNDLVGSWILNSVSPEIRPSILYAETAAQIWTDLKDRFSQSNAPKIYQLKQSISSLKQE 143
Query: 141 NLDVSTYFTHLKALQDELNSLLITEPCICGHGRSLVERTNQDRAMEFLQGLNDRFSSIRS 200
++ VS YFT LK+L DEL S++ PCICG+ +S++++ NQDR+MEFLQGL+DRFS+IRS
Sbjct: 144 SMSVSLYFTQLKSLWDELGSIIHITPCICGNAKSIIDQQNQDRSMEFLQGLHDRFSAIRS 203
Query: 201 QILLIEPFPSILRIHSLTKQEEVQQ--NLVTPMNVDSNTAALAVGSNDYSHNRFNGSDRQ 258
QILL+EPFPSI RI++L +QEE QQ N++T VD+ AAL + R
Sbjct: 204 QILLMEPFPSIQRIYNLVRQEEKQQEINILTTPTVDA--AALQASKPQF---------RP 252
Query: 259 RTKRPRPFCENCKIHGHDVSSCYKLHGYPPNYGQRGRNRPRQDAVTASVMDNNGSAMNHN 318
KR RPFC++C HGH +++CY+LHG+P + ++ P +N + M
Sbjct: 253 SGKRQRPFCDHCNKHGHTLATCYQLHGFPDKHVKKSVPPP-----------SNSTLM--- 298
Query: 319 GSGVPAASSFTNEQYQGILALLDRNRQSTDHSNGTNKDPLINLTGKYSSSSHPPWVIDSG 378
ASS T+EQY +L LL + S P ++L GK + S W+IDSG
Sbjct: 299 ------ASSLTHEQYNKLLTLLAKEETS---------GPSVHLAGKNHTFSSFCWIIDSG 343
Query: 379 AAKHICSNLSRFSSYSLAPPDLYVRLPDDSRVKVDNIGTIYFTNRFYLKDVLHIPSFQFN 438
A+ HIC++LS FSSYS ++YV+LPD S V +IGT+ F L +V +IPSF+FN
Sbjct: 344 ASNHICTSLSFFSSYSPIRKNIYVQLPDGSHAPVTHIGTVKCFGTFILTNVFYIPSFKFN 403
Query: 439 LISISQISKSLRCTVLFDRDLCLFQDRITKTEIGHGSLHDDLYFL 483
L+SISQ +KS C ++F C+FQD+ TK IG G+ H+ L++L
Sbjct: 404 LLSISQFTKSTNCDIIFSSSGCVFQDQSTKKTIGRGNPHNGLFYL 448
>XP_012856897.1 PREDICTED: uncharacterized protein LOC105976150 [Erythranthe
guttata]
Length = 746
Score = 374 bits (961), Expect = e-117, Method: Compositional matrix adjust.
Identities = 225/627 (35%), Positives = 345/627 (55%), Gaps = 72/627 (11%)
Query: 18 SGPFDVHHSDSPTTILITPLLTGDNYSSWSRGMSKALRAKGKLGFVDGSIDKPTDPNEL- 76
S PF +H SD P +L++ LL+ DN++SWSR M +L K KLGF++G+I +P+ +
Sbjct: 11 SSPFYLHPSDGPGLVLVSQLLSEDNFASWSRAMQISLTVKNKLGFINGTITEPSRDEAVL 70
Query: 77 -DQWGRCNDLVSSWLINSVDPPLRSSILYDETASDIWKNLFDRYHQTNAPKIFQLKRAIS 135
+ W R N +V SW++N+V +++SI+Y ++A ++WK+L R+ QTN P+IFQL+R +S
Sbjct: 71 HNAWVRNNSIVISWILNAVSKDIQASIMYSDSAHEMWKDLNTRFSQTNGPRIFQLRRELS 130
Query: 136 CLKQENLDVSTYFTHLKALQDELNSL---LITEPCICGHGRSLVERTNQDRAMEFLQGLN 192
L Q+ V+ YFT LKA+ DEL++ C CG + L E N + M FL GLN
Sbjct: 131 NLTQDTQSVNVYFTKLKAIWDELSNFRPSCTCGACTCGGVQKLNEHYNLEHVMAFLMGLN 190
Query: 193 DRFSSIRSQILLIEPFPSILRIHSLTKQEEVQQNLVTPMNVDSNTAALAVGSNDYS---- 248
+ +S R QILL++P P I ++ +L QEE Q+++ + N N+ A ++ +
Sbjct: 191 ESLTSTRGQILLMDPLPPINKVFALVSQEERQRSIHSSHNEVQNSLAFSIRGDQSVQRSV 250
Query: 249 HNRFNGSDRQRTKRPRPFCENCKIHGHDVSSCYKLHGYPPNYGQRGRNRPRQDAVTASVM 308
HN+ S +R + R FC +C I+GH + CYKLHGYPP Y + +PR ++ S
Sbjct: 251 HNQVYTSAPKR--KERGFCTHCNIYGHTIDKCYKLHGYPPGY----KAKPRYSSLPQSRF 304
Query: 309 DNNGSA-----MNHNGSGV----------PAASSFTNEQYQGILA-----LLDRNRQSTD 348
N A +++ SG P ++ + Q Q ++A + + + ST
Sbjct: 305 SVNQVAAMESPLDYATSGSTSQPPFVSSDPVLANMSAAQCQQLMAYFSNQMAAKKQVSTQ 364
Query: 349 HSNGTNKDPL-------INLTGKYSSSSHPP-WVIDSGAAKHICSNLSRFSSYSLAPPDL 400
S+G + I L S P W+IDSGA++HIC++ + FSS L +
Sbjct: 365 QSHGDEAEVAHISCVSGICLAASLHESFQPHYWIIDSGASRHICNDKTLFSS--LHKVNF 422
Query: 401 Y-VRLPDDSRVKVDNIGTIYFTNRFYLKDVLHIPSFQFNLISISQISKSLRCTVLFDRDL 459
V LPD S V V+ +G + ++ LK+V ++PSF+FNLIS+S + L TV+FD
Sbjct: 423 ARVILPDCSLVVVEYMGDVCLSDDLILKNVFYVPSFKFNLISVSALLDRLPHTVIFDSTS 482
Query: 460 CLFQDRITKTEIGHGSLHDDLYFLRTERPILSVSNLVRDNSSHESFVLWHCRLGHPSFSR 519
L QD+ K +IG GS D S +WH RLGH +
Sbjct: 483 FLIQDKFLK-KIGKGSKID------------------------VSATVWHNRLGHIPQLK 517
Query: 520 FQYFNSRLPCMNFNFLSNKVPCELCPLSKQSRLPFPNSTTTSSRIFDIVHMDLWGPFPVP 579
+++ + + N C +CP++KQ RL FP S+T SS +FD++H D+WGP+ V
Sbjct: 518 LDILSTKFS-LAMDKPKNNSCCYICPMAKQKRLKFPISSTVSSHMFDLIHCDIWGPYRVE 576
Query: 580 STSGCRYFLTLVDDFSRCTWIFLMSSK 606
S +G +YF+TLVDD+SR TW+ L+ SK
Sbjct: 577 SHNGYKYFVTLVDDYSRFTWVHLLKSK 603
>KZV48565.1 hypothetical protein F511_32101 [Dorcoceras hygrometricum]
Length = 600
Score = 367 bits (942), Expect = e-116, Method: Compositional matrix adjust.
Identities = 211/595 (35%), Positives = 328/595 (55%), Gaps = 39/595 (6%)
Query: 18 SGPFDVHHSDSPTTILITPLLTGDNYSSWSRGMSKALRAKGKLGFVDGSIDKPTDPNEL- 76
S P+ + + D P +L++ LL G NY+ WSR M AL AK KLGFVD SID+P + L
Sbjct: 12 SSPYYLQNGDHPGLLLVSNLLVGSNYNIWSRAMVVALTAKNKLGFVDNSIDQPRSDDLLY 71
Query: 77 DQWGRCNDLVSSWLINSVDPPLRSSILYDETASDIWKNLFDRYHQTNAPKIFQLKRAISC 136
W RCN +V SW++NSV + S++Y TA ++W +L DR+H++NAP+++Q+K+ ++
Sbjct: 72 GSWTRCNSMVISWILNSVTRDIADSLMYMPTAREMWVDLHDRFHESNAPRVYQIKKMLNG 131
Query: 137 LKQENLDVSTYFTHLKALQDELNSLLITEPCICGHGRSLVERTNQDRAMEFLQGLNDRFS 196
L+Q +D+S+Y+T L+ L DEL T C CG + + NQ+ M FL GLN+ ++
Sbjct: 132 LQQGAMDISSYYTKLRILWDELRDYQPTSVCNCGSMKEWIAYQNQECVMHFLMGLNESYA 191
Query: 197 SIRSQILLIEPFPSILRIHSLTKQEEVQQNL---VTPMNVDSNTAALAVGSNDYSHNRFN 253
IR+Q+L++EP P I ++ +L QEE Q+++ +++D + +L +++ ++
Sbjct: 192 QIRAQVLMMEPLPIISKVFALVVQEERQRSIHHGTAKISIDHH-VSLNNVNSNIVNSTTT 250
Query: 254 GSDRQRTKRPRPFCENCKIHGHDVSSCYKLHGYPPNYGQRGRNRPRQDAVT---ASVMDN 310
+ K + C +C H V CYKLHGYPP + + + P+ +A +S+M +
Sbjct: 251 PRVPRSGKGDKVVCSHCHFRNHTVDKCYKLHGYPPGHPKLKQQLPQSNAQVHQISSIMQD 310
Query: 311 NGSAMNHNGSGVPAASSFTNEQYQGILALLDRNRQ---------STDHSNGTNKDPLINL 361
N SA S T Q + ++ L S+ + + +
Sbjct: 311 NSSA---------PGDSLTQNQCKQLIEFLSSKLHFGHSSQVEPQQQESSASCFTGICST 361
Query: 362 TGKYSSSSHPPWVIDSGAAKHICSNLSRFSSYSLAPPDLYVRLPDDSRVKVDNIGTIYFT 421
SS +H WV+D+GA HIC +LS F +S P + + LP+ ++V G+++ T
Sbjct: 362 VSHNSSITHTDWVLDTGATHHICCSLSMF--HSSKPVNSKIMLPNTLTIQVTTTGSVFLT 419
Query: 422 NRFYLKDVLHIPSFQFNLISISQISKSLRCTVLFDRDLCLFQDRITKTEIGHGSLHDDLY 481
N L DVL++P FQFNL+SIS ++K+L C+V F D C QD IG G +LY
Sbjct: 420 NDLILHDVLYVPEFQFNLLSISSLTKNLACSVSFMSDSCHIQDFKRTKTIGMGKRLGNLY 479
Query: 482 FLRTERPILSVSNLVRDNSSHESFVLWHCRLGHPSFSRFQYFNSRLPCMNFNFLSNKVP- 540
L I S S + N S LWHCR+GHPS ++ + L +F S V
Sbjct: 480 VL-INSSITSPSYVC--NVSVPKPELWHCRMGHPSPNKLSSLKNIL-----HFDSTDVDI 531
Query: 541 --CELCPLSKQSRLPFPNSTTTSSRIFDIVHMDLWGPFPVPSTSGCRYFLTLVDD 593
C +C +SKQ RLPF + T++ F+++H+D+WGPF S G R+FLT+VDD
Sbjct: 532 NLCHVCHMSKQKRLPFESHNKTAAHSFELLHIDVWGPFSKYSVDGYRFFLTIVDD 586