BLASTP 2.0.14 [Jun-29-2000]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= CYS1_DICDI
(351 letters)
Database: /home/peter/blast/data/swissprot
88,780 sequences; 31,984,247 total letters
Searching......................................................................................................................................................
3 occurrence(s) of pattern in query
CYS1_DICDI; PATTERN.
pattern P-E-E-Q at position 23 of query sequence
effective database length=3.2e+07
pattern probability=8.9e-06
lengthXprobability=2.8e+02
Number of occurrences of pattern in the database is 349
CYS1_DICDI; PATTERN.
pattern P-E-E-Q at position 120 of query sequence
effective database length=3.2e+07
pattern probability=8.9e-06
lengthXprobability=2.8e+02
Number of occurrences of pattern in the database is 349
CYS1_DICDI; PATTERN.
pattern P-E-E-Q at position 237 of query sequence
effective database length=3.2e+07
pattern probability=8.9e-06
lengthXprobability=2.8e+02
Number of occurrences of pattern in the database is 349
done
Results from round 1
Score E
(bits) Value
Significant matches for pattern occurrence 1 at position 23
sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 688 0.0
sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE 8 4.8
sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST... 7 6.0
sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4 7 7.6
sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7... 7 9.6
Significant matches for pattern occurrence 2 at position 120
sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT 13 0.13
sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT... 11 0.43
sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN) 11 0.55
sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNI... 10 1.1
sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 I... 8 3.0
sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURS... 7 6.0
sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1 7 7.6
sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN] 7 7.6
Significant matches for pattern occurrence 3 at position 237
sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, ... 9 1.4
sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, ... 9 1.4
sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 8 4.8
sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROT... 7 6.0
sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MI... 7 9.6
Significant alignments for pattern occurrence 1 at position 23
>sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR
Length = 343
Score = 688 bits (1789), Expect = 0.0
Identities = 343/351 (97%), Positives = 343/351 (97%), Gaps = 8/351 (2%)
Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60
pattern 23 ****
MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE
Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60
Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPP 120
pattern 120 *
ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP
Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP- 119
Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180
pattern 121 ***
TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE
Sbjct: 120 ---TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176
Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240
pattern 237 ****
CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG
Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG---- 232
Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 300
AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG
Sbjct: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 292
Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII
Sbjct: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343
>sp|P30957|RYNC_RABIT RYANODINE RECEPTOR, CARDIAC MUSCLE
Length = 4969
Score = 7.8 bits (25), Expect = 4.8
Identities = 14/39 (35%), Positives = 19/39 (47%)
Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61
pattern 23 ****
PEEQ +F E + K +K EE E + G+ EE
Sbjct: 4414 PEEQEKFQEQKTKEEEKEEKEETKSEPEKAEGEDGEKEE 4452
>sp|Q08862|GTC_RABIT GLUTATHIONE S-TRANSFERASE YC (ALPHA II) (GST CLASS-ALPHA)
Length = 221
Score = 7.4 bits (24), Expect = 6.0
Identities = 19/67 (28%), Positives = 35/67 (51%), Gaps = 12/67 (17%)
Query: 21 IPPEEQ-SQFLEFQDKFNKKY---------SH-EEYLERFEIFKSNLGKIEEL-NLIAIN 68
pattern 23 ****
+PPEEQ ++ + +DK +Y SH ++YL ++ K+++ +E L N+ +N
Sbjct: 112 LPPEEQEAKLAQIKDKAKNRYFPAFEKVLKSHGQDYLVGNKLSKADILLVELLYNVEELN 171
Query: 69 HKADTKF 75
A F
Sbjct: 172 PGATASF 178
>sp|O95801|TTC4_HUMAN TETRATRICOPEPTIDE REPEAT PROTEIN 4
Length = 356
Score = 7.1 bits (23), Expect = 7.6
Identities = 14/67 (20%), Positives = 32/67 (46%), Gaps = 5/67 (7%)
Query: 23 PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGK---IEELNLIAINHKADTKFGVNK 79
pattern 23 ****
PEEQ++ ++D+ N + ++Y + + L K +LN + ++A ++ +
Sbjct: 75 PEEQAK--TYKDEGNDYFKEKDYKKAVISYTEGLKKKCADPDLNAVLYTNRAAAQYYLGN 132
Query: 80 FADLSSD 86
F +D
Sbjct: 133 FRSALND 139
>sp|P36114|YKZ8_YEAST HYPOTHETICAL 81.8 KDA PROTEIN IN YPT52-DBP7 INTERGENIC REGION
Length = 725
Score = 6.8 bits (22), Expect = 9.6
Identities = 21/99 (21%), Positives = 43/99 (43%), Gaps = 21/99 (21%)
Query: 21 IPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN 78
pattern 23 ****
+ PEEQ L+F ++ H ER + +++G +N + + G+
Sbjct: 213 LTPEEQKDKDLLQFAEQI-----HSMRTER--LSGAHIGNSPAIN------RLRGELGLQ 259
Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117
DL +E ++ + +DD+ ++ DEF++S
Sbjct: 260 AMEDLPEEEITDH------KVLSDDIDLSQATIDEFVHS 292
Significant alignments for pattern occurrence 2 at position 120
>sp|P11559|MCRA_METVO METHYL-COENZYME M REDUCTASE ALPHA SUBUNIT
Length = 555
Score = 13.0 bits (40), Expect = 0.13
Identities = 16/28 (57%), Positives = 18/28 (64%), Gaps = 3/28 (10%)
Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123
pattern 120 ****
IFT D +AD LDD F IN + PEEQ
Sbjct: 170 IFTGDDELADELDDRFVIDINKLFPEEQ 197
>sp|Q49605|MCRA_METKA METHYL-COENZYME M REDUCTASE I ALPHA SUBUNIT (MCR I ALPHA)
Length = 553
Score = 11.2 bits (35), Expect = 0.43
Identities = 14/28 (50%), Positives = 18/28 (64%), Gaps = 3/28 (10%)
Query: 99 IFTDDLPVADYLDDEFINSIP---PEEQ 123
pattern 120 ****
I T DL +AD +DD+F+ I PEEQ
Sbjct: 168 IITGDLELADEIDDKFLIDIEKLFPEEQ 195
>sp|P81901|FER_PYRIS FERREDOXIN (SEVEN-IRON FERREDOXIN)
Length = 101
Score = 10.9 bits (34), Expect = 0.55
Identities = 12/23 (52%), Positives = 16/23 (69%), Gaps = 1/23 (4%)
Query: 114 FINSIPPEEQTAF-DWRTRGAVT 135
pattern 120 ****
F S+ PEEQ AF +W+TR +T
Sbjct: 78 FGKSLTPEEQRAFEEWKTRYGIT 100
>sp|Q58256|MCRX_METJA METHYL-COENZYME M REDUCTASE II ALPHA SUBUNIT (MCR II ALPHA)
Length = 553
Score = 9.8 bits (31), Expect = 1.1
Identities = 14/28 (50%), Positives = 17/28 (60%), Gaps = 3/28 (10%)
Query: 99 IFTDDLPVADYLDDEF---INSIPPEEQ 123
pattern 120 ****
IFT D +AD +D F IN + PEEQ
Sbjct: 168 IFTGDDELADEIDKRFLIDINKLFPEEQ 195
>sp|P53203|YG14_YEAST HYPOTHETICAL 52.9 KD PROTEIN IN ERP6-TFG2 INTERGENIC REGION
Length = 462
Score = 8.5 bits (27), Expect = 3.0
Identities = 13/39 (33%), Positives = 21/39 (53%), Gaps = 9/39 (23%)
Query: 112 DEFINSIP-------PEEQT--AFDWRTRGAVTPVKNQG 141
pattern 120 ****
DEF+N+ P PEEQ+ A++W + + + N G
Sbjct: 308 DEFLNTSPSPEVFTLPEEQSGMAWEWHDKDWMLDLTNDG 346
>sp|P55002|MGP1_MOUSE MICROFIBRIL-ASSOCIATED GLYCOPROTEIN PRECURSOR (MAGP) (MAGP-1)
Length = 183
Score = 7.4 bits (24), Expect = 6.0
Identities = 11/37 (29%), Positives = 18/37 (47%)
Query: 100 FTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTP 136
pattern 120 ****
+ D + ADY D + ++ PEEQ + + V P
Sbjct: 37 YGDQIDNADYYDYQEVSPRTPEEQFQSQQQVQQEVIP 73
>sp|Q06234|ASH1_XENLA ACHAETE-SCUTE HOMOLOG 1
Length = 199
Score = 7.1 bits (23), Expect = 7.6
Identities = 11/27 (40%), Positives = 15/27 (54%), Gaps = 1/27 (3%)
Query: 105 PVADYLDDE-FINSIPPEEQTAFDWRT 130
pattern 120 ****
PV+ Y DE + + PEEQ D+ T
Sbjct: 171 PVSSYSSDEGSYDPLSPEEQELLDFTT 197
>sp|P20918|PLMN_MOUSE PLASMINOGEN PRECURSOR [CONTAINS: ANGIOSTATIN]
Length = 812
Score = 7.1 bits (23), Expect = 7.6
Identities = 8/13 (61%), Positives = 11/13 (84%)
Query: 112 DEFINSIPPEEQT 124
pattern 120 ****
D+ +S+PPEEQT
Sbjct: 359 DQSDSSVPPEEQT 371
Significant alignments for pattern occurrence 3 at position 237
>sp|P49362|GCSB_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] B, MITOCHONDRIAL PRECURSOR
(GLYCINE DECARBOXYLASE B) (GLYCINE CLEAVAGE SYSTEM
P-PROTEIN B)
Length = 1034
Score = 9.5 bits (30), Expect = 1.4
Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%)
Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290
pattern 237 ****
NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + +
Sbjct: 80 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133
Query: 291 SLDHGILIVGYSAKNTIFR 309
D ++KN IF+
Sbjct: 134 MQD-------LASKNKIFK 145
>sp|P49361|GCSA_FLAPR GLYCINE DEHYDROGENASE [DECARBOXYLATING] A, MITOCHONDRIAL PRECURSOR
(GLYCINE DECARBOXYLASE A) (GLYCINE CLEAVAGE SYSTEM
P-PROTEIN A)
Length = 1037
Score = 9.5 bits (30), Expect = 1.4
Identities = 21/79 (26%), Positives = 39/79 (48%), Gaps = 13/79 (16%)
Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290
pattern 237 ****
NSA PEEQ K++ F P +++ I +T P +I D++++ + G+ + +
Sbjct: 83 NSAT--PEEQTKMAEFVGFPNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 136
Query: 291 SLDHGILIVGYSAKNTIFR 309
D ++KN IF+
Sbjct: 137 MQD-------LASKNKIFK 148
>sp|O49852|GCSP_FLATR GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR
(GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM
P-PROTEIN)
Length = 1034
Score = 7.8 bits (25), Expect = 4.8
Identities = 21/79 (26%), Positives = 38/79 (47%), Gaps = 13/79 (16%)
Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290
pattern 237 ****
NSA PEEQ K++ F +++ I +T P AI D++++ + G+ + +
Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKAIRLDSMKYSKFDEGLTESQMIAH 133
Query: 291 SLDHGILIVGYSAKNTIFR 309
D ++KN IF+
Sbjct: 134 MQD-------LASKNKIFK 145
>sp|P32767|PDR6_YEAST PLEIOTROPIC DRUG RESISTANCE REGULATORY PROTEIN 6
Length = 1081
Score = 7.4 bits (24), Expect = 6.0
Identities = 25/93 (26%), Positives = 37/93 (38%), Gaps = 17/93 (18%)
Query: 159 HFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI-IKNGGIQTESS 217
+F S+N+ +S L E M + E C L P ++I N I +S+
Sbjct: 642 NFTSKNEQEKISNDKL-----EVMVIKTVSTLCETCREELTPYLMHFISFLNTVIMPDSN 696
Query: 218 YPYTAETG--------TQCNFNSANIGPEEQAK 242
pattern 237 ****
+ T QC ++ GPEEQAK
Sbjct: 697 VSHFTRTKLVRSIGYVVQCQVSN---GPEEQAK 726
>sp|O49850|GCSP_FLAAN GLYCINE DEHYDROGENASE [DECARBOXYLATING], MITOCHONDRIAL PRECURSOR
(GLYCINE DECARBOXYLASE) (GLYCINE CLEAVAGE SYSTEM
P-PROTEIN)
Length = 1034
Score = 6.8 bits (22), Expect = 9.6
Identities = 20/79 (25%), Positives = 38/79 (47%), Gaps = 13/79 (16%)
Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290
pattern 237 ****
NSA PEEQ K++ F +++ I +T P +I D++++ + G+ + +
Sbjct: 80 NSAT--PEEQTKMAEFVGFSNLDSL----IDATVPKSIRLDSMKYSKFDEGLTESQMIAH 133
Query: 291 SLDHGILIVGYSAKNTIFR 309
D ++KN IF+
Sbjct: 134 MQD-------LASKNKIFK 145
Searching..................................................done
Results from round 2
Score E
Sequences producing significant alignments: (bits) Value
Sequences used in model and found again:
Sequences not found previously or not previously below threshold:
sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR 709 0.0
sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR 273 4e-73
sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RES... 270 2e-72
sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR 266 6e-71
sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR 252 6e-67
sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK C... 250 2e-66
sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR 238 1e-62
sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR 236 4e-62
sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1) 233 3e-61
sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE... 233 3e-61
sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR 231 1e-60
sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR 221 1e-57
sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEIN... 221 2e-57
sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH) 216 5e-56
sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR 215 1e-55
sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH) 214 2e-55
sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR 214 2e-55
sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN... 212 7e-55
sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 212 1e-54
sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 209 8e-54
sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH) 209 8e-54
sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR 208 1e-53
sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR 207 2e-53
sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE... 207 3e-53
sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR 206 4e-53
sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTE... 206 4e-53
sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR 206 5e-53
sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN) 204 3e-52
sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYS... 203 6e-52
sp|Q10991|CATL_SHEEP CATHEPSIN L 201 1e-51
sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR 201 2e-51
sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR 200 3e-51
sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V) 199 7e-51
sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH) 196 5e-50
sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR 196 5e-50
sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR 194 2e-49
sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR 193 4e-49
sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR 193 5e-49
sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II... 192 1e-48
sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPS... 192 1e-48
sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR 190 5e-48
sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR 188 2e-47
sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA... 187 2e-47
sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR 187 2e-47
sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23) 187 4e-47
sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR 186 5e-47
sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR 185 9e-47
sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEP... 185 1e-46
sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPA... 184 3e-46
sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR 183 3e-46
sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR 183 5e-46
sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN) 183 6e-46
sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR 182 8e-46
sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHE... 180 5e-45
sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR 178 2e-44
sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN) 177 3e-44
sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN) 176 6e-44
sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR 173 4e-43
sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI) 173 7e-43
sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR 171 3e-42
sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L 167 2e-41
sp|P25326|CATS_BOVIN CATHEPSIN S 165 1e-40
sp|P80884|ANAN_ANACO ANANAIN 161 2e-39
sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR 158 1e-38
sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE... 158 2e-38
sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR 152 1e-36
sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR 150 4e-36
sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR 150 6e-36
sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR 150 6e-36
sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE P... 149 9e-36
sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR 149 9e-36
sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR 145 1e-34
sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR 145 1e-34
sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR 143 5e-34
sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR ... 141 3e-33
sp|P14518|BROM_ANACO BROMELAIN, STEM 139 6e-33
sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR... 138 1e-32
sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR 129 1e-29
sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR... 121 3e-27
sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPP... 111 3e-24
sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 109 9e-24
sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN... 108 2e-23
sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR 108 3e-23
sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (D... 107 3e-23
sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I) 100 7e-21
sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II) 95 2e-19
sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PREC... 91 4e-18
sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PREC... 90 5e-18
sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13) 90 5e-18
sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR 89 2e-17
sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2) 87 4e-17
sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR 87 5e-17
sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP S... 86 9e-17
sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 85 2e-16
sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1) 85 2e-16
sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PREC... 85 2e-16
sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECUR... 85 3e-16
sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1) 85 3e-16
sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 80 9e-15
sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PREC... 78 2e-14
sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PREC... 78 4e-14
sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PREC... 73 7e-13
sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P1... 70 6e-12
sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III) 61 4e-09
sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV) 60 9e-09
sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 59 1e-08
sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I) 58 3e-08
sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II) 56 1e-07
sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L 52 2e-06
sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR 42 0.002
sp|P05689|CATX_BOVIN CATHEPSIN 40 0.006
sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR 39 0.019
sp|P23897|HSER_RAT HEAT-STABLE ENTEROTOXIN RECEPTOR PRECURSOR (G... 36 0.16
sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTEC... 35 0.22
sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 I... 32 1.9
sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5) 32 1.9
sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-D... 31 3.2
sp|Q02521|SPP2_YEAST SPLICEOSOME MATURATION PROTEIN SPP2 31 4.2
sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN 31 4.2
sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDRO... 30 5.5
sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 30 5.5
sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8 30 7.2
sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C 30 7.2
sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 30 9.4
sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (... 30 9.4
sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) 30 9.4
>sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR
Length = 343
Score = 709 bits (1811), Expect = 0.0
Identities = 343/351 (97%), Positives = 343/351 (97%), Gaps = 8/351 (2%)
Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60
MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE
Sbjct: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60
Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPP 120
ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP
Sbjct: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP- 119
Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180
TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE
Sbjct: 120 ---TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176
Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240
pattern 237 ****
CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG
Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG---- 232
Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 300
AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG
Sbjct: 233 AKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVG 292
Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII
Sbjct: 293 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343
>sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR
Length = 313
Score = 273 bits (691), Expect = 4e-73
Identities = 149/324 (45%), Positives = 194/324 (58%), Gaps = 26/324 (8%)
Query: 32 FQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLSSDE 87
F+ KF K Y S EE+ RF +FK+NL L A+ H+ + GV +F+DL+ E
Sbjct: 3 FKKKFGKVYGSIEEHYYRFSVFKANL-------LRAMRHQKMDPSARHGVTQFSDLTRSE 55
Query: 88 FKNYYLNNKEAI-FTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146
F+ +L K D A L + + PEE FDWR RGAVTPVKNQG CGSC
Sbjct: 56 FRRKHLGVKGGFKLPKDANQAPILPTQNL----PEE---FDWRDRGAVTPVKNQGSCGSC 108
Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206
WSFSTTG +EG HF++ KLVSLSEQ LVDCDHEC + E E +CD GCNGGL +A+ Y
Sbjct: 109 WSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHEC-DPEEEGSCDSGCNGGLMNSAFEYT 167
Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266
pattern 237 ****
+K GG+ E YPYT G C + + I A +SNF+++ NE +A ++ GPL
Sbjct: 168 LKTGGLMREKDYPYTGTDGGSCKLDRSKI----VASVSNFSVVSINEDQIAANLIKNGPL 223
Query: 267 AIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAK--NTIFRKNMPYWIVKNSWGAD 324
A+A +A Q YIGGV L+HG+L+VGY + + K PYWI+KNSWG
Sbjct: 224 AVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGES 283
Query: 325 WGEQGYIYLRRGKNTCGVSNFVST 348
WGE G+ + +G+N CGV + VST
Sbjct: 284 WGENGFYKICKGRNICGVDSLVST 307
>sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RESPONSIVE PROTEIN 15A)
Length = 363
Score = 270 bits (684), Expect = 2e-72
Identities = 144/327 (44%), Positives = 201/327 (61%), Gaps = 20/327 (6%)
Query: 26 QSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84
+ F F+ KF+K Y+ EE+ RF +FKSNL K + + N + G+ KF+DL+
Sbjct: 45 EHHFTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAK----LHQNRDPTAEHGITKFSDLT 100
Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCG 144
+ EF+ +L K+ + LP + PE+ FDWR +GAVTPVK+QG CG
Sbjct: 101 ASEFRRQFLGLKKRL---RLPAHAQKAPILPTTNLPED---FDWREKGAVTPVKDQGSCG 154
Query: 145 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204
SCW+FSTTG +EG H+++ KLVSLSEQ LVDCDH C + E +CD GCNGGL NA+
Sbjct: 155 SCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVC-DPEQAGSCDSGCNGGLMNNAFE 213
Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264
pattern 237 ****
Y++++GG+ E Y YT G+ C F+ + + A +SNF+++ +E +A +V G
Sbjct: 214 YLLESGGVVQEKDYAYTGRDGS-CKFDKSKV----VASVSNFSVVTLDEDQIAANLVKNG 268
Query: 265 PLAIAADAVEWQFYIGGV-FDIPCNPNSLDHGILIVGY--SAKNTIFRKNMPYWIVKNSW 321
PLA+A +A Q Y+ GV C + LDHG+L+VG+ A I K PYWI+KNSW
Sbjct: 269 PLAVAINAAWMQTYMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSW 328
Query: 322 GADWGEQGYIYLRRGKNTCGVSNFVST 348
G +WGEQGY + RG+N CGV + VST
Sbjct: 329 GQNWGEQGYYKICRGRNVCGVDSMVST 355
>sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR
Length = 368
Score = 266 bits (672), Expect = 6e-71
Identities = 156/367 (42%), Positives = 206/367 (55%), Gaps = 42/367 (11%)
Query: 6 LFVLAVFTVFVSSR---------------GIPPE---EQSQFLEFQDKFNKKY-SHEEYL 46
+FVL+ F V VSS G P+ + F F+ KF K Y S+EE+
Sbjct: 10 VFVLSFFIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHD 69
Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTK--FGVNKFADLSSDEFKNYYLNNKEAI-FTDD 103
RF +FK+NL + + K D GV +F+DL+ EF+ +L + D
Sbjct: 70 YRFSVFKANLRRARR------HQKLDPSATHGVTQFSDLTRSEFRKKHLGVRSGFKLPKD 123
Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163
A L E + PE+ FDWR GAVTPVKNQG CGSCWSFS TG +EG +F++
Sbjct: 124 ANKAPILPTENL----PED---FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176
Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223
KLVSLSEQ LVDCDHEC + E ++CD GCNGGL +A+ Y +K GG+ E YPYT +
Sbjct: 177 GKLVSLSEQQLVDCDHEC-DPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGK 235
Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVF 283
pattern 237 ****
G C + + I A +SNF++I +E +A +V GPLA+A +A Q YIGGV
Sbjct: 236 DGKTCKLDKSKI----VASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVS 291
Query: 284 DIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341
L+HG+L+VGY A K PYWI+KNSWG WGE G+ + +G+N CG
Sbjct: 292 CPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNICG 351
Query: 342 VSNFVST 348
V + VST
Sbjct: 352 VDSMVST 358
>sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR
Length = 371
Score = 252 bits (638), Expect = 6e-67
Identities = 138/332 (41%), Positives = 190/332 (56%), Gaps = 23/332 (6%)
Query: 26 QSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84
+S FL F +F K Y +E+ R +FK NL + L+ + GV KF+DL+
Sbjct: 45 ESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLL----DPSAEHGVTKFSDLT 100
Query: 85 SDEFKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQG 141
EF+ YL ++ A+ + A + +P + FDWR GAV PVKNQG
Sbjct: 101 PAEFRRTYLGLRKSRRALLRELGESAHEAPVLPTDGLPDD----FDWRDHGAVGPVKNQG 156
Query: 142 QCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPN 201
CGSCWSFS +G +EG H+++ KL LSEQ VDCDHEC E ++CD GCNGGL
Sbjct: 157 SCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSE-PDSCDSGCNGGLMTT 215
Query: 202 AYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIV 261
pattern 237 ****
A++Y+ K GG+++E YPYT G +C F+ + I A + NF+++ +E ++ ++
Sbjct: 216 AFSYLQKAGGLESEKDYPYTGSDG-KCKFDKSKI----VASVQNFSVVSVDEAQISANLI 270
Query: 262 STGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKN 319
GPLAI +A Q YIGGV LDHG+L+VGY A I K+ PYWI+KN
Sbjct: 271 KHGPLAIGINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKN 330
Query: 320 SWGADWGEQGYIYLRRG---KNTCGVSNFVST 348
SWG +WGE GY + RG +N CGV + VST
Sbjct: 331 SWGENWGENGYYKICRGSNVRNKCGVDSMVST 362
>sp|P04989|CYS2_DICDI CYSTEINE PROTEINASE 2 PRECURSOR (PRESTALK CATHEPSIN)
Length = 376
Score = 250 bits (633), Expect = 2e-66
Identities = 147/391 (37%), Positives = 213/391 (53%), Gaps = 63/391 (16%)
Query: 1 MKVILLFVLAVFTVFVSSRGIP-------PEEQSQFLEFQDKFNKKYSHEEYLERFEIFK 53
M++++ +L +F F + P + ++ F E+ KFN++YS E+ R+ IFK
Sbjct: 1 MRLLVFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSSSEFSNRYSIFK 60
Query: 54 SNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK-EAIFTDDLPVADYLDD 112
SN+ ++ N + T G+N FAD++++E++ YL + A + + L+
Sbjct: 61 SNMDYVDNWNS---KGDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNGYDGREVLNV 117
Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172
E + + P + DWRT+ AVTP+K+QGQCGSCWSFSTTG+ EG H + KLVSLSEQ
Sbjct: 118 EDLQTNPK----SIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQ 173
Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNS 232
NLVDC G E + GC+GGL NA++YIIKN GI TESSYPYTAETG+ C FN
Sbjct: 174 NLVDC-------SGPEE-NFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNK 225
Query: 233 ANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNP 289
pattern 237 ****
++IG A I + I + GP+++A DA +Q Y G++ P C+P
Sbjct: 226 SDIG----ATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSP 281
Query: 290 NSLDHGILIVGY--------------------------------SAKNTIFRKNMPYWIV 317
LDHG+L+VGY + +++ K YWIV
Sbjct: 282 TELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKANNYWIV 341
Query: 318 KNSWGADWGEQGYIYLRRG-KNTCGVSNFVS 347
KNSWG WG +GYI + + KN CG+++ S
Sbjct: 342 KNSWGTSWGIKGYILMSKDRKNNCGIASVSS 372
>sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR
Length = 344
Score = 238 bits (601), Expect = 1e-62
Identities = 139/370 (37%), Positives = 201/370 (53%), Gaps = 45/370 (12%)
Query: 1 MKVI-LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKI 59
MKV+ L VL V + + ++ F ++ K Y+ EE+ R+ IF +N+ +
Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFTANMDYV 60
Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119
++ N + ++T G+N FAD++++E++N YL K F + + NS
Sbjct: 61 QQWN----SKGSETVLGLNNFADITNEEYRNTYLGTK---FDASSLIGTQEEKVHTNSSA 113
Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179
+ DWR+ GAVTPVKNQGQCG CWSFSTTG+ EG HF S+ +LVSLSEQNL+DC
Sbjct: 114 ASK----DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCST 169
Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239
pattern 237 ***
E + GC+GGL A+ YII N GI TESSYPY AE G +C + S N G
Sbjct: 170 E----------NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG-KCEYKSENSG--- 215
Query: 240 QAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGI 296
pattern 240 *
A +S++ + V+ P+++A DA +Q Y G++ P C+ +LDHG+
Sbjct: 216 -ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGV 274
Query: 297 LIVGY--------------SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCG 341
L VGY S+ N + YWIVKNSWG WG +GYI + R + N CG
Sbjct: 275 LAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCG 334
Query: 342 VSNFVSTSII 351
+++ S ++
Sbjct: 335 IASSASFPVV 344
>sp|P14658|CYSP_TRYBB CYSTEINE PROTEINASE PRECURSOR
Length = 450
Score = 236 bits (597), Expect = 4e-62
Identities = 137/354 (38%), Positives = 193/354 (53%), Gaps = 34/354 (9%)
Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEE 61
V+L + +V + S + + +F F+ K+ K Y +E RF F+ N+ E+
Sbjct: 15 VLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENM---EQ 71
Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121
+ A + T FGV F+D++ +EF+ Y N A + +N
Sbjct: 72 AKIQAAANPYAT-FGVTPFSDMTREEFRARYRNGASYF-----AAAQKRLRKTVNVTTGR 125
Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181
A DWR +GAVTPVK QGQCGSCW+FST GN+EGQ ++ N LVSLSEQ LV CD
Sbjct: 126 APAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCD--- 182
Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETG--TQCNFNSANIGP 237
pattern 237 *
D GCNGGL NA+N+I+ + G + TE+SYPY + G QC N IG
Sbjct: 183 -------TIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIG- 234
Query: 238 EEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGIL 297
pattern 238 ***
A I++ +P++E +A Y+ GPLAIA DA + Y GG+ C LDHG+L
Sbjct: 235 ---AAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGIL-TSCTSKQLDHGVL 290
Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
+VGY+ + N PYWI+KNSW WGE GYI + +G N C ++ VS++++
Sbjct: 291 LVGYNDNS-----NPPYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339
>sp|Q26534|CATL_SCHMA CATHEPSIN L PRECURSOR (SMCL1)
Length = 319
Score = 233 bits (589), Expect = 3e-61
Identities = 128/334 (38%), Positives = 190/334 (56%), Gaps = 30/334 (8%)
Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80
+P ++++F+ K+ K+Y E RF IFKSN+ K + L + + +GV +
Sbjct: 12 LPGNVDEKYVQFKLKYRKQYHETEDEIRFNIFKSNILKAQ---LYQVFVRGSAIYGVTPY 68
Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140
+DL++DEF +L + + L E +N+IP FDWR +GAVT VKNQ
Sbjct: 69 SDLTTDEFARTHLTASWVVPSSRSNTPTSLGKE-VNNIPKN----FDWREKGAVTEVKNQ 123
Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQP 200
G CGSCW+FSTTGNVE Q F KL+SLSEQ LVDCD D+GCNGGL
Sbjct: 124 GMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCD----------GLDDGCNGGLPS 173
Query: 201 NAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYI 260
pattern 237 ****
NAY IIK GG+ E +YPY A+ +C+ + + I++ + ++ET +A ++
Sbjct: 174 NAYESIIKMGGLMLEDNYPYDAK-NEKCHLKTDGVA----VYINSSVNLTQDETELAAWL 228
Query: 261 VSTGPLAIAADAVEWQFYIGGV---FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIV 317
+++ +A+ QFY G+ + I C+ LDH +L+VGY + KN P+WIV
Sbjct: 229 YHNSTISVGMNALLLQFYQHGISHPWWIFCSKYLLDHAVLLVGYG----VSEKNEPFWIV 284
Query: 318 KNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
KNSWG +WGE GY + RG +CG++ ++++I
Sbjct: 285 KNSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318
>sp|P35591|CYS1_LEIPI CYSTEINE PROTEINASE 1 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE A-1)
Length = 354
Score = 233 bits (589), Expect = 3e-61
Identities = 144/355 (40%), Positives = 192/355 (53%), Gaps = 40/355 (11%)
Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52
LLF + V +FV G PP + + + F+ + K + + E RF F
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66
Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112
K N+ LN + D KFADL+ EF YLN + D+ +D
Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYA----RHLKDHKED 119
Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172
++ P + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQCNF 230
LV CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+
Sbjct: 180 MLVSCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPC 229
Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290
pattern 237 ****
+ E AKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + C
Sbjct: 230 HDEG---EVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL-CLAW 285
Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345
SL+HG+LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+
Sbjct: 286 SLNHGVLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335
>sp|P25775|LCPA_LEIME CYSTEINE PROTEINASE A PRECURSOR
Length = 354
Score = 231 bits (584), Expect = 1e-60
Identities = 143/355 (40%), Positives = 192/355 (53%), Gaps = 40/355 (11%)
Query: 5 LLFVLAVFTVFVSSRGI-------PPEEQ----SQFLEFQDKFNKKYSHE-EYLERFEIF 52
LLF + V +FV G PP + + + F+ + K + + E RF F
Sbjct: 7 LLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAF 66
Query: 53 KSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD 112
K N+ LN + D KFADL+ EF YLN + ++ +D
Sbjct: 67 KQNMQTAYFLNTQNPHAHYDVS---GKFADLTPQEFAKLYLNPDYYA----RHLKNHKED 119
Query: 113 EFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQ 172
++ P + DWR +GAVTPVKNQG CGSCW+FS GN+EGQ S + LVSLSEQ
Sbjct: 120 VHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQ 179
Query: 173 NLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQCNF 230
LV CD+ DEGCNGGL A N+I++ NG + TE+SYPYT+ GT+
Sbjct: 180 MLVSCDN----------IDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPC 229
Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 290
pattern 237 ****
+ E AKI+ F +P +E +A ++ GP+A+A DA WQ Y GGV + C
Sbjct: 230 HDEG---EVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSL-CLAW 285
Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345
SL+HG+LIVG++ KN PYWIVKNSWG+ WGE+GYI L G N C + N+
Sbjct: 286 SLNHGVLIVGFN-KNA----KPPYWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNY 335
>sp|P13277|CYS1_HOMAM DIGESTIVE CYSTEINE PROTEINASE 1 PRECURSOR
Length = 322
Score = 221 bits (558), Expect = 1e-57
Identities = 132/349 (37%), Positives = 184/349 (51%), Gaps = 41/349 (11%)
Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKI 59
MKV+ LF+ + + + EF+ KF +KY EE R +F NL I
Sbjct: 1 MKVVALFLFGLALAAANP---------SWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYI 51
Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119
EE N + +N+F+D+++++F K+ P A F ++
Sbjct: 52 EEFNKKYERGEVTYNLAINQFSDMTNEKFNAVMKGYKKG----PRPAA-----VFTSTDA 102
Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179
E T DWRT+GAVTPVK+QGQCGSCW+FSTTG +EGQHF+ +LVSLSEQ LVDC
Sbjct: 103 APESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDC-- 160
Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239
pattern 237 ***
G ++GCNGG A Y+ NGG+ TESSYPY A T C FNS IG
Sbjct: 161 -----AGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNT-CRFNSNTIG--- 211
Query: 240 QAKISNFTMIPK-NETVMAGYIVSTGPLAIAADAVEWQF---YIGGVFDIPCNPNSLDHG 295
pattern 240 *
A + + I + +E+ + GP+++A DA F Y G ++ C+ + LDH
Sbjct: 212 -ATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQLDHA 270
Query: 296 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVS 343
+L VGY ++ +W+VKNSW WGE GYI + R + N CG++
Sbjct: 271 VLAVGYGSEG-----GQDFWLVKNSWATSWGESGYIKMARNRNNNCGIA 314
>sp|P25779|CYSP_TRYCR CRUZIPAIN PRECURSOR (MAJOR CYSTEINE PROTEINASE) (CRUZAINE)
Length = 467
Score = 221 bits (557), Expect = 2e-57
Identities = 134/358 (37%), Positives = 189/358 (52%), Gaps = 38/358 (10%)
Query: 3 VILLFVLAVFTVFV--SSRGIPPEEQ--SQFLEFQDKFNKKY-SHEEYLERFEIFKSNLG 57
++L VL V V ++ + EE SQF EF+ K + Y S E R +F+ NL
Sbjct: 8 LLLAAVLVVMACLVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF 67
Query: 58 KIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117
+ L+ A H FGV F+DL+ +EF++ Y N + E + +
Sbjct: 68 -LARLHAAANPHAT---FGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVKVEVVGA 123
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 177
A DWR RGAVT VK+QGQCGSCW+FS GNVE Q F++ + L +LSEQ LV C
Sbjct: 124 -----PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSC 178
Query: 178 DHECMEYEGEEACDEGCNGGLQPNAYNYIIK--NGGIQTESSYPYTAETGTQ--CNFNSA 233
D D GC+GGL NA+ +I++ NG + TE SYPY + G C +
Sbjct: 179 D----------KTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGH 228
Query: 234 NIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLD 293
pattern 237 ****
+G A I+ +P++E +A ++ GP+A+A DA W Y GGV C LD
Sbjct: 229 TVG----ATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVM-TSCVSEQLD 283
Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
HG+L+VGY+ + PYWI+KNSW WGE+GYI + +G N C V S++++
Sbjct: 284 HGVLLVGYNDSAAV-----PYWIIKNSWTTQWGEEGYIRIAKGSNQCLVKEEASSAVV 336
>sp|P41721|CATV_NPVBM VIRAL CATHEPSIN (V-CATH)
Length = 323
Score = 216 bits (545), Expect = 5e-56
Identities = 131/349 (37%), Positives = 181/349 (51%), Gaps = 32/349 (9%)
Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63
+LF L V+ V S+ P + + F EF +FNK YS E E L RF+IF+ NL +I
Sbjct: 4 ILFYLFVYAVVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI---- 59
Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQ 123
I N K+ +NKF+DLS DE Y T + LD P +
Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPTQTQNFCKVILLDQP-----PGKGP 113
Query: 124 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183
FDWR VT VKNQG CG+CW+F+T G++E Q I N+L++LSEQ ++DCD
Sbjct: 114 LEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDF---- 169
Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243
pattern 237 ****
D GCNGGL A+ IIK GG+Q ES YPY A+ C NS + +
Sbjct: 170 ------VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVK--- 219
Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303
+ I E + + GP+ +A DA + Y G+ C + L+H +L+VGY
Sbjct: 220 DCYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQGIIKY-CFDSGLNHAVLLVGYGV 278
Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 351
+N N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I
Sbjct: 279 EN-----NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322
>sp|P25782|CYS2_HOMAM DIGESTIVE CYSTEINE PROTEINASE 2 PRECURSOR
Length = 323
Score = 215 bits (541), Expect = 1e-55
Identities = 132/357 (36%), Positives = 189/357 (51%), Gaps = 40/357 (11%)
Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKI 59
MKV +LF+ V S + F+ K+ ++Y EE R IF+ N I
Sbjct: 1 MKVAVLFLCGVALAAASP---------SWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYI 51
Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIP 119
EE N N + +NKF D++ +EF N I PV+ + +
Sbjct: 52 EEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGN---IPRRSAPVSVFYPKKETGP-- 106
Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179
+ T DWRT+GAVTPVK+QGQCGSCW+FSTTG++EGQHF+ L+SL+EQ LVDC
Sbjct: 107 --QATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDC-- 162
Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239
pattern 237 ***
+GCNGG +A++YI N GI TE++YPY A G+ C F+S ++
Sbjct: 163 ------SRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGS-CRFDSNSVA--- 212
Query: 240 QAKISNFTMIPK-NETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHG 295
pattern 240 *
A S T I +ET + + GP+++ DA +QFY GV+ P C+P+ LDH
Sbjct: 213 -ATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHA 271
Query: 296 ILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 351
+L VGY ++ +W+VKNSW WG+ GYI + R + N CG++ S ++
Sbjct: 272 VLAVGYGSEG-----GQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>sp|P41715|CATV_NPVCF VIRAL CATHEPSIN (V-CATH)
Length = 324
Score = 214 bits (540), Expect = 2e-55
Identities = 130/351 (37%), Positives = 188/351 (53%), Gaps = 33/351 (9%)
Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59
M I+L++L V ++ + + + F +F KFNK YS E E L RF+IF+ NL +I
Sbjct: 1 MNKIVLYLLVYGAVQCAAYDVL-KAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI 59
Query: 60 EELNLIAINHKADT-KFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSI 118
I NH T ++ +NKFADLS DE + Y + T + LD
Sbjct: 60 -----INKNHNDSTAQYEINKFADLSKDETISKYTGLSLPLQTQNFCEVVVLDRP----- 109
Query: 119 PPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 178
P + FDWR VT VKNQG CG+CW+F+T G++E Q I N+ ++LSEQ L+DCD
Sbjct: 110 PDKGPLEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQQLIDCD 169
Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE 238
pattern 237 **
D GC+GGL A+ ++ GGIQ ES YPY A G C N+A +
Sbjct: 170 F----------VDAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANNG-DCRANAAKFVVK 218
Query: 239 EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILI 298
pattern 239 **
+ T+ E + + S GP+ +A DA + Y G+ C + L+H +L+
Sbjct: 219 VKKCYRYITVF---EEKLKDLLRSVGPIPVAIDASDIVNYKRGIMKY-CANHGLNHAVLL 274
Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349
VGY+ +N +P+WI+KN+WGADWGEQGY +++ N CG+ N + +S
Sbjct: 275 VGYAVEN-----GVPFWILKNTWGADWGEQGYFRVQQNINACGIQNELPSS 320
>sp|P25784|CYS3_HOMAM DIGESTIVE CYSTEINE PROTEINASE 3 PRECURSOR
Length = 321
Score = 214 bits (539), Expect = 2e-55
Identities = 125/326 (38%), Positives = 184/326 (56%), Gaps = 47/326 (14%)
Query: 32 FQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF-- 88
F+ ++ +KY +E L R +F+ N IE+ N N + K +N+F D++++EF
Sbjct: 23 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 82
Query: 89 --KNYYLNNK---EAIFTDDL-PVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQ 142
K Y ++ +A+FT + P+A DWRT+ VTPVK+Q Q
Sbjct: 83 VMKGYKKGSRGEPKAVFTAEAGPMA----------------ADVDWRTKALVTPVKDQEQ 126
Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202
CGSCW+FS TG +EGQHF+ ++LVSLSEQ LVDC + ++GC GG +A
Sbjct: 127 CGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDC--------STDYGNDGCGGGWMTSA 178
Query: 203 YNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVS 262
pattern 237 ****
++YI NGGI TESSYPY AE C F++ +IG A + + E + +
Sbjct: 179 FDYIKDNGGIDTESSYPYEAE-DRSCRFDANSIG----AICTGSVEVQHTEEALQEAVSG 233
Query: 263 TGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKN 319
GP+++A DA +QFY GV ++ C+P LDHG+L VGY ++T YW+VKN
Sbjct: 234 VGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTEST-----KDYWLVKN 288
Query: 320 SWGADWGEQGYIYLRRGK-NTCGVSN 344
SWG+ WG+ GYI + R + N CG+++
Sbjct: 289 SWGSSWGDAGYIKMSRNRDNNCGIAS 314
>sp|P07154|CATL_RAT CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP) (CYCLIC
PROTEIN-2) (CP-2)
Length = 334
Score = 212 bits (535), Expect = 7e-55
Identities = 127/359 (35%), Positives = 195/359 (53%), Gaps = 39/359 (10%)
Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62
++LL VL + T + + +Q+ +++ + Y E R +++ N+ I+
Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLH 62
Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116
N N K +N F D++++EF+ + + K +F + L +
Sbjct: 63 NGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLML---------- 112
Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176
IP DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVD
Sbjct: 113 QIPK----TVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236
C H+ +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + +
Sbjct: 169 CSHD----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRA---- 215
Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLD 293
pattern 237 ****
A + F IP+ E + + + GP+++A DA QFY G++ P C+ LD
Sbjct: 216 EYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLD 275
Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVSTSII 351
HG+L+VGY + T K+ YW+VKNSWG +WG GYI + + +N CG++ S I+
Sbjct: 276 HGVLVVGYGYEGTDSNKD-KYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>sp|P06797|CATL_MOUSE CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP)
Length = 334
Score = 212 bits (533), Expect = 1e-54
Identities = 126/359 (35%), Positives = 198/359 (55%), Gaps = 39/359 (10%)
Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEEL 62
++LL VL + T + + +++ +++ + Y E R I++ N+ I+
Sbjct: 4 LLLLAVLCLGTALATPK-FDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLH 62
Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKN------YYLNNKEAIFTDDLPVADYLDDEFIN 116
N N + +N F D++++EF+ + + K +F + L +
Sbjct: 63 NGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLML---------- 112
Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176
IP + DWR +G VTPVKNQGQCGSCW+FS +G +EGQ F+ KL+SLSEQNLVD
Sbjct: 113 KIPK----SVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVD 168
Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236
C H +G ++GCNGGL A+ YI +NGG+ +E SYPY A+ G+ C + +
Sbjct: 169 CSHA----QG----NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGS-CKYRA---- 215
Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLD 293
pattern 237 ****
A + F IP+ E + + + GP+++A DA QFY G++ P C+ +LD
Sbjct: 216 EFAVANDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLD 275
Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVSTSII 351
HG+L+VGY + T KN YW+VKNSWG++WG +GYI + + + N CG++ S ++
Sbjct: 276 HGVLLVGYGYEGTDSNKN-KYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>sp|P12412|CYSP_VIGMU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE)
(SULFHYDRYL-ENDOPEPTIDASE) (SH-EP)
Length = 362
Score = 209 bits (526), Expect = 8e-54
Identities = 127/313 (40%), Positives = 179/313 (56%), Gaps = 35/313 (11%)
Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103
+RF +FK+N+ + N + +K +NKFAD+++ EF++ Y +K +F
Sbjct: 58 KRFNVFKANVMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHHKMFRGS 113
Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163
+ E + S+P + DWR +GAVT VK+QGQCGSCW+FST VEG + I
Sbjct: 114 QHGSGTFMYEKVGSVP----ASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKT 169
Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223
NKLVSLSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPYTA+
Sbjct: 170 NKLVSLSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQ 220
Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281
pattern 237 ****
GT C+ + N + I +P N+ V+ P+++A DA ++QFY G
Sbjct: 221 EGT-CDESKVN---DLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEG 276
Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----K 337
VF CN L+HG+ IVGY T+ N YWIV+NSWG +WGEQGYI ++R +
Sbjct: 277 VFTGDCN-TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEQGYIRMQRNISKKE 331
Query: 338 NTCGVSNFVSTSI 350
CG++ S I
Sbjct: 332 GLCGIAMMASYPI 344
>sp|P25783|CATV_NPVAC VIRAL CATHEPSIN (V-CATH)
Length = 323
Score = 209 bits (526), Expect = 8e-54
Identities = 129/349 (36%), Positives = 179/349 (50%), Gaps = 32/349 (9%)
Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELN 63
+LF L V+ V S+ + + F EF +FNK Y E E L RF+IF+ NL +I
Sbjct: 4 ILFYLFVYGVVNSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI---- 59
Query: 64 LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQ 123
I N K+ +NKF+DLS DE Y I T + LD P +
Sbjct: 60 -INKNQNDSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNFCKVIVLDQP-----PGKGP 113
Query: 124 TAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183
FDWR VT VKNQG CG+CW+F+T ++E Q I N+L++LSEQ ++DCD
Sbjct: 114 LEFDWRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDF---- 169
Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243
pattern 237 ****
D GCNGGL A+ IIK GG+Q ES YPY A+ C NS + +
Sbjct: 170 ------VDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEAD-NNNCRMNSNKFLVQVK--- 219
Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303
+ I E + + GP+ +A DA + Y G+ C + L+H +L+VGY
Sbjct: 220 DCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQGIIKY-CFNSGLNHAVLLVGYGV 278
Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN-FVSTSII 351
+N N+PYW KN+WG DWGE G+ +++ N CG+ N ST++I
Sbjct: 279 EN-----NIPYWTFKNTWGTDWGEDGFFRVQQNINACGMRNELASTAVI 322
>sp|P25975|CATL_BOVIN CATHEPSIN L PRECURSOR
Length = 334
Score = 208 bits (525), Expect = 1e-53
Identities = 126/351 (35%), Positives = 184/351 (51%), Gaps = 35/351 (9%)
Query: 7 FVLAVFTVFVSSRG--IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64
F L V + V+S + P + + +++ + Y E R +++ N I+ N
Sbjct: 5 FFLTVLCLGVASAAPKLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQ 64
Query: 65 IAINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121
K + +N F D++++EF+ N + N K + + +P
Sbjct: 65 EYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHK-------KGKLFHEPLLVDVPK- 116
Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181
+ DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVDC
Sbjct: 117 ---SVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA- 172
Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE-EQ 240
pattern 237 ** **
+G ++GCNGGL NA+ YI NGG+ +E SYPY A CN+ PE
Sbjct: 173 ---QG----NQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYK-----PECSA 220
Query: 241 AKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGIL 297
A + F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L
Sbjct: 221 ANDTGFVDIPQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVL 280
Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347
+VGY + T N +WIVKNSWG +WG GY+ + + +N CG++ S
Sbjct: 281 VVGYGFEGTDSNNN-KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAAS 330
>sp|Q40143|CYS3_LYCES CYSTEINE PROTEINASE 3 PRECURSOR
Length = 356
Score = 207 bits (522), Expect = 2e-53
Identities = 129/331 (38%), Positives = 181/331 (53%), Gaps = 40/331 (12%)
Query: 29 FLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87
F F + K+Y S EE +RFEIF NL I N +++K G+N+F DL+ DE
Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYK----LGINEFTDLTWDE 112
Query: 88 FKNYYLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCG 144
F+ + L N A +L + N + PE + DWR G V+PVK QG+CG
Sbjct: 113 FRKHKLGASQNCSATTKGNLKLT--------NVVLPETK---DWRKDGIVSPVKAQGKCG 161
Query: 145 SCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204
SCW+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+
Sbjct: 162 SCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNNF--------GCNGGLPSQAFE 213
Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264
pattern 237 ****
YI NGG+ TE +YPYT + G C F+ ANIG + + + N T+ + E A +V
Sbjct: 214 YIKFNGGLDTEEAYPYTGKNGI-CKFSQANIGVKVISSV-NITLGAEYELKYAVALVR-- 269
Query: 265 PLAIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320
P+++A + V+ ++ Y GV+ + P ++H +L VGY +N PYW++KNS
Sbjct: 270 PVSVAFEVVKGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVEN-----GTPYWLIKNS 324
Query: 321 WGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
WGADWGE GY + GKN CGV+ S I+
Sbjct: 325 WGADWGEDGYFKMEMGKNMCGVATCASYPIV 355
>sp|Q05094|CYS2_LEIPI CYSTEINE PROTEINASE 2 PRECURSOR (AMASTIGOTE CYSTEINE PROTEINASE A-2)
Length = 444
Score = 207 bits (521), Expect = 3e-53
Identities = 122/327 (37%), Positives = 177/327 (53%), Gaps = 39/327 (11%)
Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84
F EF+ + + Y E +R F+ NL + E H+A +FG+ KF DLS
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90
Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPPEEQTAFDWRTRGAVTPVKNQGQ 142
EF YLN A + ++++P A DWR +GAVTPVK+QG
Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD----AVDWREKGAVTPVKDQGA 146
Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202
CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A
Sbjct: 147 CGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQA 196
Query: 203 YNYIIK--NGGIQTESSYPYTAETG--TQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258
pattern 237 ****
++++++ NG + TE SYPY + G +C+ +S + A+I +I +E MA
Sbjct: 197 FDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSEEL--VVGAQIDGHVLIGSSEKAMAA 254
Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318
++ GP+AIA DA + Y GV C L+HG+L+VGY + PYW++K
Sbjct: 255 WLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIK 308
Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNF 345
NSWG DWGEQGY+ + G N C +S +
Sbjct: 309 NSWGGDWGEQGYVRVVMGVNACLLSEY 335
>sp|P36400|LCPB_LEIME CYSTEINE PROTEINASE B PRECURSOR
Length = 443
Score = 206 bits (520), Expect = 4e-53
Identities = 122/327 (37%), Positives = 177/327 (53%), Gaps = 40/327 (12%)
Query: 29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKA---DTKFGVNKFADLS 84
F EF+ + + Y E +R F+ NL + E H+A +FG+ KF DLS
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMRE-------HQARNPHAQFGITKFFDLS 90
Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEF--INSIPPEEQTAFDWRTRGAVTPVKNQGQ 142
EF YLN A + ++++P A DWR +GAVTPVK+QG
Sbjct: 91 EAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPD----AVDWREKGAVTPVKDQGA 146
Query: 143 CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNA 202
CGSCW+FS GN+EGQ +++ ++LVSLSEQ LV CD ++GC+GGL A
Sbjct: 147 CGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDD----------MNDGCDGGLMLQA 196
Query: 203 YNYIIK--NGGIQTESSYPYTAETG--TQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258
pattern 237 ****
++++++ NG + TE SYPY + G +C+ +S + A+I +I +E MA
Sbjct: 197 FDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSELV---VGAQIDGHVLIGSSEKAMAA 253
Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318
++ GP+AIA DA + Y GV C L+HG+L+VGY + PYW++K
Sbjct: 254 WLAKNGPIAIALDASSFMSYKSGVL-TACIGKQLNHGVLLVGYDMTGEV-----PYWVIK 307
Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNF 345
NSWG DWGEQGY+ + G N C +S +
Sbjct: 308 NSWGGDWGEQGYVRVVMGVNACLLSEY 334
>sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP)
Length = 333
Score = 206 bits (520), Expect = 4e-53
Identities = 125/349 (35%), Positives = 187/349 (52%), Gaps = 34/349 (9%)
Query: 8 VLAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65
+LA F + ++S + + ++Q+ +++ N+ Y E R +++ N+ IE N
Sbjct: 6 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQE 65
Query: 66 AINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122
K +N F D++S+EF+ N + N K F + E
Sbjct: 66 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-----------KGKVFQEPLFYEA 114
Query: 123 QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 182
+ DWR +G VTPVKNQGQCGSCW+FS TG +EGQ F +L+SLSEQNLVDC
Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDC----- 169
Query: 183 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAK 242
pattern 237 ****
G + +EGCNGGL A+ Y+ NGG+ +E SYPY A T C +N A
Sbjct: 170 --SGPQG-NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEA-TEESCKYNP----KYSVAN 221
Query: 243 ISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIV 299
+ F IPK E + + + GP+++A DA + FY G+ F+ C+ +DHG+L+V
Sbjct: 222 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVV 281
Query: 300 GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVS 347
GY ++T N YW+VKNSWG +WG GY+ + + +N CG+++ S
Sbjct: 282 GYGFEST-ESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAAS 329
>sp|Q28944|CATL_PIG CATHEPSIN L PRECURSOR
Length = 334
Score = 206 bits (519), Expect = 5e-53
Identities = 121/316 (38%), Positives = 167/316 (52%), Gaps = 33/316 (10%)
Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK---NYYLNNK 96
Y E R +++ N+ IE N K +N F D++++EF+ N + N K
Sbjct: 40 YGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQK 99
Query: 97 EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVE 156
F S+ E + DWR +G VT VKNQGQCGSCW+FS TG +E
Sbjct: 100 HK-----------KGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALE 148
Query: 157 GQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTES 216
GQ F KLVSLSEQNLVDC +G ++GCNGGL NA+ Y+ NGG+ TE
Sbjct: 149 GQMFRKTGKLVSLSEQNLVDCSRP----QG----NQGCNGGLMDNAFQYVKDNGGLDTEE 200
Query: 217 SYPYTAETGTQCNFNSANIGPE-EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--V 273
pattern 237 ** **
SYPY C + PE A + F IP+ E + + + GP+++A DA
Sbjct: 201 SYPYLGRETNSCTYK-----PECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHS 255
Query: 274 EWQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 332
+QFY G+ +D C+ LDHG+L+VGY + T + +WIVKNSWG +WG GY+
Sbjct: 256 SFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGT-DSNSSKFWIVKNSWGPEWGWNGYVK 314
Query: 333 LRRGKNT-CGVSNFVS 347
+ + +N CG+S S
Sbjct: 315 MAKDQNNHCGISTAAS 330
>sp|P00785|ACTN_ACTCH ACTINIDAIN PRECURSOR (ACTINIDIN)
Length = 380
Score = 204 bits (513), Expect = 3e-52
Identities = 124/334 (37%), Positives = 178/334 (53%), Gaps = 41/334 (12%)
Query: 24 EEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADT----KFGVN 78
E ++ + + K+ K Y S E+ RFEIFK L I+E H ADT K G+N
Sbjct: 37 EVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDE-------HNADTNRSYKVGLN 89
Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVK 138
+FADL+ +EF++ YL ++ V++ + F +P + DWR+ GAV +K
Sbjct: 90 QFADLTDEEFRSTYLGFTSG--SNKTKVSNRYEPRFGQVLP----SYVDWRSAGAVVDIK 143
Query: 139 NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198
+QG+CG CW+FS VEG + I L+SLSEQ L+DC G GCNGG
Sbjct: 144 SQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC--------GRTQNTRGCNGGY 195
Query: 199 QPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258
pattern 237 ****
+ + +II NGGI TE +YPYTA+ G +CN + N E+ I + +P N
Sbjct: 196 ITDGFQFIINNGGINTEENYPYTAQDG-ECNLDLQN---EKYVTIDTYENVPYNNEWALQ 251
Query: 259 YIVSTGPLAIAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWI 316
V+ P+++A DA ++ Y G+F PC ++DH + IVGY + I YWI
Sbjct: 252 TAVTYQPVSVALDAAGDAFKHYSSGIFTGPCG-TAIDHAVTIVGYGTEGGI-----DYWI 305
Query: 317 VKNSWGADWGEQGYIYLRR---GKNTCGVSNFVS 347
VKNSW WGE+GY+ + R G TCG++ S
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPS 339
>sp|P25803|CYSP_PHAVU VIGNAIN PRECURSOR (BEAN ENDOPEPTIDASE) (CYSTEINE PROTEINASE EP-C1)
Length = 362
Score = 203 bits (510), Expect = 6e-52
Identities = 125/313 (39%), Positives = 177/313 (55%), Gaps = 35/313 (11%)
Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---EAIFTDD 103
+RF +FK+NL + N + +K +NKFAD+++ EF++ Y +K +F
Sbjct: 58 KRFNVFKANLMHVHNTNKMDKPYKLK----LNKFADMTNHEFRSTYAGSKVNHPRMFRGT 113
Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163
E + S+PP + DWR +GAVT VK+QGQCGSCW+FST VEG + I
Sbjct: 114 PHENGAFMYEKVVSVPP----SVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKT 169
Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223
NKLV+LSEQ LVDCD E ++GCNGGL +A+ +I + GGI TES+YPY A+
Sbjct: 170 NKLVALSEQELVDCDKE---------ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQ 220
Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281
pattern 237 ****
GT C+ + N + I +P N+ V+ P+++A DA ++QFY G
Sbjct: 221 EGT-CDASKVN---DLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEG 276
Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----K 337
VF C+ L+HG+ IVGY T+ N YWIV+NSWG +WGE GYI ++R +
Sbjct: 277 VFTGDCS-TDLNHGVAIVGYG--TTVDGTN--YWIVRNSWGPEWGEHGYIRMQRNISKKE 331
Query: 338 NTCGVSNFVSTSI 350
CG++ S I
Sbjct: 332 GLCGIAMLPSYPI 344
>sp|Q10991|CATL_SHEEP CATHEPSIN L
Length = 217
Score = 201 bits (507), Expect = 1e-51
Identities = 105/226 (46%), Positives = 139/226 (61%), Gaps = 23/226 (10%)
Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186
DW +G VTPVKNQGQCGSCW+FS TG +EGQ F KLVSLSEQNLVD
Sbjct: 6 DWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD--------SS 57
Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE-EQAKISN 245
pattern 237 ** **
++GCNGGL NA+ YI +NGG+ +E SYPY A T T CN+ PE AK +
Sbjct: 58 RPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEA-TDTSCNYK-----PEYSAAKDTG 111
Query: 246 FTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVGYS 302
F IP+ E + + + GP+++A DA +QFY G+ +D C+ LDHG+L+VGY
Sbjct: 112 FVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYG 171
Query: 303 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347
+ T N +WIVKNSWG +WG +GY+ + + +N CG++ S
Sbjct: 172 FEGT----NNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAAS 213
>sp|P43156|CYSP_HEMSP THIOL PROTEASE SEN102 PRECURSOR
Length = 360
Score = 201 bits (506), Expect = 2e-51
Identities = 121/307 (39%), Positives = 161/307 (52%), Gaps = 28/307 (9%)
Query: 43 EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTD 102
+E RF +FK N+ I E N A K +NKF D+++ EF++ Y +K
Sbjct: 54 DEKNRRFNVFKENVKFIHEFNQ---KKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRS 110
Query: 103 DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 162
+ ++ + DWR +GAVT VK+QGQCGSCW+FST +VEG + I
Sbjct: 111 QRGIQKNTGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIK 170
Query: 163 QNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA 222
+LVSLSEQ LVDCD + +EGCNGGL A+ +I KN GI TE SYPY
Sbjct: 171 TGELVSLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAE 220
Query: 223 ETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIG 280
pattern 237 ****
+ GT C N N I +P N V+ P++++ +A +QFY
Sbjct: 221 QDGT-CASNLLN---SPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSE 276
Query: 281 GVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG---- 336
GVF C LDHG+ IVGY A R YWIVKNSWG +WGE GYI ++RG
Sbjct: 277 GVFTGRCG-TELDHGVAIVGYGAT----RDGTKYWIVKNSWGEEWGESGYIRMQRGISDK 331
Query: 337 KNTCGVS 343
+ CG++
Sbjct: 332 RGKCGIA 338
>sp|P54639|CYS4_DICDI CYSTEINE PROTEINASE 4 PRECURSOR
Length = 442
Score = 200 bits (504), Expect = 3e-51
Identities = 117/308 (37%), Positives = 169/308 (53%), Gaps = 32/308 (10%)
Query: 4 ILLFVLAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61
+L F+ + + S++ E Q + F + + YS EE+ R++IFKSN+ + +
Sbjct: 3 VLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQ 62
Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPE 121
N + +T G+N FAD+++ E++ YL F + ++E I S P
Sbjct: 63 WN----SKGGETVLGLNVFADITNQEYRTTYLGTP---FDGSALIGT--EEEKIFSTPAP 113
Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI---SQNKLVSLSEQNLVDCD 178
DWR +GAVTP+KNQGQCG CWSFSTTG+ EG HFI ++ LVSLSEQNL+DC
Sbjct: 114 ---TVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDC- 169
Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPE 238
pattern 237 **
+ + GC GGL + YII N GI TESSYPYTAE G +C F ++NIG
Sbjct: 170 -------SKSYGNNGCEGGLMTLGFEYIINNKGIDTESSYPYTAEDGKECKFKTSNIG-- 220
Query: 239 EQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHG 295
pattern 239 **
A+I ++ + + P+++A DA +Q Y G++ P C P LDHG
Sbjct: 221 --AQIVSYQNVTSGSEASLQSASNNAPVSVAIDASNESFQLYESGIYYEPACTPTQLDHG 278
Query: 296 ILIVGYSA 303
+L+VGY +
Sbjct: 279 VLVVGYGS 286
Score = 48.8 bits (114), Expect = 2e-05
Identities = 18/35 (51%), Positives = 24/35 (68%), Gaps = 1/35 (2%)
Query: 314 YWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347
YWIVKNSWG WG GYI++ + + N CG++ S
Sbjct: 401 YWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMAS 435
>sp|O60911|CATM_HUMAN CATHEPSIN L2 PRECURSOR (CATHEPSIN V)
Length = 334
Score = 199 bits (501), Expect = 7e-51
Identities = 127/357 (35%), Positives = 191/357 (52%), Gaps = 43/357 (12%)
Query: 5 LLFVLAVFTVFVSSRGIPPEEQS---QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61
L VLA F + ++S +P +Q+ ++ +++ + Y E R +++ N+ IE
Sbjct: 3 LSLVLAAFCLGIAS-AVPKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61
Query: 62 LNLIAINHKADTKFGVNKFADLSSDEFKNY---YLNNK---EAIFTDDLPVADYLDDEFI 115
N K +N F D++++EF+ + N K +F + L +LD
Sbjct: 62 HNGEYSQGKHGFTMAMNAFPDMTNEEFRQMMGCFRNQKFRKGKVFREPL----FLD---- 113
Query: 116 NSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLV 175
+P + DWR +G VTPVKNQ QCGSCW+FS TG +EGQ F KLVSLSEQNLV
Sbjct: 114 --LPK----SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167
Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANI 235
DC +G ++GCNGG A+ Y+ +NGG+ +E SYPY A C + N
Sbjct: 168 DCSRP----QG----NQGCNGGFMARAFQYVKENGGLDSEESYPYVA-VDEICKYRPEN- 217
Query: 236 GPEEQAKISNFTMI-PKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNS 291
pattern 237 ****
A + FT++ P E + + + GP+++A DA +QFY G+ F+ C+ +
Sbjct: 218 ---SVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKN 274
Query: 292 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-CGVSNFVS 347
LDHG+L+VGY + N YW+VKNSWG +WG GY+ + + KN CG++ S
Sbjct: 275 LDHGVLVVGYGFEGA-NSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAAS 330
>sp|O10364|CATV_NPVOP VIRAL CATHEPSIN (V-CATH)
Length = 324
Score = 196 bits (494), Expect = 5e-50
Identities = 116/322 (36%), Positives = 168/322 (52%), Gaps = 30/322 (9%)
Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87
F +F KFNK YS E E L RF+IF+ NL +I N + + ++ +NKF+DLS +E
Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKN----QNDSTAQYEINKFSDLSKEE 83
Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147
+ Y T + LD P FDWR VT VKNQG CG+CW
Sbjct: 84 AISKYTGLSLPHQTQNFCEVVILDRP-----PDRGPLEFDWRQFNKVTSVKNQGVCGACW 138
Query: 148 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 207
+F+T G++E Q I N+L++LSEQ +DCD + GC+GGL A+ +
Sbjct: 139 AFATLGSLESQFAIKYNRLINLSEQQFIDCDR----------VNAGCDGGLLHTAFESAM 188
Query: 208 KNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267
pattern 237 ****
+ GG+Q ES YPY G QC N ++ M E + + + GP+
Sbjct: 189 EMGGVQMESDYPYETANG-QCRINPNRFVVGVRSCRRYIVMF---EEKLKDLLRAVGPIP 244
Query: 268 IAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327
+A DA + Y G+ C + L+H +L+VGY+ +N N+PYWI+KN+WG DWGE
Sbjct: 245 VAIDASDIVNYRRGIMR-QCANHGLNHAVLLVGYAVEN-----NIPYWILKNTWGTDWGE 298
Query: 328 QGYIYLRRGKNTCGVSNFVSTS 349
GY +++ N CG+ N + +S
Sbjct: 299 DGYFRVQQNINACGIRNELVSS 320
>sp|P25777|ORYB_ORYSA ORYZAIN BETA CHAIN PRECURSOR
Length = 471
Score = 196 bits (494), Expect = 5e-50
Identities = 115/310 (37%), Positives = 166/310 (53%), Gaps = 31/310 (10%)
Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDD 103
E+ RF +F NL ++ N A + + G+N+FADL+++EF+ +L K A
Sbjct: 69 EHERRFLVFWDNLKFVDAHNARA-DEGGGFRLGMNRFADLTNEEFRATFLGAKVA--ERS 125
Query: 104 LPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ 163
+ + + +P + DWR +GAV PVKNQGQCGSCW+FS VE + +
Sbjct: 126 RAAGERYRHDGVEELPE----SVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVT 181
Query: 164 NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAE 223
++++LSEQ LV+C + GCNGGL +A+++IIKNGGI TE YPY A
Sbjct: 182 GEMITLSEQELVEC--------STNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAV 233
Query: 224 TGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGG 281
pattern 237 ****
G +C+ N N + I F +P+N+ V+ P+++A +A E+Q Y G
Sbjct: 234 DG-KCDINREN---AKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289
Query: 282 VFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT-- 339
VF C SLDHG++ VGY N YWIV+NSWG WGE GY+ + R N
Sbjct: 290 VFSGRCG-TSLDHGVVAVGYGTDN-----GKDYWIVRNSWGPKWGESGYVRMERNINVTT 343
Query: 340 --CGVSNFVS 347
CG++ S
Sbjct: 344 GKCGIAMMAS 353
>sp|P25776|ORYA_ORYSA ORYZAIN ALPHA CHAIN PRECURSOR
Length = 458
Score = 194 bits (488), Expect = 2e-49
Identities = 124/355 (34%), Positives = 183/355 (50%), Gaps = 43/355 (12%)
Query: 3 VILLFVLAVFTVFVSSRGIPPEEQSQ--FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKI 59
++LL LA + + S G EE+++ + E++ + K Y+ E R+ F+ NL I
Sbjct: 12 LLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYI 71
Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-----NKEAIFTDDLPVADYLDDEF 114
+E N A + G+N+FADL+++E+++ YL +E +D AD
Sbjct: 72 DEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAAD------ 125
Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174
N PE + DWRT+GAV +K+QG CGSCW+FS VE + I L+SLSEQ L
Sbjct: 126 -NEALPE---SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQEL 181
Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234
VDCD + +EGCNGGL A+++II NGGI TE YPY + +C+ N N
Sbjct: 182 VDCD---------TSYNEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGK-DERCDVNRKN 231
Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSL 292
pattern 237 ****
+ I ++ + N V P+++A +A +Q Y G+F C +L
Sbjct: 232 ---AKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCG-TAL 287
Query: 293 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 343
DHG+ VGY +N YWIV+NSWG WGE GY+ + R CG++
Sbjct: 288 DHGVAAVGYGTEN-----GKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIA 337
>sp|P43297|RD21_ARATH CYSTEINE PROTEINASE RD21A PRECURSOR
Length = 462
Score = 193 bits (486), Expect = 4e-49
Identities = 122/321 (38%), Positives = 168/321 (52%), Gaps = 43/321 (13%)
Query: 35 KFNKKYSHEEYLE---RFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNY 91
K K S +E RFEIFK NL ++E N ++++ G+ +FADL++DE+++
Sbjct: 56 KHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYR----LGLTRFADLTNDEYRSK 111
Query: 92 YLN---NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWS 148
YL K+ L + DE SI DWR +GAV VK+QG CGSCW+
Sbjct: 112 YLGAKMEKKGERRTSLRYEARVGDELPESI--------DWRKKGAVAEVKDQGGCGSCWA 163
Query: 149 FSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 208
FST G VEG + I L++LSEQ LVDCD + +EGCNGGL A+ +IIK
Sbjct: 164 FSTIGAVEGINQIVTGDLITLSEQELVDCD---------TSYNEGCNGGLMDYAFEFIIK 214
Query: 209 NGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAI 268
pattern 237 ****
NGGI T+ YPY GT C+ N + I ++ +P V+ P++I
Sbjct: 215 NGGIDTDKDYPYKGVDGT-CDQIRKN---AKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270
Query: 269 AADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326
A +A +Q Y G+FD C LDHG++ VGY +N YWIV+NSWG WG
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCG-TQLDHGVVAVGYGTEN-----GKDYWIVRNSWGKSWG 324
Query: 327 EQGYIYLRR----GKNTCGVS 343
E GY+ + R CG++
Sbjct: 325 ESGYLRMARNIASSSGKCGIA 345
>sp|Q10717|CYS2_MAIZE CYSTEINE PROTEINASE 2 PRECURSOR
Length = 360
Score = 193 bits (485), Expect = 5e-49
Identities = 115/329 (34%), Positives = 172/329 (51%), Gaps = 32/329 (9%)
Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86
+F F ++ K Y S E +RF IF +L + N ++++ G+N+FAD+S +
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYR----LGINRFADMSWE 113
Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146
EF+ L + A + + + DWR G V+PVKNQG CGSC
Sbjct: 114 EFRATRLGAAQNCS------ATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSC 167
Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206
W+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI
Sbjct: 168 WTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNF--------GCNGGLPSQAFEYI 219
Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266
pattern 237 ****
NGG+ TE SYPY G C F + N+G + + N T+ ++E A +V P+
Sbjct: 220 KYNGGLDTEESYPYQGVNGI-CKFKNENVGVKVLDSV-NITLGAEDELKDAVGLVR--PV 275
Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322
++A + + ++ Y GV+ P ++H +L VGY ++ +PYW++KNSWG
Sbjct: 276 SVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVED-----GVPYWLIKNSWG 330
Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
ADWG++GY + GKN CGV+ S I+
Sbjct: 331 ADWGDEGYFKMEMGKNMCGVATCASYPIV 359
>sp|P14080|PAP2_CARPA CHYMOPAPAIN PRECURSOR (PAPAYA PROTEINASE II) (PPII)
Length = 352
Score = 192 bits (482), Expect = 1e-48
Identities = 128/319 (40%), Positives = 169/319 (52%), Gaps = 43/319 (13%)
Query: 35 KFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91
K NK Y S +E + RFEIF+ NL I+E N K + + G+N FADLS+DEFK
Sbjct: 54 KHNKIYESIDEKIYRFEIFRDNLMYIDETN------KKNNSYWLGLNGFADLSNDEFKKK 107
Query: 92 YLNNKEAIFTDDLPVADYLDDE-FINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFS 150
Y+ +D ++ D+E F + DWR +GAVTPVKNQG CGSCW+FS
Sbjct: 108 YVG----FVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFS 163
Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 210
T VEG + I L+ LSEQ LVDCD GC GG Q + Y + N
Sbjct: 164 TIATVEGINKIVTGNLLELSEQELVDCDKH----------SYGCKGGYQTTSLQY-VANN 212
Query: 211 GIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKN-ETVMAGYIVSTGPLAIA 269
pattern 237 ****
G+ T YPY A+ +C A P + KI+ + +P N ET G + + PL++
Sbjct: 213 GVHTSKVYPYQAKQ-YKCR---ATDKPGPKVKITGYKRVPSNCETSFLGALANQ-PLSVL 267
Query: 270 ADA--VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327
+A +Q Y GVFD PC LDH + VGY + KN Y I+KNSWG +WGE
Sbjct: 268 VEAGGKPFQLYKSGVFDGPCG-TKLDHAVTAVGYGTSD---GKN--YIIIKNSWGPNWGE 321
Query: 328 QGYIYLRR----GKNTCGV 342
+GY+ L+R + TCGV
Sbjct: 322 KGYMRLKRQSGNSQGTCGV 340
>sp|P00786|CATH_RAT CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA)
Length = 333
Score = 192 bits (482), Expect = 1e-48
Identities = 121/333 (36%), Positives = 173/333 (51%), Gaps = 38/333 (11%)
Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84
E+ F + + K YS EY R ++F +N KI+ N NH K G+N+F+D+S
Sbjct: 29 EKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHN--QRNHTF--KMGLNQFSDMS 84
Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRG-AVTPVKNQGQC 143
E K+ YL ++ ++ P ++ DWR +G V+PVKNQG C
Sbjct: 85 FAEIKHKYLWSEPQN-------CSATKSNYLRGTGPYP-SSMDWRKKGNVVSPVKNQGAC 136
Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203
GSCW+FSTTG +E I+ K+++L+EQ LVDC + + GC GGL A+
Sbjct: 137 GSCWTFSTTGALESAVAIASGKMMTLAEQQLVDC--------AQNFNNHGCQGGLPSQAF 188
Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ-AKISNFTMIPKN-ETVMAGYIV 261
pattern 237 ****
YI+ N GI E SYPY + G QC FN PE+ A + N I N E M +
Sbjct: 189 EYILYNKGIMGEDSYPYIGKNG-QCKFN-----PEKAVAFVKNVVNITLNDEAAMVEAVA 242
Query: 262 STGPLAIAADAVE-WQFYIGGVFDI-PCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIV 317
P++ A + E + Y GV+ C+ P+ ++H +L VGY +N + YWIV
Sbjct: 243 LYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIV 297
Query: 318 KNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350
KNSWG++WG GY + RGKN CG++ S I
Sbjct: 298 KNSWGSNWGNNGYFLIERGKNMCGLAACASYPI 330
>sp|P25251|CYS4_BRANA CYSTEINE PROTEINASE COT44 PRECURSOR
Length = 328
Score = 190 bits (477), Expect = 5e-48
Identities = 114/304 (37%), Positives = 164/304 (53%), Gaps = 29/304 (9%)
Query: 47 ERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPV 106
ERF IFK NL I+ N N A K G+ FA+L++DE+++ YL + +
Sbjct: 27 ERFNIFKDNLRFIDLHN--ENNKNATYKLGLTIFANLTNDEYRSLYLGARTEPVRR-ITK 83
Query: 107 ADYLDDEFINSIPPEE-QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK 165
A ++ ++ ++ +E DWR +GAV +K+QG CGSCW+FST VEG + I +
Sbjct: 84 AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGE 143
Query: 166 LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETG 225
LVSLSEQ LVDCD ++ ++GCNGGL A+ +I+KNGG+ TE YPY G
Sbjct: 144 LVSLSEQELVDCD---------KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNG 194
Query: 226 TQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVF 283
pattern 237 ****
+CN N I + +P + VS P+++A DA +Q Y G+F
Sbjct: 195 -KCNSLLKN---SRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIF 250
Query: 284 DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNT 339
C N +DH ++ VGY ++N + YWIV+NSWG WGE GYI + R
Sbjct: 251 TGKCGTN-MDHAVVAVGYGSEN-----GVDYWIVRNSWGTRWGEDGYIRMERNVASKSGK 304
Query: 340 CGVS 343
CG++
Sbjct: 305 CGIA 308
>sp|P09668|CATH_HUMAN CATHEPSIN H PRECURSOR
Length = 335
Score = 188 bits (472), Expect = 2e-47
Identities = 123/332 (37%), Positives = 170/332 (51%), Gaps = 36/332 (10%)
Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84
E+ F + K K YS EEY R + F SN KI N N K +N+F+D+S
Sbjct: 31 EKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHN----NGNHTFKMALNQFSDMS 86
Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGA-VTPVKNQGQC 143
E K+ YL ++ ++YL PP + DWR +G V+PVKNQG C
Sbjct: 87 FAEIKHKYLWSEPQ--NCSATKSNYLRGT--GPYPP----SVDWRKKGNFVSPVKNQGAC 138
Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203
GSCW+FSTTG +E I+ K++SL+EQ LVDC + Y GC GGL A+
Sbjct: 139 GSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNY--------GCQGGLPSQAF 190
Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSAN-IGPEEQAKISNFTMIPKNETVMAGYIVS 262
pattern 237 ****
YI+ N GI E +YPY + G C F IG + ++N T+ +E M +
Sbjct: 191 EYILYNKGIMGEDTYPYQGKDG-YCKFQPGKAIGFVKD--VANITIY--DEEAMVEAVAL 245
Query: 263 TGPLAIAADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318
P++ A + ++ Y G++ C+ P+ ++H +L VGY KN I PYWIVK
Sbjct: 246 YNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGI-----PYWIVK 300
Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350
NSWG WG GY + RGKN CG++ S I
Sbjct: 301 NSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332
>sp|P10056|PAP3_CARPA CARICAIN PRECURSOR (PAPAYA PROTEINASE OMEGA) (PAPAYA PROTEINASE III)
(PPIII) (PAPAYA PEPTIDASE A)
Length = 348
Score = 187 bits (471), Expect = 2e-47
Identities = 121/319 (37%), Positives = 161/319 (49%), Gaps = 38/319 (11%)
Query: 37 NKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNYYL 93
NK Y + +E L RFEIFK NL I+E N K + + G+N+FADLS+DEF Y+
Sbjct: 56 NKFYENVDEKLYRFEIFKDNLNYIDETN------KKNNSYWLGLNEFADLSNDEFNEKYV 109
Query: 94 NNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 153
+ D + D+EFIN DWR +GAVTPV++QG CGSCW+FS
Sbjct: 110 GS-----LIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVA 164
Query: 154 NVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213
VEG + I KLV LSEQ LVDC+ GC GG P A Y+ KN GI
Sbjct: 165 TVEGINKIRTGKLVELSEQELVDCERR----------SHGCKGGYPPYALEYVAKN-GIH 213
Query: 214 TESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV 273
pattern 237 ****
S YPY A+ GT C GP K S + N ++ P+++ ++
Sbjct: 214 LRSKYPYKAKQGT-CRAKQVG-GP--IVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESK 269
Query: 274 --EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331
+Q Y GG+F+ PC +DH + VGY Y ++KNSWG WGE+GYI
Sbjct: 270 GRPFQLYKGGIFEGPCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGTAWGEKGYI 323
Query: 332 YLRRGK-NTCGVSNFVSTS 349
++R N+ GV +S
Sbjct: 324 RIKRAPGNSPGVCGLYKSS 342
>sp|P25778|ORYC_ORYSA ORYZAIN GAMMA CHAIN PRECURSOR
Length = 362
Score = 187 bits (471), Expect = 2e-47
Identities = 112/329 (34%), Positives = 170/329 (51%), Gaps = 33/329 (10%)
Query: 28 QFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86
+F F + K+Y E RF IF +L + N + ++ G+N+FAD+S +
Sbjct: 61 RFARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRGLPYR----LGINRFADMSWE 116
Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146
EF+ L + A + + P +T DWR G V+PVK+QG CGSC
Sbjct: 117 EFQASRLGAAQNCS------ATLAGNHRMRDAPALPETK-DWREDGIVSPVKDQGHCGSC 169
Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206
W FSTTG++E ++ + VSLSEQ L DC + GC+GGL A+ YI
Sbjct: 170 WPFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNF--------GCSGGLPSQAFEYI 221
Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266
pattern 237 ****
NGG+ TE +YPYT G C++ N G + + N T++ ++E A +V P+
Sbjct: 222 KYNGGLDTEEAYPYTGVNGI-CHYKPENAGVKVLDSV-NITLVAEDELKNAVGLVR--PV 277
Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322
++A + ++ Y GV+ +P ++H +L VGY +N +PYW++KNSWG
Sbjct: 278 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVEN-----GVPYWLIKNSWG 332
Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
ADWG+ GY + GKN CG++ S I+
Sbjct: 333 ADWGDNGYFTMEMGKNMCGIATCASYPIV 361
>sp|P15242|TES1_RAT TESTIN 1/2 PRECURSOR (CMB-22/CMB-23)
Length = 333
Score = 187 bits (469), Expect = 4e-47
Identities = 115/356 (32%), Positives = 184/356 (51%), Gaps = 30/356 (8%)
Query: 3 VILLFVLAVFTVFVSSRGIPPEEQS--QFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60
+I + LA+ + V S P+ ++ E++ K K Y+ E + +++ N IE
Sbjct: 1 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTYNMNEERLKRAVWEKNFKMIE 60
Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN-NKEAIFTDDLPVADYLDDEFINSIP 119
N + + D +N F DL++ EF ++ I + + D +F+ +P
Sbjct: 61 LHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTGFQRQKIKKTHI----FQDHQFLY-VP 115
Query: 120 PEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 179
DWR G VTPVKNQG C S W+FS TG++EGQ F +L+ LSEQNL+DC
Sbjct: 116 KR----VDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMG 171
Query: 180 ECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEE 239
pattern 237 ***
+ + GC+GG A+ Y+ NGG+ TE SYPY + G +C +++ N
Sbjct: 172 SNVTH--------GCSGGFMQYAFQYVKDNGGLATEESYPYRGQ-GRECRYHAEN----S 218
Query: 240 QAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIP-CNPNSLDHGI 296
pattern 240 *
A + +F IP +E + + GP+++A DA +QFY G++ P C L+H +
Sbjct: 219 AANVRDFVQIPGSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAV 278
Query: 297 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 351
L+VGY + N +W+VKNSWG +WG +GY+ L + N CG++ + + I+
Sbjct: 279 LVVGYGFEGEESDGN-SFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPIV 333
>sp|O46427|CATH_PIG CATHEPSIN H PRECURSOR
Length = 335
Score = 186 bits (468), Expect = 5e-47
Identities = 124/343 (36%), Positives = 176/343 (51%), Gaps = 42/343 (12%)
Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFG 76
S+ + E+ F + + KKYS EEY R ++F SN KI N A NH K G
Sbjct: 23 SNLAVSSFEKLHFKSWMVQHQKKYSLEEYHHRLQVFVSNWRKINAHN--AGNHTF--KLG 78
Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGA-VT 135
+N+F+D+S DE ++ YL ++ +YL PP + DWR +G V+
Sbjct: 79 LNQFSDMSFDEIRHKYLWSEPQ--NCSATKGNYLRGT--GPYPP----SMDWRKKGNFVS 130
Query: 136 PVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCN 195
PVKNQG CGSCW+FSTTG +E I+ K++SL+EQ LVDC + + GC
Sbjct: 131 PVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDC--------AQNFNNHGCQ 182
Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ----AKISNFTMIPK 251
pattern 237 ****
GGL A+ YI N GI E +YPY + C F P++ ++N TM
Sbjct: 183 GGLPSQAFEYIRYNKGIMGEDTYPYKGQ-DDHCKFQ-----PDKAIAFVKDVANITM--N 234
Query: 252 NETVMAGYIVSTGPLAIAADAV-EWQFYIGGVF-DIPCN--PNSLDHGILIVGYSAKNTI 307
+E M + P++ A + ++ Y G++ C+ P+ ++H +L VGY +N I
Sbjct: 235 DEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGI 294
Query: 308 FRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350
PYWIVKNSWG WG GY + RGKN CG++ S I
Sbjct: 295 -----PYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332
>sp|P05167|ALEU_HORVU THIOL PROTEASE ALEURAIN PRECURSOR
Length = 362
Score = 185 bits (466), Expect = 9e-47
Identities = 111/329 (33%), Positives = 169/329 (50%), Gaps = 33/329 (10%)
Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSD 86
+F F ++ K Y S E RF IF +L ++ N + ++ G+N+F+D+S +
Sbjct: 60 RFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKGLPYR----LGINRFSDMSWE 115
Query: 87 EFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSC 146
EF+ L + A + + +T DWR G V+PVKNQ CGSC
Sbjct: 116 EFQATRLGAAQTCS------ATLAGNHLMRDAAALPETK-DWREDGIVSPVKNQAHCGSC 168
Query: 147 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 206
W+FSTTG +E + + K +SLSEQ LVDC + GCNGGL A+ YI
Sbjct: 169 WTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNF--------GCNGGLPSQAFEYI 220
Query: 207 IKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266
pattern 237 ****
NGGI TE SYPY G C++ + N + + N T+ ++E A +V P+
Sbjct: 221 KYNGGIDTEESYPYKGVNGV-CHYKAENAAVQVLDSV-NITLNAEDELKNAVGLVR--PV 276
Query: 267 AIAADAVE-WQFYIGGVF---DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 322
++A ++ ++ Y GV+ P+ ++H +L VGY +N +PYW++KNSWG
Sbjct: 277 SVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVEN-----GVPYWLIKNSWG 331
Query: 323 ADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
ADWG+ GY + GKN C ++ S ++
Sbjct: 332 ADWGDNGYFKMEMGKNMCAIATCASYPVV 360
>sp|P43235|CATK_HUMAN CATHEPSIN K PRECURSOR (CATHEPSIN O) (CATHEPSIN X) (CATHEPSIN O2)
Length = 329
Score = 185 bits (465), Expect = 1e-46
Identities = 123/350 (35%), Positives = 185/350 (52%), Gaps = 39/350 (11%)
Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65
L V + V S + PEE + + ++ K+Y+++ + + R I++ NL I NL
Sbjct: 4 LKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLE 63
Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTA 125
A + +N D++S+E K +P++ ++ + IP E A
Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLK-------VPLSHSRSNDTLY-IPEWEGRA 115
Query: 126 ---FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECM 182
D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E
Sbjct: 116 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-- 173
Query: 183 EYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAK 242
pattern 237 ****
++GC GG NA+ Y+ KN GI +E +YPY + C +N + AK
Sbjct: 174 --------NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQE-ESCMYNPTG----KAAK 220
Query: 243 ISNFTMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILI 298
+ IP+ NE + + GP+++A DA +QFY GV +D CN ++L+H +L
Sbjct: 221 CRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLA 280
Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347
VGY +K +WI+KNSWG +WG +GYI + R K N CG++N S
Sbjct: 281 VGYG-----IQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLAS 325
>sp|P05994|PAP4_CARPA PAPAYA PROTEINASE IV PRECURSOR (PPIV) (PAPAYA PEPTIDASE B) (GLYCYL
ENDOPEPTIDASE)
Length = 348
Score = 184 bits (462), Expect = 3e-46
Identities = 116/315 (36%), Positives = 162/315 (50%), Gaps = 37/315 (11%)
Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYL 93
K NK Y + +E L RFEIFK NL I+E N + + G+N+F+DLS+DEFK Y+
Sbjct: 54 KHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYW----LGLNEFSDLSNDEFKEKYV 109
Query: 94 NNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTG 153
+ +T+ D+EF+N + + DWR +GAVTPVK+QG C SCW+FST
Sbjct: 110 GSLPEDYTNQP-----YDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVA 164
Query: 154 NVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213
VEG + I LV LSEQ LVDCD + GCN G Q + Y+ +N GI
Sbjct: 165 TVEGINKIKTGNLVELSEQELVDCDKQ----------SYGCNRGYQSTSLQYVAQN-GIH 213
Query: 214 TESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV 273
pattern 237 ****
+ YPY A+ T C N GP + K + + N ++ P+++ ++
Sbjct: 214 LRAKYPYIAKQQT-CRANQVG-GP--KVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESA 269
Query: 274 --EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331
++Q Y GG+F+ C +DH + VGY Y ++KNSWG WGE GYI
Sbjct: 270 GRDFQNYKGGIFEGSCG-TKVDHAVTAVGYGKSG-----GKGYILIKNSWGPGWGENGYI 323
Query: 332 YLRRGK----NTCGV 342
+RR CGV
Sbjct: 324 RIRRASGNSPGVCGV 338
>sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 2 PRECURSOR
Length = 373
Score = 183 bits (461), Expect = 3e-46
Identities = 125/349 (35%), Positives = 171/349 (48%), Gaps = 40/349 (11%)
Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58
VLAV V + S IP E++ E +Q + H E RF FKSN
Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75
Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLP-VADYLDDEF- 114
I + N + D + + N+F D+ EF+ ++ + P V ++
Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRRDTPSKPPSVPGFMYAALN 130
Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174
++ +PP + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLSEQ L
Sbjct: 131 VSDLPP----SVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQEL 186
Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234
+DCD A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN A
Sbjct: 187 IDCD---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNVARAA 236
Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNPNSL 292
pattern 237 ****
I +P N V+ P+++A +A + FY GVF C L
Sbjct: 237 QNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECG-TEL 295
Query: 293 DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341
DHG+ +VGY + YW VKNSWG WGEQGYI + + G
Sbjct: 296 DHGVAVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340
>sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR
Length = 371
Score = 183 bits (460), Expect = 5e-46
Identities = 126/353 (35%), Positives = 170/353 (47%), Gaps = 48/353 (13%)
Query: 8 VLAVFTVFVSSRGIPPEEQSQFLE---------FQDKFNKKYSHEEYLERFEIFKSNLGK 58
VLAV V + S IP E++ E +Q + H E RF FKSN
Sbjct: 17 VLAVAAVELCS-AIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHF 75
Query: 59 IEELNLIAINHKADTKFGV--NKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF-- 114
I + N + D + + N+F D+ EF+ ++ + D P F
Sbjct: 76 IH-----SHNKRGDHPYRLHLNRFGDMDQAEFRATFVGDLRR----DTPAKPPSVPGFMY 126
Query: 115 ----INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 170
++ +PP + DWR +GAVT VK+QG+CGSCW+FST +VEG + I LVSLS
Sbjct: 127 AALNVSDLPP----SVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLS 182
Query: 171 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF 230
EQ L+DCD A ++GC GGL NA+ YI NGG+ TE++YPY A GT CN
Sbjct: 183 EQELIDCD---------TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGT-CNV 232
Query: 231 NSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCN 288
pattern 237 ****
A I +P N V+ P+++A +A + FY GVF C
Sbjct: 233 ARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCG 292
Query: 289 PNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341
LDHG+ +VGY + YW VKNSWG WGEQGYI + + G
Sbjct: 293 -TELDHGVAVVGYG----VAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASG 340
>sp|P43236|CATK_RABIT CATHEPSIN K PRECURSOR (OC-2 PROTEIN)
Length = 329
Score = 183 bits (459), Expect = 6e-46
Identities = 119/348 (34%), Positives = 181/348 (51%), Gaps = 35/348 (10%)
Query: 9 LAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65
L V + V S + PEE +Q+ ++ ++K+Y+ + + + R I++ NL I NL
Sbjct: 4 LKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63
Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDE-FINSIPPEEQT 124
A + +N D++S+E K P + +D +I
Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVP------PSRSHSNDTLYIPDWEGRTPD 117
Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184
+ D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E
Sbjct: 118 SIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---- 173
Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244
pattern 237 ****
+ GC GG NA+ Y+ +N GI +E +YPY + C +N + AK
Sbjct: 174 ------NYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQ-DESCMYNPTG----KAAKCR 222
Query: 245 NFTMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGILIVG 300
+ IP+ NE + + GP+++A DA +QFY GV +D C+ ++++H +L VG
Sbjct: 223 GYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVG 282
Query: 301 YSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347
Y +K +WI+KNSWG WG +GYI + R K N CG++N S
Sbjct: 283 YG-----IQKGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLAS 325
>sp|P22895|P34_SOYBN P34 PROBABLE THIOL PROTEASE PRECURSOR
Length = 379
Score = 182 bits (458), Expect = 8e-46
Identities = 110/322 (34%), Positives = 173/322 (53%), Gaps = 38/322 (11%)
Query: 40 YSHEEYLERFEIFKSNLGKIEELNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLNNKE 97
++HEE +R EIFK+N I ++N N K+ + G+NKFAD++ EF YL +
Sbjct: 56 HNHEEEAKRLEIFKNNSNYIRDMNA---NRKSPHSHRLGLNKFADITPQEFSKKYLQAPK 112
Query: 98 AIFTDDLPVAD--YLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155
+ + + +A+ +++ PP ++DWR +G +T VK QG CG W+FS TG +
Sbjct: 113 DV-SQQIKMANKKMKKEQYSCDHPP---ASWDWRKKGVITQVKYQGGCGRGWAFSATGAI 168
Query: 156 EGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTE 215
E H I+ LVSLSEQ LVDC E EG G Q ++ +++++GGI T+
Sbjct: 169 EAAHAIATGDLVSLSEQELVDCVEE----------SEGSYNGWQYQSFEWVLEHGGIATD 218
Query: 216 SSYPYTAETGTQCNFN----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 271
pattern 237 ****
YPY A+ G +C N I E +S+ + + E I+ P++++ D
Sbjct: 219 DDYPYRAKEG-RCKANKIQDKVTIDGYETLIMSDESTESETEQAFLSAILEQ-PISVSID 276
Query: 272 AVEWQFYIGGVFDIP--CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQG 329
A ++ Y GG++D +P ++H +L+VGY + + + YWI KNSWG DWGE G
Sbjct: 277 AKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSAD-----GVDYWIAKNSWGFDWGEDG 331
Query: 330 YIYLRRGK----NTCGVSNFVS 347
YI+++R CG++ F S
Sbjct: 332 YIWIQRNTGNLLGVCGMNYFAS 353
>sp|P49935|CATH_MOUSE CATHEPSIN H PRECURSOR (CATHEPSIN B3) (CATHEPSIN BA)
Length = 333
Score = 180 bits (451), Expect = 5e-45
Identities = 115/332 (34%), Positives = 166/332 (49%), Gaps = 36/332 (10%)
Query: 25 EQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLS 84
E+ F + + K YS EY R ++F +N KI+ N NH K +N+F+D+S
Sbjct: 29 EKFHFKSWMKQHQKTYSSVEYNHRLQMFANNWRKIQAHN--QRNHTF--KMALNQFSDMS 84
Query: 85 SDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRG-AVTPVKNQGQC 143
E K+ +L ++ ++ P ++ DWR +G V+PVKNQG C
Sbjct: 85 FAEIKHKFLWSEPQN-------CSATKSNYLRGTGPYP-SSMDWRKKGNVVSPVKNQGAC 136
Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203
SCW+FSTTG +E I+ K++SL+EQ LVDC + + GC GGL A+
Sbjct: 137 ASCWTFSTTGALESAVAIASGKMLSLAEQQLVDC--------AQAFNNHGCKGGLPSQAF 188
Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKN-ETVMAGYIVS 262
pattern 237 ****
YI+ N GI E SYPY + + C FN + A + N I N E M +
Sbjct: 189 EYILYNKGIMEEDSYPYIGK-DSSCRFNP----QKAVAFVKNVVNITLNDEAAMVEAVAL 243
Query: 263 TGPLAIAADAVE-WQFYIGGVFDIPC---NPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 318
P++ A + E + Y GV+ P+ ++H +L VGY +N + YWIVK
Sbjct: 244 YNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLL-----YWIVK 298
Query: 319 NSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350
NSWG+ WGE GY + RGKN CG++ S I
Sbjct: 299 NSWGSQWGENGYFLIERGKNMCGLAACASYPI 330
>sp|P55097|CATK_MOUSE CATHEPSIN K PRECURSOR
Length = 329
Score = 178 bits (447), Expect = 2e-44
Identities = 117/352 (33%), Positives = 182/352 (51%), Gaps = 43/352 (12%)
Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLI 65
L V + + S + PEE +Q+ ++ K+Y+ + + + R I++ NL +I NL
Sbjct: 4 LKVLLLPMVSFALSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLE 63
Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDD-----EFINSIPP 120
A + +N D++S+E + P Y +D E+ +P
Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIP------PSRSYSNDTLYTPEWEGRVPD 117
Query: 121 EEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180
+ D+R +G VTPVKNQGQCGSCW+FS+ G +EGQ KL++LS QNLVDC E
Sbjct: 118 ----SIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE 173
Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240
pattern 237 ****
+ GC GG A+ Y+ +NGGI +E ++PY + C +N+ +
Sbjct: 174 ----------NYGCGGGYMTTAFQYVQQNGGIDSEDAFPYVGQ-DESCMYNAT----AKA 218
Query: 241 AKISNFTMIP-KNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV-FDIPCNPNSLDHGI 296
AK + IP NE + + GP++++ DA +QFY GV +D C+ ++++H +
Sbjct: 219 AKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAV 278
Query: 297 LIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347
L+VGY +K +WI+KNSWG WG +GY L R K N CG++N S
Sbjct: 279 LVVGYGT-----QKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMAS 325
>sp|P56202|CATW_HUMAN CATHEPSIN W PRECURSOR (LYMPHOPAIN)
Length = 376
Score = 177 bits (445), Expect = 3e-44
Identities = 112/351 (31%), Positives = 171/351 (47%), Gaps = 47/351 (13%)
Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80
P E + F FQ +FN+ Y S EE+ R +IF NL + + L + +FGV F
Sbjct: 35 PLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLG---TAEFGVTPF 91
Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAF--DWR-TRGAVTPV 137
+DL+ +EF Y + A + I S PEE F DWR GA++P+
Sbjct: 92 SDLTEEEFGQLYGYRRAAGGVPSM-------GREIRSEEPEESVPFSCDWRKVAGAISPI 144
Query: 138 KNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGG 197
K+Q C CW+ + GN+E IS V +S L+DC C +GC+GG
Sbjct: 145 KDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVHELLDCGR----------CGDGCHGG 194
Query: 198 LQPNAYNYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGPEEQAKISNFTMIPKNETVM 256
pattern 237 ****
+A+ ++ N G+ +E YP+ + +C+ ++ A I +F M+ NE +
Sbjct: 195 FVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKY----QKVAWIQDFIMLQNNEHRI 250
Query: 257 AGYIVSTGPLAIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSA--------KN 305
A Y+ + GP+ + + Q Y GV C+P +DH +L+VG+ +
Sbjct: 251 AQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIWAE 310
Query: 306 TIFRKNMP-------YWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349
T+ ++ P YWI+KNSWGA WGE+GY L RG NTCG++ F T+
Sbjct: 311 TVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGITKFPLTA 361
>sp|P56203|CATW_MOUSE CATHEPSIN W PRECURSOR (LYMPHOPAIN)
Length = 371
Score = 176 bits (442), Expect = 6e-44
Identities = 110/346 (31%), Positives = 166/346 (47%), Gaps = 40/346 (11%)
Query: 22 PPEEQSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKF 80
P E + F FQ +FN+ Y + EY R IF NL + + L + +FG F
Sbjct: 33 PLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLG---TAEFGETPF 89
Query: 81 ADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWR-TRGAVTPVKN 139
+DL+ +EF Y + T ++ + + S+P DWR + ++ VKN
Sbjct: 90 SDLTEEEFGQLYGQERSPERTPNM-TKKVESNTWGESVP----RTCDWRKAKNIISSVKN 144
Query: 140 QGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQ 199
QG C CW+ + N++ I + V +S Q L+DC E C GCNGG
Sbjct: 145 QGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQELLDC----------ERCGNGCNGGFV 194
Query: 200 PNAYNYIIKNGGIQTESSYPYTAETGT-QCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258
pattern 237 ****
+AY ++ N G+ +E YP+ + +C ++ A I +FTM+ NE +A
Sbjct: 195 WDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKY----KKVAWIQDFTMLSNNEQAIAH 250
Query: 259 YIVSTGPLAIAADAVEWQFYIGGVFDIP---CNPNSLDHGILIVGYSAKN------TIF- 308
Y+ GP+ + + Q Y GV C+P +DH +L+VG+ K T+
Sbjct: 251 YLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKKKEGMQTGTVLS 310
Query: 309 -----RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTS 349
R + PYWI+KNSWGA WGE+GY L RG NTCGV+ + T+
Sbjct: 311 HSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTA 356
>sp|P43234|CATO_HUMAN CATHEPSIN O PRECURSOR
Length = 321
Score = 173 bits (435), Expect = 4e-43
Identities = 100/304 (32%), Positives = 152/304 (49%), Gaps = 30/304 (9%)
Query: 52 FKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLD 111
F+ +L + LN + + + +G+N+F+ L +EFK YL +K + F
Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPR-------YS 96
Query: 112 DEFINSIPPEE-QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 170
E SIP FDWR + VT V+NQ CG CW+FS G VE + I L LS
Sbjct: 97 AEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLS 156
Query: 171 EQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK-NGGIQTESSYPYTAETGTQCN 229
Q ++DC + + GCNGG NA N++ K + +S YP+ A+ G C+
Sbjct: 157 VQQVIDCSYN----------NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGL-CH 205
Query: 230 FNSANIGPEEQAKISNFTM--IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPC 287
pattern 237 ****
+ S G I ++ E MA +++ GPL + DAV WQ Y+GG+ C
Sbjct: 206 YFS---GSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC 262
Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVS 347
+ +H +LI G+ + PYWIV+NSWG+ WG GY +++ G N CG+++ VS
Sbjct: 263 SSGEANHAVLITGFDKTG-----STPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVS 317
Query: 348 TSII 351
+ +
Sbjct: 318 SIFV 321
>sp|P00784|PAPA_CARPA PAPAIN PRECURSOR (PAPAYA PROTEINASE I) (PPI)
Length = 345
Score = 173 bits (433), Expect = 7e-43
Identities = 119/322 (36%), Positives = 163/322 (49%), Gaps = 43/322 (13%)
Query: 35 KFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKFADLSSDEFKNY 91
K NK Y + +E + RFEIFK NL I+E N K + + G+N FAD+S+DEFK
Sbjct: 54 KHNKIYKNIDEKIYRFEIFKDNLKYIDETN------KKNNSYWLGLNVFADMSNDEFKEK 107
Query: 92 YLNNKEAIFTD-DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFS 150
Y + +T +L + L+D +N PE DWR +GAVTPVKNQG CGSCW+FS
Sbjct: 108 YTGSIAGNYTTTELSYEEVLNDGDVNI--PEY---VDWRQKGAVTPVKNQGSCGSCWAFS 162
Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNG 210
+EG I L SEQ L+DCD GCNGG +A ++
Sbjct: 163 AVVTIEGIIKIRTGNLNEYSEQELLDCDRR----------SYGCNGGYPWSALQ-LVAQY 211
Query: 211 GIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAA 270
pattern 237 ****
GI ++YPY G Q S GP + P NE + Y ++ P+++
Sbjct: 212 GIHYRNTYPY---EGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALL-YSIANQPVSVVL 267
Query: 271 DAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQ 328
+A ++Q Y GG+F PC N +DH + VGY Y ++KNSWG WGE
Sbjct: 268 EAAGKDFQLYRGGIFVGPCG-NKVDHAVAAVGYGPN---------YILIKNSWGTGWGEN 317
Query: 329 GYIYLRRGK-NTCGVSNFVSTS 349
GYI ++RG N+ GV ++S
Sbjct: 318 GYIRIKRGTGNSYGVCGLYTSS 339
>sp|P25774|CATS_HUMAN CATHEPSIN S PRECURSOR
Length = 331
Score = 171 bits (428), Expect = 3e-42
Identities = 116/351 (33%), Positives = 175/351 (49%), Gaps = 35/351 (9%)
Query: 5 LLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYS--HEEYLERFEIFKSNLGKIEEL 62
L+ VL V + V+ P + ++ + K+Y +EE + R I++ NL +
Sbjct: 4 LVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRL-IWEKNLKFVMLH 62
Query: 63 NLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122
NL G+N D++S+E + T L V P
Sbjct: 63 NLEHSMGMHSYDLGMNHLGDMTSEEVMS---------LTSSLRVPSQWQRNITYKSNPNR 113
Query: 123 --QTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 180
+ DWR +G VT VK QG CG+CW+FS G +E Q + KLV+LS QNLVDC
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDC--- 170
Query: 181 CMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQ 240
pattern 237 ****
E+ ++GCNGG A+ YII N GI +++SYPY A +C ++S
Sbjct: 171 ----STEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKA-MDQKCQYDS----KYRA 221
Query: 241 AKISNFTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGIL 297
A S +T +P E V+ + + GP+++ DA F++ GV+ P +++HG+L
Sbjct: 222 ATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVL 281
Query: 298 IVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347
+VGY N YW+VKNSWG ++GE+GYI + R K N CG+++F S
Sbjct: 282 VVGYGDLN-----GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPS 327
>sp||CATL_CHICK_1 [Segment 1 of 2] CATHEPSIN L
Length = 176
Score = 167 bits (420), Expect = 2e-41
Identities = 87/179 (48%), Positives = 115/179 (63%), Gaps = 16/179 (8%)
Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186
DWR +G VTPVK+QGQCGSCW+FSTTG +EGQHF ++ KLVSLSEQNLVDC EG
Sbjct: 6 DWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRP----EG 61
Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNF 246
pattern 237 ****
++GCNGGL A+ Y+ NGGI +E SYPYTA+ C + + A + F
Sbjct: 62 ----NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKA----EYNAANDTGF 113
Query: 247 TMIPK-NETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIP-CNPNSLDHGILIVGY 301
IP+ +E + + S GP+++A DA +QFY G++ P C+ LDHG+L+VGY
Sbjct: 114 VDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGY 172
>sp|P25326|CATS_BOVIN CATHEPSIN S
Length = 217
Score = 165 bits (413), Expect = 1e-40
Identities = 90/227 (39%), Positives = 129/227 (56%), Gaps = 21/227 (9%)
Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184
+ DWR +G VT VK QG CGSCW+FS G +E Q + KLVSLS QNLVDC
Sbjct: 4 SMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDC------- 56
Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244
pattern 237 ****
+ ++GCNGG A+ YII N GI +E+SYPY A G +C ++ N A S
Sbjct: 57 STAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDG-KCQYDVKN----RAATCS 111
Query: 245 NFTMIP-KNETVMAGYIVSTGPLAIAADAVEWQFYI--GGVFDIPCNPNSLDHGILIVGY 301
+ +P +E + + + GP+++ DA F++ GV+ P +++HG+L+VGY
Sbjct: 112 RYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGY 171
Query: 302 SAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK-NTCGVSNFVS 347
+ YW+VKNSWG +G+QGYI + R N CG++N+ S
Sbjct: 172 GNLD-----GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPS 213
>sp|P80884|ANAN_ANACO ANANAIN
Length = 216
Score = 161 bits (403), Expect = 2e-39
Identities = 93/224 (41%), Positives = 123/224 (54%), Gaps = 26/224 (11%)
Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184
+ DWR GAVT VKNQG+CGSCW+F++ VE + I + LVSLSEQ ++DC
Sbjct: 4 SIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDC------- 56
Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244
pattern 237 ****
A GC GG AY++II N G+ + + YPY A GT C N G A I+
Sbjct: 57 ----AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGT-CKTN----GVPNSAYIT 107
Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303
+T + +N Y VS P+A A DA +Q Y GVF PC L+H I+I+GY
Sbjct: 108 RYTYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCG-TRLNHAIVIIGYGQ 166
Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 343
+ +WIV+NSWGA WGE GYI L R ++ CG++
Sbjct: 167 DSA----GKKFWIVRNSWGAGWGEGGYIRLARDVSSSFGICGIA 206
>sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR
Length = 330
Score = 158 bits (396), Expect = 1e-38
Identities = 89/226 (39%), Positives = 128/226 (56%), Gaps = 22/226 (9%)
Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEG 186
DWR +G VT VK QG CGSCW+FS G +EGQ + KLVSLS QNLVDC E
Sbjct: 118 DWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTE------ 171
Query: 187 EEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNF 246
pattern 237 ****
E+ ++GC GG A+ YII + I +E+SYPY A +C ++ N A S +
Sbjct: 172 EKYGNKGCGGGFMTEAFQYII-DTSIDSEASYPYKA-MDEKCLYDPKN----RAATCSRY 225
Query: 247 TMIP-KNETVMAGYIVSTGPLAIAADAV---EWQFYIGGVFDIPCNPNSLDHGILIVGYS 302
+P +E + + + GP+++ D + Y GV+D P +++HG+L+VGY
Sbjct: 226 IELPFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYG 285
Query: 303 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYL-RRGKNTCGVSNFVS 347
+ YW+VKNSWG +G+QGYI + R KN CG++++ S
Sbjct: 286 TLD-----GKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCS 326
>sp|P20721|CYSL_LYCES LOW-TEMPERATURE-INDUCED CYSTEINE PROTEINASE PRECURSOR
Length = 346
Score = 158 bits (395), Expect = 2e-38
Identities = 87/238 (36%), Positives = 130/238 (54%), Gaps = 25/238 (10%)
Query: 112 DEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSE 171
D ++ + + DWR +G + VK+QG CGSCW+FS +E + I L+SLSE
Sbjct: 8 DRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSE 67
Query: 172 QNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFN 231
Q LVDCD + +EGC+GGL A+ ++IKNGGI TE YPY G C+
Sbjct: 68 QELVDCD---------RSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGV-CDQY 117
Query: 232 SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGVFDIPCNP 289
pattern 237 ****
N + KI ++ +P N V+ P++IA +A ++Q Y G+F C
Sbjct: 118 RKN---AKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCG- 173
Query: 290 NSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNT----CGVS 343
++DHG++I GY +N M YWIV+NSWGA+ E GY+ ++R ++ CG++
Sbjct: 174 TAVDHGVVIAGYGTEN-----GMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLA 226
>sp|P36184|ACP1_ENTHI CYSTEINE PROTEINASE ACP1 PRECURSOR
Length = 308
Score = 152 bits (379), Expect = 1e-36
Identities = 105/320 (32%), Positives = 151/320 (46%), Gaps = 48/320 (15%)
Query: 29 FLEFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87
F ++ NK +++ EYL RF +F N +E A+ +N FAD++ +E
Sbjct: 18 FKQWAATHNKVFANRAEYLYRFAVFLDNKKFVE----------ANANTELNVFADMTHEE 67
Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147
F +L T ++P + + P + DWR+ + P K+QGQCGSCW
Sbjct: 68 FIQTHLG-----MTYEVPETTSNVKAAVKAAPE----SVDWRS--IMNPAKDQGQCGSCW 116
Query: 148 SFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYII 207
+F TT +EG+ KL S SEQ LVDCD A D GC GG N+ +I
Sbjct: 117 TFCTTAVLEGRVNKDLGKLYSFSEQQLVDCD----------ASDNGCEGGHPSNSLKFIQ 166
Query: 208 KNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267
pattern 237 ****
+N G+ ES YPY A GT C N+ ++ + +ET + I GP+A
Sbjct: 167 ENNGLGLESDYPYKAVAGT-CK-KVKNVATVTGSR----RVTDGSETGLQTIIAENGPVA 220
Query: 268 IAADA--VEWQFYIGGVF--DIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGA 323
+ DA +Q Y G D C ++H + VGY + + N YWI++NSWG
Sbjct: 221 VGMDASRPSFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNS-----NGKYWIIRNSWGT 275
Query: 324 DWGEQGYIYLRR-GKNTCGV 342
WG+ GY L R N CG+
Sbjct: 276 SWGDAGYFLLARDSNNMCGI 295
>sp|Q01957|CPP1_ENTHI CYSTEINE PROTEINASE 1 PRECURSOR
Length = 315
Score = 150 bits (375), Expect = 4e-36
Identities = 103/317 (32%), Positives = 163/317 (50%), Gaps = 47/317 (14%)
Query: 37 NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADT-KFGVN-KFADLSSDEFKNYYLN 94
NK ++ E L R IF N ++A N++ +T K V+ FA ++++E+ +
Sbjct: 24 NKHFTAVESLRRRAIFNMNA------RIVAENNRKETFKLSVDGPFAAMTNEEYNSLLKL 77
Query: 95 NKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGN 154
+ ++ ++N P+ A DWR +G VTP+++QG CGSC++F +
Sbjct: 78 KRSGEEKGEV--------RYLNIQAPK---AVDWRKKGKVTPIRDQGNCGSCYTFGSIAA 126
Query: 155 VEGQHFISQ---NKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGG 211
+EG+ I + ++ + LSE+++V C E +G + GCNGGL N YNYI++N G
Sbjct: 127 LEGRLLIEKGGDSETLDLSEEHMVQCTRE----DG----NNGCNGGLGSNVYNYIMEN-G 177
Query: 212 IQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAAD 271
pattern 237 ****
I ES YPYT T + AKI ++ + +N V +S G + ++ D
Sbjct: 178 IAKESDYPYTGSDST------CRSDVKAFAKIKSYNRVARNNEVELKAAISQGLVDVSID 231
Query: 272 A--VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326
A V++Q Y G + D C N +L+H + VGY + WIV+NSWG WG
Sbjct: 232 ASSVQFQLYKSGAYTDTQCKNNYFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWG 286
Query: 327 EQGYIYLRRGKNTCGVS 343
E+GYI + NTCGV+
Sbjct: 287 EKGYINMVIEGNTCGVA 303
>sp|O17473|CATL_BRUPA CATHEPSIN L-LIKE PRECURSOR
Length = 395
Score = 150 bits (374), Expect = 6e-36
Identities = 101/331 (30%), Positives = 157/331 (46%), Gaps = 29/331 (8%)
Query: 26 QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85
++++ ++ K Y +E R IF+SN E +N +N ADL+
Sbjct: 88 ETEWKDYVTALGKHYDQKENNFRMAIFESNELMTERINKKYEQGLVSYTTALNDLADLTD 147
Query: 86 DEF--KNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQC 143
+EF +N + +++ + +P + DWRT+GAVTPV+NQG+C
Sbjct: 148 EEFMVRNGLRLPNQTDLRGKRQTSEFYRYDKSERLPDQ----VDWRTKGAVTPVRNQGEC 203
Query: 144 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203
GSC++F+T +E H +L+ LS QN+VDC + GC+GG P A+
Sbjct: 204 GSCYAFATAAALEAYHKQMTGRLLDLSPQNIVDCT--------RNLGNNGCSGGYMPTAF 255
Query: 204 NYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMI-PKNETVMAGYIVS 262
pattern 237 ****
Y + GI ES YPY T +C + + + + F I P +E + +
Sbjct: 256 QYASRY-GIAMESRYPYVG-TEQRCRWQQSIAVVTD----NGFNEIQPGDELALKHAVAK 309
Query: 263 TGP--LAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320
GP + I+ ++FY GV+ N DH +L VGY + YWIVKNS
Sbjct: 310 RGPVVVGISGSKRSFRFYKDGVYS-EGNCGRPDHAVLAVGYGTHPSY----GDYWIVKNS 364
Query: 321 WGADWGEQGYIYLRRGK-NTCGVSNFVSTSI 350
WG DWG+ GY+Y+ R + N C +++ S I
Sbjct: 365 WGTDWGKDGYVYMARNRGNMCHIASAASFPI 395
>sp|P46102|CYSP_PLAVN CYSTEINE PROTEINASE PRECURSOR
Length = 506
Score = 150 bits (374), Expect = 6e-36
Identities = 116/363 (31%), Positives = 180/363 (48%), Gaps = 64/363 (17%)
Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85
S+F ++ + NKKY + +E L+RFE FK K ++ N + + VN+++D S
Sbjct: 160 SKFFKYMKENNKKYENMDEQLQRFENFKIRYMKTQKHNEMVGKNGLTYVQKVNQYSDFSK 219
Query: 86 DEFKNYYLNNKEAIFTDDL------PVADYLDDEFINSIPPEEQT---AFDWRTRGAVTP 136
+EF NY+ K DL P+ +L + + S+ + + + D+R++ P
Sbjct: 220 EEFDNYF--KKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNFLP 277
Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKL-VSLSEQNLVDCDHECMEYEGEEACDEGCN 195
K+QG CGSCW+F+ GN E + +++++ +S SEQ +VDC E + GC+
Sbjct: 278 PKDQGNCGSCWAFAAIGNFEYLYVHTRHEMPISFSEQQMVDCSTE----------NYGCD 327
Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQC-NFNSANIGPEEQAKISNFTMIPKNET 254
pattern 237 ****
GG A+ Y+I NG + YPY C N+ + +G ++ + NE
Sbjct: 328 GGNPFYAFLYMINNG-VCLGDEYPYKGHEDFFCLNYRCSLLG-----RVHFIGDVKPNEL 381
Query: 255 VMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSA---------- 303
+MA V GP+ IA A E + Y GGVFD CNP L+H +L+VGY
Sbjct: 382 IMALNYV--GPVTIAVGASEDFVLYSGGVFDGECNPE-LNHSVLLVGYGQVKKSLAFEDS 438
Query: 304 -----KNTI--FRKNMP---------YWIVKNSWGADWGEQGYIYLRRGK----NTCGVS 343
N I +++N+ YWIV+NSWG +WGE GYI ++R K CGV
Sbjct: 439 HSNVDSNLIKKYKENIKGDDDDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVG 498
Query: 344 NFV 346
+ V
Sbjct: 499 SDV 501
>sp|Q06964|CPP3_ENTHI CYSTEINE PROTEINASE 3 PRECURSOR (CYSTEINE PROTEINASE ACP3)
Length = 308
Score = 149 bits (372), Expect = 9e-36
Identities = 103/316 (32%), Positives = 159/316 (49%), Gaps = 45/316 (14%)
Query: 37 NKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDEFKNYYLNN 95
NK ++ E L R IF N + E N K K V+ FA ++++E++ L +
Sbjct: 17 NKHFTAVEALRRRAIFNMNARFVAEFN-----KKGSFKLSVDGPFAAMTNEEYRTL-LKS 70
Query: 96 KEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155
K + + ++N PE + DWR +G VTP+++Q QCGSC++F + +
Sbjct: 71 KRTVEENGKVT-------YLNIQAPE---SVDWRAQGKVTPIRDQAQCGSCYTFGSLAAL 120
Query: 156 EGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI 212
EG+ I + + LSE++LV C + + GCNGGL N Y+YII+N G+
Sbjct: 121 EGRLLIEKGGNANTLDLSEEHLVQCT--------RDNGNNGCNGGLGSNVYDYIIQN-GV 171
Query: 213 QTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADA 272
pattern 237 ****
ES YPYT T + C N + AKI+ + +P+N +S G + ++ DA
Sbjct: 172 AKESDYPYTG-TDSTCKTN-----VKAFAKITGYNKVPRNNEAELKAALSQGLVDVSIDA 225
Query: 273 --VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 327
++Q Y G + D C N +L+H + VGY + WIV+NSWG WG+
Sbjct: 226 SSAKFQLYKSGAYSDTKCKNNFFALNHEVCAVGYGVVD-----GKECWIVRNSWGTGWGD 280
Query: 328 QGYIYLRRGKNTCGVS 343
+GYI + NTCGV+
Sbjct: 281 KGYINMVIEGNTCGVA 296
>sp|Q01958|CPP2_ENTHI CYSTEINE PROTEINASE 2 PRECURSOR
Length = 315
Score = 149 bits (372), Expect = 9e-36
Identities = 102/324 (31%), Positives = 161/324 (49%), Gaps = 45/324 (13%)
Query: 29 FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN-KFADLSSDE 87
F + K NK ++ E L R IF N ++ N I K V+ FA ++++E
Sbjct: 16 FNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDGPFAAMTNEE 70
Query: 88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCW 147
++ + + T++ YL+ + S+ DWR G VTP+++Q QCGSC+
Sbjct: 71 YRTLLKSKRT---TEENGQVKYLNIQAPESV--------DWRKEGKVTPIRDQAQCGSCY 119
Query: 148 SFSTTGNVEGQHFISQN---KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYN 204
+F + +EG+ I + + LSE+++V C + + GCNGGL N Y+
Sbjct: 120 TFGSLAALEGRLLIEKGGDANTLDLSEEHMVQCT--------RDNGNNGCNGGLGSNVYD 171
Query: 205 YIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTG 264
pattern 237 ****
YII++ G+ ES YPYT T C N + AKI+ +T +P+N +S G
Sbjct: 172 YIIEH-GVAKESDYPYTGSDST-CKTNVKSF-----AKITGYTKVPRNNEAELKAALSQG 224
Query: 265 PLAIAADA--VEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMPYWIVKN 319
+ ++ DA ++Q Y G + D C N +L+H + VGY + WIV+N
Sbjct: 225 LVDVSIDASSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKECWIVRN 279
Query: 320 SWGADWGEQGYIYLRRGKNTCGVS 343
SWG WG++GYI + NTCGV+
Sbjct: 280 SWGTGWGDKGYINMVIEGNTCGVA 303
>sp|P36185|ACP2_ENTHI CYSTEINE PROTEINASE ACP2 PRECURSOR
Length = 310
Score = 145 bits (363), Expect = 1e-34
Identities = 102/330 (30%), Positives = 160/330 (47%), Gaps = 40/330 (12%)
Query: 20 GIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN- 78
GI F + K NK ++ E L R IF N ++ N I K V+
Sbjct: 3 GIRIASAIDFNTWASKNNKHFTAIEKLRRRAIFNMNAKFVDSFNKIG-----SFKLSVDG 57
Query: 79 KFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVK 138
FA ++++E++ + + T++ YL+ + S+ DWR G VTP++
Sbjct: 58 PFAAMTNEEYRTLLKSKRT---TEENGQVKYLNIQAPESV--------DWRKEGKVTPLR 106
Query: 139 NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198
+Q QCGSC++F + +EG+ I + + N +D E M+ + + GCNGGL
Sbjct: 107 DQAQCGSCYTFGSLAALEGRLLIEKG-----GDANTLDLSEEHMQCTRDNG-NNGCNGGL 160
Query: 199 QPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAG 258
pattern 237 ****
N Y+YII++G + ES YPYT T C N + KI+ +T +P+N
Sbjct: 161 GSNVYDYIIEHG-VAKESDYPYTGSDST-CKTNVKSF-----RKITGYTKVPRNNEAELK 213
Query: 259 YIVSTGPLAIAAD--AVEWQFYIGGVF-DIPCNPN--SLDHGILIVGYSAKNTIFRKNMP 313
+S G L ++ D + ++Q Y G + D C N +L+H + VGY +
Sbjct: 214 AALSQGLLDVSIDVSSAKFQLYKSGAYTDTKCKNNYFALNHEVCAVGYGVVD-----GKE 268
Query: 314 YWIVKNSWGADWGEQGYIYLRRGKNTCGVS 343
WIV+NSWG WG++GYI + NTCGV+
Sbjct: 269 CWIVRNSWGTSWGDKGYINMVIEGNTCGVA 298
>sp|P25781|CYSP_THEAN CYSTEINE PROTEINASE PRECURSOR
Length = 441
Score = 145 bits (362), Expect = 1e-34
Identities = 107/345 (31%), Positives = 165/345 (47%), Gaps = 58/345 (16%)
Query: 28 QFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--NKFADLS 84
+F F +K+ K + S ++ ++RF F+ N ++ HK + + NKF+DLS
Sbjct: 119 EFDAFVEKYKKVHRSFDQRVQRFLTFRKNYHIVK-------THKPTEPYSLDLNKFSDLS 171
Query: 85 SDEFKNYY--------------------LNNKEAIFTDDLPVADYLDDEFINSIPPEEQT 124
+EFK Y +++K I+ L A +++ S+ E
Sbjct: 172 DEEFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEEIKDLSLITGEN- 230
Query: 125 AFDWRTRGAVTPVKNQGQ-CGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECME 183
+W AV+P K+QG CGSCW+FS+ +VE + + +NK LSEQ LV+CD M
Sbjct: 231 -LNWARTDAVSPTKDQGDHCGSCWAFSSIASVESLYRLYKNKSYFLSEQELVNCDKSSM- 288
Query: 184 YEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKI 243
pattern 237 ****
GC GGL A Y I + G+ ES PYT + C + N + I
Sbjct: 289 ---------GCAGGLPITALEY-IHSKGVSFESEVPYTGIV-SPCKPSIKN-----KVFI 332
Query: 244 SNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303
+ +++ N+ V ++S + IA E + Y GG+F C L+H +L+VG
Sbjct: 333 DSISILKGNDVVNKSLVISPTVVGIAV-TKELKLYSGGIFTGKCG-GELNHAVLLVGEGV 390
Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGVSNF 345
+ M YWI+KNSWG DWGE G++ L+R G + CG+ F
Sbjct: 391 DH---ETGMRYWIIKNSWGEDWGENGFLRLQRTKKGLDKCGILTF 432
>sp|P22497|CYSP_THEPA CYSTEINE PROTEINASE PRECURSOR
Length = 439
Score = 143 bits (357), Expect = 5e-34
Identities = 105/351 (29%), Positives = 163/351 (45%), Gaps = 72/351 (20%)
Query: 24 EEQSQFLEFQDKFNKKYS-HEEYLERFEIFKSNLGKIEELNLIAINHKADTKF--GVNKF 80
E +F EF K+N++++ +E L R F+SN +++E K D + G+N+F
Sbjct: 119 EVYREFEEFNSKYNRRHATQQERLNRLVTFRSNYLEVKE-------QKGDEPYVKGINRF 171
Query: 81 ADLSSDEF--------------------------KNYYLNNKEAIFTDDLPVADYLDDEF 114
+DL+ EF K Y N K+A+ TD+ D
Sbjct: 172 SDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDE--------DVD 223
Query: 115 INSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL 174
+ + E DWR +VT VK+Q CG CW+FST G+VEG + +K LS Q L
Sbjct: 224 LAKLTGEN---LDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQEL 280
Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSAN 234
+DCD + GC GGL +AY Y+ K G+ + P+ + +C+ A
Sbjct: 281 LDCD----------SFSNGCQGGLLESAYEYVRKY-GLVSAKDLPF-VDKARRCSVPKA- 327
Query: 235 IGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDH 294
pattern 237 ****
++ + ++ + K + VM + S+ + + E Y GVF C SL+H
Sbjct: 328 ----KKVSVPSYHVF-KGKEVMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECG-KSLNH 381
Query: 295 GILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR---GKNTCGV 342
+++VG ++ YW+V+NSWG DWGE GY+ L R G + CGV
Sbjct: 382 AVVLVGEGYDEVTKKR---YWVVQNSWGTDWGENGYMRLERTNMGTDKCGV 429
>sp|P25805|CYSP_PLAFA THROPHOZOITE CYSTEINE PROTEINASE PRECURSOR (TCP)
Length = 569
Score = 141 bits (351), Expect = 3e-33
Identities = 107/367 (29%), Positives = 169/367 (45%), Gaps = 62/367 (16%)
Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85
S+F +F + NK Y + +E + +FEIFK N I+ N +N A K VN+F+D S
Sbjct: 223 SKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN--KLNKNAMYKKKVNQFSDYSE 280
Query: 86 DEFKNYYLN----NKEAIFTDDLPVADYLDD-----EFINSIPPEEQTAF-------DWR 129
+E K Y+ I P ++L D EF + E+ F D+R
Sbjct: 281 EELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 340
Query: 130 TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA 189
+G V K+QG CGSCW+F++ GN+E ++S SEQ +VDC +
Sbjct: 341 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--------- 391
Query: 190 CDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMI 249
pattern 237 ****
+ GC+GG ++ Y+++N + Y Y A+ C N + + +S+ +
Sbjct: 392 -NFGCDGGHPFYSFLYVLQN-ELCLGDEYKYKAKDDMFC----LNYRCKRKVSLSSIGAV 445
Query: 250 PKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGY------- 301
+N+ ++A + GPL++ ++ Y GV++ C+ L+H +L+VGY
Sbjct: 446 KENQLILA--LNEVGPLSVNVGVNNDFVAYSEGVYNGTCS-EELNHSVLLVGYGQVEKTK 502
Query: 302 -------SAKNTIFRKNMP------YWIVKNSWGADWGEQGYIYLRRGKN----TCGVSN 344
NT N P YWI+KNSW WGE G++ L R KN CG+
Sbjct: 503 LNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGE 562
Query: 345 FVSTSII 351
V I+
Sbjct: 563 EVFYPIL 569
>sp|P14518|BROM_ANACO BROMELAIN, STEM
Length = 212
Score = 139 bits (348), Expect = 6e-33
Identities = 81/224 (36%), Positives = 113/224 (50%), Gaps = 31/224 (13%)
Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY 184
+ DWR GAVT VKNQ CG+CW+F+ VE + I + L LSEQ ++DC
Sbjct: 5 SIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC------- 57
Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244
pattern 237 ****
A GC GG + A+ +II N G+ + + YPY A GT C + G A I+
Sbjct: 58 ----AKGYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGT-CKTD----GVPNSAYIT 108
Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADA-VEWQFYIGGVFDIPCNPNSLDHGILIVGYSA 303
+ +P+N Y VS P+ +A DA +Q+Y GVF+ PC SL+H + +GY
Sbjct: 109 GYARVPRNNESSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCG-TSLNHAVTAIGYGQ 167
Query: 304 KNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR----GKNTCGVS 343
+ I+ K WGA WGE GYI + R CG++
Sbjct: 168 DSIIYPK---------KWGAKWGEAGYIRMARDVSSSSGICGIA 202
>sp|P16311|MMAL_DERFA MAJOR MITE FECAL ALLERGEN DER F 1 PRECURSOR (DER F I)
Length = 321
Score = 138 bits (345), Expect = 1e-32
Identities = 115/352 (32%), Positives = 157/352 (43%), Gaps = 52/352 (14%)
Query: 7 FVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLI 65
FVLA+ ++ V S P F EF+ FNK Y+ +E E+ + N +E L +
Sbjct: 3 FVLAIASLLVLSTVYARPASIKTFEEFKKAFNKNYAT---VEEEEVARKNF--LESLKYV 57
Query: 66 AINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----INSIPPE 121
N K +N +DLS DEFKN YL + EA + L L+ E INS+
Sbjct: 58 EAN-----KGAINHLSDLSLDEFKNRYLMSAEAF--EQLKTQFDLNAETSACRINSVNVP 110
Query: 122 EQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHEC 181
+ D R+ VTP++ QG CGSCW+FS E + +N + LSEQ LVDC
Sbjct: 111 SE--LDLRSLRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTSLDLSEQELVDC---- 164
Query: 182 MEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQA 241
pattern 237 ****
A GC+G P YI +NG ++ E SYPY A NS + G
Sbjct: 165 -------ASQHGCHGDTIPRGIEYIQQNGVVE-ERSYPYVAREQRCRRPNSQHYG----- 211
Query: 242 KISNFTMIPKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPCNPNSLD 293
ISN+ I + ++ AIA D +Q Y G D PN
Sbjct: 212 -ISNYCQIYPPDVKQIREALTQTHTAIAVIIGIKDLRAFQHYDGRTIIQHDNGYQPNY-- 268
Query: 294 HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNF 345
H + IVGY + + YWIV+NSW WG+ GY Y + G N + +
Sbjct: 269 HAVNIVGYGS-----TQGDDYWIVRNSWDTTWGDSGYGYFQAGNNLMMIEQY 315
>sp|P42666|CYSP_PLAVI CYSTEINE PROTEINASE PRECURSOR
Length = 583
Score = 129 bits (320), Expect = 1e-29
Identities = 100/370 (27%), Positives = 166/370 (44%), Gaps = 84/370 (22%)
Query: 27 SQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSS 85
S+F F +K+ + Y E +E+++ FK N KI++ N K VN+F+D S
Sbjct: 235 SKFFNFMNKYKRSYKDINEQMEKYKNFKMNYLKIKKHN----ETNQMYKMKVNQFSDYSK 290
Query: 86 DEFKNYYLNNKEAIFTDDLPVADYLDDEFI--------------------NSIPPEEQTA 125
+F++Y F +P+ D+L +++ ++ +
Sbjct: 291 KDFESY--------FRKLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEI 342
Query: 126 FDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK-LVSLSEQNLVDCDHECMEY 184
D+R +G V K+QG CGSCW+F++ GNVE + NK +++LSEQ +VDC
Sbjct: 343 LDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAKEHNKTILTLSEQEVVDC------- 395
Query: 185 EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKIS 244
pattern 237 ****
+ GC+GG ++ Y I+N GI Y Y A C N + + +S
Sbjct: 396 ---SKLNFGCDGGHPFYSFIYAIEN-GICMGDDYKYKAMDNLFC----LNYRCKNKVTLS 447
Query: 245 NFTMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYS- 302
+ + +NE + A + GP+++ ++ FY GG+F+ C L+H +L+VGY
Sbjct: 448 SVGGVKENELIRA--LNEVGPVSVNVGVTDDFSFYGGGIFNGTCT-EELNHSVLLVGYGQ 504
Query: 303 -AKNTIFRKN-------------------------MPYWIVKNSWGADWGEQGYIYLRRG 336
+ IF++ YWI+KNSW WGE G++ + R
Sbjct: 505 VQSSKIFQEKNAYDDASGVTKKGALSYPSKADDGIQYYWIIKNSWSKFWGENGFMRISRN 564
Query: 337 KN----TCGV 342
K CG+
Sbjct: 565 KEGDNVFCGI 574
>sp|P08176|MMAL_DERPT MAJOR MITE FECAL ALLERGEN DER P 1 PRECURSOR (DER P I)
Length = 320
Score = 121 bits (300), Expect = 3e-27
Identities = 111/345 (32%), Positives = 151/345 (43%), Gaps = 57/345 (16%)
Query: 1 MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIE 60
MK++L + V +R P F E++ FNK Y+ E E + N +E
Sbjct: 1 MKIVLAIASLLALSAVYAR---PSSIKTFEEYKKAFNKSYAT---FEDEEAARKNF--LE 52
Query: 61 ELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF----IN 116
+ + N A +N +DLS DEFKN +L + EA + L L+ E IN
Sbjct: 53 SVKYVQSNGGA-----INHLSDLSLDEFKNRFLMSAEAF--EHLKTQFDLNAETNACSIN 105
Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176
P E D R VTP++ QG CGSCW+FS E + +N+ + L+EQ LVD
Sbjct: 106 GNAPAE---IDLRQMRTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNQSLDLAEQELVD 162
Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236
C A GC+G P YI NG +Q ES Y Y A + N+ G
Sbjct: 163 C-----------ASQHGCHGDTIPRGIEYIQHNGVVQ-ESYYRYVAREQSCRRPNAQRFG 210
Query: 237 PEEQAKISNFTMI-PKNETVMAGYIVSTGPLAIAA-----DAVEWQFYIGGVF---DIPC 287
pattern 237 ****
ISN+ I P N + + T AIA D ++ Y G D
Sbjct: 211 ------ISNYCQIYPPNVNKIREALAQTHS-AIAVIIGIKDLDAFRHYDGRTIIQRDNGY 263
Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 332
PN H + IVGYS + + YWIV+NSW +WG+ GY Y
Sbjct: 264 QPNY--HAVNIVGYSN-----AQGVDYWIVRNSWDTNWGDNGYGY 301
>sp|P80067|CATC_RAT DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C)
(CATHEPSIN J) (DIPEPTIDYL TRANSFERASE)
Length = 462
Score = 111 bits (274), Expect = 3e-24
Identities = 83/260 (31%), Positives = 128/260 (48%), Gaps = 34/260 (13%)
Query: 105 PVADYLDDEFINSIPPEEQTAFDWRT-RGA--VTPVKNQGQCGSCWSFSTTGNVEGQHFI 161
P+ D + + + S+P ++DWR RG V+PV+NQ CGSC+SF++ G +E + I
Sbjct: 218 PITDEIQQQIL-SLPE----SWDWRNVRGINFVSPVRNQESCGSCYSFASIGMLEARIRI 272
Query: 162 SQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYP 219
N + LS Q +V C +GC+GG ++ G+ E+ +P
Sbjct: 273 LTNNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIAGKYAQDFGVVEENCFP 322
Query: 220 YTAETGTQCN--FNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQ 276
pattern 237 ****
YTA T C N E + F NE +M +V GP+A+A + + +
Sbjct: 323 YTA-TDAPCKPKENCLRYYSSEYYYVGGFYG-GCNEALMKLELVKHGPMAVAFEVHDDFL 380
Query: 277 FYIGGVF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGY 330
Y G++ P NP L +H +L+VGY K+ + + YWIVKNSWG+ WGE GY
Sbjct: 381 HYHSGIYHHTGLSDPFNPFELTNHAVLLVGYG-KDPV--TGLDYWIVKNSWGSQWGESGY 437
Query: 331 IYLRRGKNTCGVSNFVSTSI 350
+RRG + C + + +I
Sbjct: 438 FRIRRGTDECAIESIAMAAI 457
>sp|P97821|CATC_MOUSE DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C)
(CATHEPSIN J) (DIPEPTIDYL TRANSFERASE)
Length = 462
Score = 109 bits (270), Expect = 9e-24
Identities = 91/335 (27%), Positives = 155/335 (46%), Gaps = 42/335 (12%)
Query: 34 DKFNKKYSH-----EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEF 88
+K N +H E Y ER ++ N ++ +N + K+ T ++ +S +
Sbjct: 147 EKVNMNAAHLGGLQERYSER--LYTHNHNFVKAINTV---QKSWTATAYKEYEKMSLRDL 201
Query: 89 KNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRT-RGA--VTPVKNQGQCGS 145
+++ P+ D + + +N PE ++DWR +G V+PV+NQ CGS
Sbjct: 202 IRRSGHSQRIPRPKPAPMTDEIQQQILNL--PE---SWDWRNVQGVNYVSPVRNQESCGS 256
Query: 146 CWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAY 203
C+SF++ G +E + I N + LS Q +V C +GC+GG
Sbjct: 257 CYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSPYA----------QGCDGGFPYLIA 306
Query: 204 NYIIKNGGIQTESSYPYTA-ETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVS 262
pattern 237 ****
++ G+ ES +PYTA ++ + N + + F NE +M +V
Sbjct: 307 GKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYSSDYYYVGGFYG-GCNEALMKLELVK 365
Query: 263 TGPLAIAADAVE-WQFYIGGVF-----DIPCNPNSL-DHGILIVGYSAKNTIFRKNMPYW 315
GP+A+A + + + Y G++ P NP L +H +L+VGY + YW
Sbjct: 366 HGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVT---GIEYW 422
Query: 316 IVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350
I+KNSWG++WGE GY +RRG + C + + +I
Sbjct: 423 IIKNSWGSNWGESGYFRIRRGTDECAIESIAVAAI 457
>sp|P25773|CATL_FELCA CATHEPSIN L (PROGESTERONE-DEPENDENT PROTEIN) (PDP)
Length = 139
Score = 108 bits (267), Expect = 2e-23
Identities = 55/145 (37%), Positives = 84/145 (57%), Gaps = 9/145 (6%)
Query: 196 GGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETV 255
pattern 237 ****
GGL +A+ Y+ NGG+ +E SYPY A+ G C + N A ++++ IP E
Sbjct: 1 GGLIDDAFQYVKDNGGLDSEESYPYHAQ-GDSCKYRPEN----SVANVTDYWDIPSKENE 55
Query: 256 MAGYIVSTGPLAIAADAV--EWQFYIGGVF-DIPCNPNSLDHGILIVGYSAKNTIFRKNM 312
+ + + GP++ A DA ++FY G++ D C+ +DHG+L+VGY A T +N
Sbjct: 56 LMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTE-TENK 114
Query: 313 PYWIVKNSWGADWGEQGYIYLRRGK 337
YWI+KNSWG DWG GYI + + +
Sbjct: 115 KYWIIKNSWGTDWGMDGYIKMAKDR 139
>sp|Q26563|CATC_SCHMA CATHEPSIN C PRECURSOR
Length = 454
Score = 108 bits (266), Expect = 3e-23
Identities = 75/238 (31%), Positives = 109/238 (45%), Gaps = 33/238 (13%)
Query: 126 FDWRT-----RGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLVDCD 178
FDW + R VTP++NQG CGSC++ + +E + + N + LS Q +VDC
Sbjct: 222 FDWTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPILSPQTVVDCS 281
Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNF--NSANIG 236
EGCNGG ++ G+ + PYT E +C N
Sbjct: 282 ----------PYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGEDTGKCTVSKNCTRYY 331
Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPC-------- 287
pattern 237 ****
+ + I + NE +M ++S GP + + E +QFY G++
Sbjct: 332 TTDYSYIGGYYGAT-NEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHTTVQTDHYNF 390
Query: 288 NPNSL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344
NP L +H +L+VGY PYW VKNSWG +WGEQGY + RG + CGV +
Sbjct: 391 NPFELTNHAVLLVGYGVDKL---SGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445
>sp|P53634|CATC_HUMAN DIPEPTIDYL-PEPTIDASE I PRECURSOR (DPP-I) (DPPI) (CATHEPSIN C)
(CATHEPSIN J) (DIPEPTIDYL TRANSFERASE)
Length = 463
Score = 107 bits (265), Expect = 3e-23
Identities = 75/235 (31%), Positives = 111/235 (46%), Gaps = 29/235 (12%)
Query: 124 TAFDWRTRGA---VTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCD 178
T++DWR V+PV+NQ CGSC+SF++ G +E + I N + LS Q +V C
Sbjct: 233 TSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS 292
Query: 179 HECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSA--NIG 236
+GC GG ++ G+ E+ +PYT T + C
Sbjct: 293 QYA----------QGCEGGFPYLIAGKYAQDFGLVEEACFPYTG-TDSPCKMKEDCFRYY 341
Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDI-----PCNPN 290
pattern 237 ****
E + F NE +M +V GP+A+A + + + Y G++ P NP
Sbjct: 342 SSEYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPF 400
Query: 291 SL-DHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344
L +H +L+VGY + M YWIVKNSWG WGE GY +RRG + C + +
Sbjct: 401 ELTNHAVLLVGYGTDSA---SGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIES 452
>sp|P25780|EUM1_EURMA MITE GROUP I ALLERGEN EUR M 1 (EUR M I)
Length = 211
Score = 99.8 bits (245), Expect = 7e-21
Identities = 73/228 (32%), Positives = 102/228 (44%), Gaps = 33/228 (14%)
Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVD 176
S+P E D R+ VTP++ QG CGSCW+FS + E + +N + L+EQ LVD
Sbjct: 10 SLPSE----LDLRSLRTVTPIRMQGGCGSCWAFSGVASTESAYLAYRNMSLDLAEQELVD 65
Query: 177 CDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIG 236
C A GC+G P YI +NG +Q E YPY A + N+ G
Sbjct: 66 C-----------ASQNGCHGDTIPRGIEYIQQNGVVQ-EHYYPYVAREQSCHRPNAQRYG 113
Query: 237 PEEQAKISNFTMIPKNETVMAGYIVSTGPLAI---AADAVEWQFYIGGVF---DIPCNPN 290
pattern 237 ****
+ +IS P + + + +A+ D ++ Y G D PN
Sbjct: 114 LKNYCQISP----PDSNKIRQALTQTHTAVAVIIGIKDLNAFRHYDGRTIMQHDNGYQPN 169
Query: 291 SLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN 338
H + IVGY NT + + YWIV+NSW WG+ GY Y N
Sbjct: 170 Y--HAVNIVGYG--NT---QGVDYWIVRNSWDTTWGDNGYGYFAANIN 210
>sp|Q23894|CYS3_DICDI CYSTEINE PROTEINASE 3 (CYSTEINE PROTEINASE II)
Length = 151
Score = 94.8 bits (232), Expect = 2e-19
Identities = 60/158 (37%), Positives = 87/158 (54%), Gaps = 15/158 (9%)
Query: 41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIF 100
+H+E++ R+E FK N+ + N + + T G+N+ ADLS++E++ YL + I
Sbjct: 1 THKEFMPRYEEFKKNMDYVHNWN----SKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIK 56
Query: 101 TDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHF 160
+ + +N ++ DWR + AVTPVK+QGQCGSC STTG+VEG
Sbjct: 57 LNGYHKRNL--GLRLNRPHFKQPLNVDWREKDAVTPVKDQGQCGSC-IISTTGSVEGVTA 113
Query: 161 ISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGL 198
I KLVSLSEQN++ +EGCNGGL
Sbjct: 114 IKTGKLVSLSEQNILRL--------SSSFGNEGCNGGL 143
>sp|P43509|CPR5_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 5 PRECURSOR
Length = 344
Score = 90.9 bits (222), Expect = 4e-18
Identities = 69/272 (25%), Positives = 111/272 (40%), Gaps = 47/272 (17%)
Query: 108 DYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLV 167
D + E ++IP W ++ +++Q CGSCW+F+ + + I+ N V
Sbjct: 72 DIVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAV 131
Query: 168 S--LSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSY------- 218
+ LS ++L+ C G +C GC GG A+ + +K+G + T SY
Sbjct: 132 NTLLSSEDLLSC------CTGMFSCGNGCEGGYPIQAWKWWVKHG-LVTGGSYETQFGCK 184
Query: 219 PY-----------------------TAETGTQCNFNSANIGPEEQAKISNFTM--IPKNE 253
pattern 237 ****
PY T + C + P Q K T + K
Sbjct: 185 PYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKV 244
Query: 254 TVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNM 312
+ I++ GP+ +A E + Y GV+ + H + I+G+ N
Sbjct: 245 EQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDN-----GT 299
Query: 313 PYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344
PYW+V NSW WGE+GY + RG N CG+ +
Sbjct: 300 PYWLVANSWNVAWGEKGYFRIIRGLNECGIEH 331
>sp|P43508|CPR4_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 4 PRECURSOR
Length = 335
Score = 90.5 bits (221), Expect = 5e-18
Identities = 73/299 (24%), Positives = 124/299 (41%), Gaps = 50/299 (16%)
Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140
D++ ++ K + + A T D+ V + +E ++IP W ++ +++Q
Sbjct: 46 DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE--DTIPATFDARTQWPNCMSINNIRDQ 103
Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 198
CGSCW+F+ + I+ N V+ LS ++++ C C C GC GG
Sbjct: 104 SDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSC---CSN------CGYGCEGGY 154
Query: 199 QPNAYNYIIKNG---GIQTESSYPYTAETGTQCNFNSANI--------GPEEQAKISNFT 247
pattern 237 ****
NA+ Y++K+G G E+ + + C N+ G + A ++ T
Sbjct: 155 PINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCT 214
Query: 248 -------------------MIPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPC 287
+ K + + I++ GP+ A E + Y GV+
Sbjct: 215 NKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTT 274
Query: 288 NPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 346
H I I+G+ N PYW+V NSW +WGE GY + RG N CG+ + V
Sbjct: 275 GQELGGHAIRILGWGTDN-----GTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAV 328
>sp|P05993|PAP5_CARPA CYSTEINE PROTEINASE (CLONE PLBPC13)
Length = 96
Score = 90.5 bits (221), Expect = 5e-18
Identities = 43/87 (49%), Positives = 55/87 (62%), Gaps = 2/87 (2%)
Query: 264 GPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN--TIFRKNMPYWIVKNSW 321
GPLA+A +A Q YIGGV L+HG+L+VGY + I K PYW++KNSW
Sbjct: 1 GPLAVAINAAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGYAPIRLKEKPYWVIKNSW 60
Query: 322 GADWGEQGYIYLRRGKNTCGVSNFVST 348
G +WGE GY + RG+N CGV + VST
Sbjct: 61 GENWGENGYYKICRGRNICGVDSMVST 87
>sp|P07688|CATB_BOVIN CATHEPSIN B PRECURSOR
Length = 335
Score = 88.5 bits (216), Expect = 2e-17
Identities = 65/259 (25%), Positives = 105/259 (40%), Gaps = 47/259 (18%)
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL---SEQNL 174
+P W + +++QG CGSCW+F + + I N V++ +E L
Sbjct: 80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139
Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ--------------------- 213
C EC +GCNGG A+N+ K G +
Sbjct: 140 TCCGGEC---------GDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHH 190
Query: 214 -TESSYPYTAETGT-QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266
pattern 237 ****
S P T E T +CN S + ++ S++++ + +MA I GP+
Sbjct: 191 VNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAE-IYKNGPV 249
Query: 267 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325
A ++ Y GV+ H I I+G+ +N PYW+V NSW DW
Sbjct: 250 EGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVEN-----GTPYWLVGNSWNTDW 304
Query: 326 GEQGYIYLRRGKNTCGVSN 344
G+ G+ + RG++ CG+ +
Sbjct: 305 GDNGFFKILRGQDHCGIES 323
>sp|P00787|CATB_RAT CATHEPSIN B PRECURSOR (CATHEPSIN B1) (RSG-2)
Length = 339
Score = 87.4 bits (213), Expect = 4e-17
Identities = 66/265 (24%), Positives = 113/265 (41%), Gaps = 45/265 (16%)
Query: 117 SIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNL 174
++P W + +++QG CGSCW+F + + I N V++ S ++L
Sbjct: 79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138
Query: 175 VDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK----NGGIQTE--------------- 215
+ C C C +GCNGG A+N+ + +GG+
Sbjct: 139 LTC---C-----GIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHH 190
Query: 216 ---SSYPYTAETGT-QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266
pattern 237 ****
S P T E T +CN S + ++ +++++ + +MA I GP+
Sbjct: 191 VNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAE-IYKNGPV 249
Query: 267 AIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325
A ++ Y GV+ H I I+G+ +N + PYW+V NSW DW
Sbjct: 250 EGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGV-----PYWLVANSWNVDW 304
Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSI 350
G+ G+ + RG+N CG+ + + I
Sbjct: 305 GDNGFFKILRGENHCGIESEIVAGI 329
>sp|P25807|CYS1_CAEEL GUT-SPECIFIC CYSTEINE PROTEINASE PRECURSOR
Length = 329
Score = 87.0 bits (212), Expect = 5e-17
Identities = 66/288 (22%), Positives = 117/288 (39%), Gaps = 38/288 (13%)
Query: 82 DLSSDEFKNYYLNNK-EAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQ 140
+++ +E K ++ K A +D++ + + + S+P + W ++ +++Q
Sbjct: 50 EITEEEMKFKLMDGKYAAAHSDEIRATE--QEVVLASVPATFDSRTQWSECKSIKLIRDQ 107
Query: 141 GQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYEGEEACDEGCNGGL 198
CGSCW+F + + I +S +L+ C C +C GC GG
Sbjct: 108 ATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSC---C-----GSSCGNGCEGGY 159
Query: 199 QPNAYNY-----IIKNGGIQTESSYPYTAETGTQ----------CNFNSANIGPEEQAKI 243
pattern 237 ****
A + ++ G PY T C+ + + AK
Sbjct: 160 PIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKD 219
Query: 244 SNFTM----IPKNETVMAGYIVSTGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILI 298
+F + +PKN + I + GP+ A E + Y GV+ H I I
Sbjct: 220 KHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKI 279
Query: 299 VGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFV 346
+G+ ++ PYW+V NSWG +WGE G+ + RG + CG+ + V
Sbjct: 280 IGWGTES-----GSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAV 322
>sp|P07858|CATB_HUMAN CATHEPSIN B PRECURSOR (CATHEPSIN B1) (APP SECRETASE)
Length = 339
Score = 86.2 bits (210), Expect = 9e-17
Identities = 68/285 (23%), Positives = 110/285 (37%), Gaps = 55/285 (19%)
Query: 96 KEAIFTDDLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNV 155
+ +FT+DL +P W + +++QG CGSCW+F +
Sbjct: 70 QRVMFTEDL------------KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAI 117
Query: 156 EGQHFISQNKLVSL--SEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ 213
+ I N VS+ S ++L+ C C C +GCNGG A+N+ + G +
Sbjct: 118 SDRICIHTNAHVSVEVSAEDLLTC---C-----GSMCGDGCNGGYPAEAWNFWTRKGLVS 169
Query: 214 ----------------------TESSYPYTAETGTQ-----CNFNSANIGPEEQAKISNF 246
pattern 237 ****
S P T E T C + +++ N
Sbjct: 170 GGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNS 229
Query: 247 TMIPKNETVMAGYIVSTGPLAIAADAV-EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKN 305
+ +E + I GP+ A ++ Y GV+ H I I+G+ +N
Sbjct: 230 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVEN 289
Query: 306 TIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 350
PYW+V NSW DWG+ G+ + RG++ CG+ + V I
Sbjct: 290 -----GTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGI 329
>sp|P43157|CYSP_SCHJA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SJ31)
Length = 342
Score = 85.4 bits (208), Expect = 2e-16
Identities = 64/271 (23%), Positives = 109/271 (39%), Gaps = 57/271 (21%)
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQ--NKLVSLSEQNLV 175
IP + + W +++ +++Q +CGSCW+F + + I + LS +L+
Sbjct: 90 IPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLI 149
Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGI---------------------QT 214
C C + C +GC GG A++Y +K G + T
Sbjct: 150 SC---CKD------CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHT 200
Query: 215 ESSYP-------------YTAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIV 261
pattern 237 ****
+ YP T + G + + +E + N NE V+ I+
Sbjct: 201 KGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQN------NEKVIQRDIM 254
Query: 262 STGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNS 320
GP+ A D E + Y G++ H I I+G+ + K PYW++ NS
Sbjct: 255 MYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVE-----KRTPYWLIANS 309
Query: 321 WGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
W DWGE+G + RG++ C + + V +I
Sbjct: 310 WNEDWGEKGLFRMVRGRDECSIESDVVAGLI 340
>sp|P43233|CATB_CHICK CATHEPSIN B PRECURSOR (CATHEPSIN B1)
Length = 340
Score = 85.4 bits (208), Expect = 2e-16
Identities = 66/265 (24%), Positives = 111/265 (40%), Gaps = 46/265 (17%)
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLV 175
+P T W ++ +++QG CGSCW+F + + + N VS+ S ++L+
Sbjct: 80 LPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139
Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------- 213
C C G E C GCNGG A+ Y + G +
Sbjct: 140 SC---C----GFE-CGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHV 191
Query: 214 TESSYPYTAETGT--QCNFN-----SANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPL 266
pattern 237 ****
S P T E G +C+ + S + ++ I+++ +P++E + I GP+
Sbjct: 192 NGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYG-VPRSEKEIMAEIYKNGPV 250
Query: 267 AIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325
A E + Y GV+ H I I+G+ +N PYW+ NSW DW
Sbjct: 251 EGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVEN-----GTPYWLAANSWNTDW 305
Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSI 350
G G+ + RG++ CG+ + + +
Sbjct: 306 GITGFFKILRGEDHCGIESEIVAGV 330
>sp|P43510|CPR6_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 6 PRECURSOR
Length = 379
Score = 85.0 bits (207), Expect = 2e-16
Identities = 71/265 (26%), Positives = 116/265 (42%), Gaps = 53/265 (20%)
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK--LVSLSEQNLV 175
IP + +W ++ +++Q CGSCW+F + + I+ + V+LS +L+
Sbjct: 105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164
Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQ------CN 229
C C ++C GCNGG A+ Y +K+G I T S+Y TA G + C
Sbjct: 165 SC---C------KSCGFGCNGGDPLAAWRYWVKDG-IVTGSNY--TANNGCKPYPFPPCE 212
Query: 230 FNSANIGPE------------EQAKISNFTMIPKNETVMAGY---------------IVS 262
pattern 237 ** **
+S + E+ +S++T +E G +++
Sbjct: 213 HHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMT 272
Query: 263 TGPLAIAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSW 321
GPL IA + E + Y GGV+ H + ++G+ + I PYW V NSW
Sbjct: 273 HGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGI-----PYWTVANSW 327
Query: 322 GADWGEQGYIYLRRGKNTCGVSNFV 346
DWGE G+ + RG + CG+ + V
Sbjct: 328 NTDWGEDGFFRILRGVDECGIESGV 352
>sp|P25792|CYSP_SCHMA CATHEPSIN B-LIKE CYSTEINE PROTEINASE PRECURSOR (ANTIGEN SM31)
Length = 340
Score = 84.6 bits (206), Expect = 3e-16
Identities = 64/260 (24%), Positives = 107/260 (40%), Gaps = 45/260 (17%)
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175
IP + W ++ +++Q +CGSCWSF + + I + V LS +L+
Sbjct: 89 IPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLL 148
Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA---ETGTQCNFNS 232
C C E+C GC GG+ A++Y +K G + S +T +C ++
Sbjct: 149 TC---C------ESCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHT 199
Query: 233 ANIGPEEQAKISN---------------FTM----------IPKNETVMAGYIVSTGPLA 267
pattern 237 ****
P +KI N +T + +E + I+ GP+
Sbjct: 200 KGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVE 259
Query: 268 IAADAVE-WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWG 326
+ E + Y G++ H I I+G+ +N PYW++ NSW DWG
Sbjct: 260 ASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVEN-----KTPYWLIANSWNEDWG 314
Query: 327 EQGYIYLRRGKNTCGVSNFV 346
E GY + RG++ C + + V
Sbjct: 315 ENGYFRIVRGRDECSIESEV 334
>sp|P10605|CATB_MOUSE CATHEPSIN B PRECURSOR (CATHEPSIN B1)
Length = 339
Score = 84.6 bits (206), Expect = 3e-16
Identities = 66/253 (26%), Positives = 108/253 (42%), Gaps = 43/253 (16%)
Query: 128 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSL--SEQNLVDCDHECMEYE 185
W + +++QG CGSCW+F + + I N V++ S ++L+ C C
Sbjct: 90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTC---C---- 142
Query: 186 GEEACDEGCNGGLQPNAYNYIIK----NGGIQTE------------------SSYPYTAE 223
C +GCNGG A+++ K +GG+ S P T E
Sbjct: 143 -GIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGE 201
Query: 224 TGT-QCNFN-SANIGPE-EQAKISNFTMIPKNETV--MAGYIVSTGPLAIAADAV-EWQF 277
pattern 237 ** **
T +CN + A P ++ K +T + +V + I GP+ A ++
Sbjct: 202 GDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLT 261
Query: 278 YIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGK 337
Y GV+ H I I+G+ +N + PYW+ NSW DWG+ G+ + RG+
Sbjct: 262 YKSGVYKHEAGDMMGGHAIRILGWGVENGV-----PYWLAANSWNLDWGDNGFFKILRGE 316
Query: 338 NTCGVSNFVSTSI 350
N CG+ + + I
Sbjct: 317 NHCGIESEIVAGI 329
>sp|P25802|CYS1_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR
Length = 341
Score = 79.6 bits (193), Expect = 9e-15
Identities = 63/270 (23%), Positives = 106/270 (38%), Gaps = 46/270 (17%)
Query: 103 DLPVADYLDDEFINSIPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFIS 162
D V D +E + IP W ++ + +Q CGSCW+ S+ + + I+
Sbjct: 76 DEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIPDQANCGSCWAVSSAAAMSDRICIA 135
Query: 163 QN--KLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNY-----IIKNGGIQTE 215
K V +S Q++V C C C +GC GG +A+ + ++ G T+
Sbjct: 136 SKGAKQVLISAQDVVSC---CTW------CGDGCEGGWPISAFRFHADEGVVTGGDYNTK 186
Query: 216 SSY-PYTAET----GTQCNFNSANIGPEEQAKISNFTMI------PKNETVMAGYIVSTG 264
pattern 237 ****
S PY G + + +G + + ++ P + Y +
Sbjct: 187 GSCRPYEIHPCGHHGNETYYGEC-VGMADTPRCKRRCLLGYPKSYPSDRYYKKAYQLKNS 245
Query: 265 PLAIAADAV-------------EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKN 311
AI D + ++ Y G++ + H + ++G+ + K
Sbjct: 246 VKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGLHAVKVIGWGEE-----KG 300
Query: 312 MPYWIVKNSWGADWGEQGYIYLRRGKNTCG 341
PYWIV NSW DWGE G+ + RG N CG
Sbjct: 301 TPYWIVANSWHDDWGENGFFRMHRGSNDCG 330
>sp|P25793|CYS2_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 2 PRECURSOR
Length = 342
Score = 78.4 bits (190), Expect = 2e-14
Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 47/266 (17%)
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175
IPP W+ +++Q CGSCW+ ST + + I+ K V++S +++
Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145
Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ------TESSYPY--------- 220
C C C +GC GG A+ Y I +G + + PY
Sbjct: 146 TC---C-----RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHG 197
Query: 221 -------------TAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267
pattern 237 ****
T +C + ++ + ++ ++ + I+ GP+
Sbjct: 198 NDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPV- 256
Query: 268 IAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325
+A+ AV +++ Y G++ H + ++G+ +N N +W++ NSW DW
Sbjct: 257 VASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDW 311
Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSII 351
GE+GY + RG N CG+ ++ I+
Sbjct: 312 GEKGYFRIVRGSNDCGIEGTIAAGIV 337
>sp|P19092|CYS1_HAECO CATHEPSIN B-LIKE CYSTEINE PROTEINASE 1 PRECURSOR
Length = 342
Score = 77.6 bits (188), Expect = 4e-14
Identities = 59/266 (22%), Positives = 110/266 (41%), Gaps = 47/266 (17%)
Query: 118 IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQN--KLVSLSEQNLV 175
IPP W+ +++Q CGSCW+ ST + + I+ K V++S +++
Sbjct: 87 IPPSYDPRDVWKNCTTFY-IRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIM 145
Query: 176 DCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQ------TESSYPY--------- 220
C C C +GC GG A+ Y I +G + + PY
Sbjct: 146 TC---C-----RPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHG 197
Query: 221 -------------TAETGTQCNFNSANIGPEEQAKISNFTMIPKNETVMAGYIVSTGPLA 267
pattern 237 ****
T +C + ++ + ++ ++ + I+ GP+
Sbjct: 198 NDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPV- 256
Query: 268 IAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADW 325
+A+ AV +++ Y G++ H + ++G+ +N N +W++ NSW DW
Sbjct: 257 VASFAVYEDFRHYKSGIYKHTAGELRGYHAVKMIGWGNEN-----NTDFWLIANSWHNDW 311
Query: 326 GEQGYIYLRRGKNTCGVSNFVSTSII 351
GE+GY + RG N CG+ ++ I+
Sbjct: 312 GEKGYFRIIRGTNDCGIEGTIAAGIV 337
>sp|P43507|CPR3_CAEEL CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3 PRECURSOR
Length = 370
Score = 73.3 bits (177), Expect = 7e-13
Identities = 56/248 (22%), Positives = 98/248 (38%), Gaps = 39/248 (15%)
Query: 128 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS--LSEQNLVDCDHECMEYE 185
W + ++NQ CGSCW+F + + I N +S ++++ C C
Sbjct: 102 WPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSC---C---- 154
Query: 186 GEEACDEGCNGGLQPNAYNYIIKNGGIQ---------------------TESSYPYTAET 224
C GC GG A + +G + ES+ P + +T
Sbjct: 155 -GTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTTP-SCKT 212
Query: 225 GTQCNFNSANIGPEEQAKISNFTMIP-KNETVMAGYIVSTGPLAIAADAVE-WQFYIGGV 282
pattern 237 ****
Q ++ + ++ S + + K+ T + I GP+ + E + Y GV
Sbjct: 213 TCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGV 272
Query: 283 FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGV 342
+ H + I+G+ +N + YW++ NSWG +GE+G+ +RRG N C +
Sbjct: 273 YHYTSGKLVGGHAVKIIGWGVENGV-----DYWLIANSWGTSFGEKGFFKIRRGTNECQI 327
Query: 343 SNFVSTSI 350
V I
Sbjct: 328 EGNVVAGI 335
>sp|P13823|SERA_PLAFG SERINE-REPEAT ANTIGEN PROTEIN PRECURSOR (P126) (111 KD ANTIGEN)
Length = 989
Score = 70.2 bits (169), Expect = 6e-12
Identities = 63/247 (25%), Positives = 102/247 (40%), Gaps = 46/247 (18%)
Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGE--EACDEGC 194
V++QG C + W F++ ++E + + +S + +C Y+GE + CDEG
Sbjct: 579 VEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANC------YKGEHKDRCDEGS 632
Query: 195 NGGLQPNAYNYIIKNGG-IQTESSYPYT-AETGTQC------------------NFNSAN 234
+ P + II++ G + ES+YPY + G QC N N N
Sbjct: 633 S----PMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPN 688
Query: 235 I----------GPEEQAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFD 284
pattern 237 ****
+ F I K E + G +++ I A+ V + G
Sbjct: 689 SLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAY----IKAENVMGYEFSGKKVQ 744
Query: 285 IPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSN 344
C ++ DH + IVGY + YWIV+NSWG WG++GY + T N
Sbjct: 745 NLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMYGPTHCHFN 804
Query: 345 FVSTSII 351
F+ + +I
Sbjct: 805 FIHSVVI 811
>sp|P32956|CC3_CARCN CYSTEINE PROTEINASE III (CC-III)
Length = 43
Score = 60.9 bits (145), Expect = 4e-09
Identities = 24/33 (72%), Positives = 27/33 (81%)
Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157
+ DWR +GAVTPVKNQG CGSCW+FST VEG
Sbjct: 4 SIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEG 36
>sp|P32957|CC4_CARCN CYSTEINE PROTEINASE IV (CC-IV)
Length = 43
Score = 59.7 bits (142), Expect = 9e-09
Identities = 24/33 (72%), Positives = 27/33 (81%)
Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157
+ DWR +GAVTPVKNQG CGSCW+FST VEG
Sbjct: 4 SIDWRKKGAVTPVKNQGSCGSCWAFSTIVTVEG 36
>sp|Q06544|CYS3_OSTOS CATHEPSIN B-LIKE CYSTEINE PROTEINASE 3
Length = 174
Score = 59.3 bits (141), Expect = 1e-08
Identities = 31/103 (30%), Positives = 49/103 (47%), Gaps = 15/103 (14%)
Query: 249 IPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIF 308
I KN V+AG+IV ++ Y G++ + H + I+G+ +
Sbjct: 87 IMKNGPVVAGFIVYE----------DFAHYKSGIYKHTAGRMTGGHAVKIIGWGKE---- 132
Query: 309 RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 351
K PYW++ NSW DWGE+G+ + RG N C + V I+
Sbjct: 133 -KGTPYWLIANSWHDDWGEKGFYRMIRGINNCRIEEMVFAGIV 174
>sp|P32954|CC1_CARCN CYSTEINE PROTEINASE I (CC-I)
Length = 43
Score = 57.8 bits (137), Expect = 3e-08
Identities = 22/33 (66%), Positives = 27/33 (81%)
Query: 125 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157
+ DWR +GAVTPV+NQG CGSCW+FS+ VEG
Sbjct: 4 SIDWRQKGAVTPVRNQGSCGSCWTFSSVAAVEG 36
>sp|P32955|CC2_CARCN CYSTEINE PROTEINASE II (CC-II)
Length = 43
Score = 56.2 bits (133), Expect = 1e-07
Identities = 22/31 (70%), Positives = 25/31 (79%)
Query: 127 DWRTRGAVTPVKNQGQCGSCWSFSTTGNVEG 157
DWR +GAVTPVK+Q CGSCW+FST VEG
Sbjct: 6 DWRQKGAVTPVKDQNPCGSCWAFSTVATVEG 36
>sp||CATL_CHICK_2 [Segment 2 of 2] CATHEPSIN L
Length = 42
Score = 51.9 bits (122), Expect = 2e-06
Identities = 20/39 (51%), Positives = 28/39 (71%), Gaps = 1/39 (2%)
Query: 314 YWIVKNSWGADWGEQGYIYLRRG-KNTCGVSNFVSTSII 351
YWIVKNSWG WG++GYIY+ + KN CG++ S ++
Sbjct: 4 YWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 42
>sp|P12399|CT2A_MOUSE CTLA-2-ALPHA PROTEIN PRECURSOR
Length = 136
Score = 41.8 bits (96), Expect = 0.002
Identities = 31/101 (30%), Positives = 50/101 (48%), Gaps = 4/101 (3%)
Query: 9 LAVFTVFVSSRGIPPEEQ--SQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIA 66
L + + + S PP+ +++ E++ KF K Y+ E R +++ N KIE N
Sbjct: 17 LLILCLGMMSAAPPPDPSLDNEWKEWKTKFAKAYNLNEERHRRLVWEENKKKIEAHNADY 76
Query: 67 INHKADTKFGVNKFADLSSDEFK-NYYLNN-KEAIFTDDLP 105
K G+N+F+DL+ +EFK N Y N+ DLP
Sbjct: 77 EQGKTSFYMGLNQFSDLTPEEFKTNCYGNSLNRGEMAPDLP 117
>sp|P05689|CATX_BOVIN CATHEPSIN
Length = 73
Score = 40.2 bits (92), Expect = 0.006
Identities = 15/40 (37%), Positives = 24/40 (59%), Gaps = 5/40 (12%)
Query: 292 LDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYI 331
++H + + G+ + M YWIV+NSWG WGE G++
Sbjct: 9 INHIVSVAGWGVSD-----GMEYWIVRNSWGEPWGEHGWM 43
>sp|P12400|CT2B_MOUSE CTLA-2-BETA PROTEIN PRECURSOR
Length = 141
Score = 38.7 bits (88), Expect = 0.019
Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 1/85 (1%)
Query: 6 LFVLAVFTVFVSSRGIP-PEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNL 64
+F+L + +S+ P P +++ E++ F K YS +E R +++ N KIE N
Sbjct: 20 VFLLILCLGMMSAAPSPDPSLDNEWKEWKTTFAKAYSLDEERHRRLMWEENKKKIEAHNA 79
Query: 65 IAINHKADTKFGVNKFADLSSDEFK 89
K G+N+F+DL+ +EF+
Sbjct: 80 DYERGKTSFYMGLNQFSDLTPEEFR 104
>sp|P23897|HSER_RAT HEAT-STABLE ENTEROTOXIN RECEPTOR PRECURSOR (GC-C) (INTESTINAL
GUANYLATE CYCLASE) (STA RECEPTOR)
Length = 1072
Score = 35.6 bits (80), Expect = 0.16
Identities = 32/120 (26%), Positives = 56/120 (46%), Gaps = 19/120 (15%)
Query: 15 FVSSRGIPPEEQSQFLEFQDK----FNKKYSHEEYLERFEIFKSNL-GKIEELNLIAINH 69
+V G PE+ +L + F++ S ++ L R E F+ L G+ + N+I +
Sbjct: 190 YVYKNGSEPEDCFWYLNALEAGVSYFSEVLSFKDVLRRSEQFQEILMGRNRKSNVIVMCG 249
Query: 70 KADTKFGVN---KFAD----LSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE 122
+T + V K AD + D F N+Y F DD +Y+D+ + ++PPE+
Sbjct: 250 TPETFYNVKGDLKVADDTVVILVDLFSNHY-------FEDDTRAPEYMDNVLVLTLPPEK 302
>sp|P20736|BM86_BOOMI GLYCOPROTEIN ANTIGEN BM86 PRECURSOR (PROTECTIVE ANTIGEN)
Length = 650
Score = 35.2 bits (79), Expect = 0.22
Identities = 24/81 (29%), Positives = 36/81 (43%), Gaps = 5/81 (6%)
Query: 151 TTGNVEGQHFISQNKLVSLSEQNLVDC----DHECMEYEGEEACDEGCNGGLQPNAYNYI 206
TT N + KL + + + +C DHEC +++C E NG Q + +
Sbjct: 533 TTCNPKEIQECQDKKLECVYKNHKAECECPDDHECYREPAKDSCSEEDNGKCQSSGQRCV 592
Query: 207 IKNG-GIQTESSYPYTAETGT 226
I+NG + E S TA T T
Sbjct: 593 IENGKAVCKEKSEATTAATTT 613
>sp|P46992|YJR1_YEAST HYPOTHETICAL 43.0 KD PROTEIN IN CPS1-FPP1 INTERGENIC REGION
Length = 396
Score = 32.0 bits (71), Expect = 1.9
Identities = 39/191 (20%), Positives = 77/191 (39%), Gaps = 39/191 (20%)
Query: 77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPPEE-------------- 122
VNKF D++++E + ++ + P+ADYL F + ++
Sbjct: 42 VNKFKDITNNESCTCEVGDRVWFSGKNAPLADYLSVHFRGPLKLKQFAFYTSPGFTVNNS 101
Query: 123 QTAFDW----------RTRGAVTPVKNQGQCGSCW-------SFSTTGNVEGQHFISQNK 165
+++ DW +T VT + + G+ C S + TG+ ++
Sbjct: 102 RSSSDWNRLAYYESSSKTADNVTFLNHGGEASPCLGNALSYASSNGTGSASEATVLADGT 161
Query: 166 LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETG 225
L+S ++ ++ + C + ++ C +G P Y Y GG T + + E
Sbjct: 162 LISSDQEYIIYSNVSCPKSGYDKGCGVYRSG--IPAYYGY----GG--TTKMFLFEFEMP 213
Query: 226 TQCNFNSANIG 236
T+ NS++IG
Sbjct: 214 TETEKNSSSIG 224
>sp|P28493|PR5_ARATH PATHOGENESIS-RELATED PROTEIN 5 PRECURSOR (PR-5)
Length = 239
Score = 32.0 bits (71), Expect = 1.9
Identities = 24/93 (25%), Positives = 36/93 (37%), Gaps = 7/93 (7%)
Query: 137 VKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNG 196
++ G G C G V + + L + + N+V C C + ++ C G N
Sbjct: 137 IRPSGGSGDC---KYAGCVSDLNAACPDMLKVMDQNNVVACKSACERFNTDQYCCRGAND 193
Query: 197 GLQ---PNAYNYIIKNGGIQTESSYPYTAETGT 226
+ P Y+ I KN SY Y ET T
Sbjct: 194 KPETCPPTDYSRIFKN-ACPDAYSYAYDDETST 225
>sp|P54634|POLN_LORDV NON-STRUCTURAL POLYPROTEIN [CONTAINS: RNA-DIRECTED RNA POLYMERASE ;
THIOL PROTEASE 3C ; HELICASE (2C LIKE PROTEIN)]
Length = 1699
Score = 31.3 bits (69), Expect = 3.2
Identities = 13/31 (41%), Positives = 21/31 (66%)
Query: 17 SSRGIPPEEQSQFLEFQDKFNKKYSHEEYLE 47
SS+G+ EE ++ +++ N KYS EEYL+
Sbjct: 893 SSKGLSDEEYDEYKRIREERNGKYSIEEYLQ 923
>sp|Q02521|SPP2_YEAST SPLICEOSOME MATURATION PROTEIN SPP2
Length = 185
Score = 30.9 bits (68), Expect = 4.2
Identities = 24/99 (24%), Positives = 47/99 (47%), Gaps = 6/99 (6%)
Query: 30 LEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKF---GVNKF-ADLSS 85
L+ K KK ++ ++ + K+NL ++ +++HK +K ++KF D S
Sbjct: 6 LKLGSKTLKKNISKKTKKKNSLQKANLFDWDDAETASLSHKPQSKIKIQSIDKFDLDEES 65
Query: 86 DEFKNYYLNNKEAIFT--DDLPVADYLDDEFINSIPPEE 122
K + E T +D P+ +Y+ ++ N +P EE
Sbjct: 66 SSKKKLVIKLSENADTKKNDAPLVEYVTEKEYNEVPVEE 104
>sp|P41901|SPR3_YEAST SPORULATION-SPECIFIC SEPTIN
Length = 512
Score = 30.9 bits (68), Expect = 4.2
Identities = 17/58 (29%), Positives = 29/58 (49%), Gaps = 9/58 (15%)
Query: 60 EELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINS 117
+ +NLI + K+D L+ +E KN+ +E I D+PV + DE +N+
Sbjct: 237 KRVNLIPVIAKSDL---------LTKEELKNFKTQVREIIRVQDIPVCFFFGDEVLNA 285
>sp|Q01532|BLH1_YEAST CYSTEINE PROTEINASE 1 (Y3) (BLEOMYCIN HYDROLASE) (BLM HYDROLASE)
Length = 454
Score = 30.5 bits (67), Expect = 5.5
Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 11/66 (16%)
Query: 111 DDEFINS--IPPEEQTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS 168
DD +N + ++ F+ TPV NQ G CW F+ T +Q +L
Sbjct: 36 DDALLNKTRLQKQDNRVFNTVVSTDSTPVTNQKSSGRCWLFAAT---------NQLRLNV 86
Query: 169 LSEQNL 174
LSE NL
Sbjct: 87 LSELNL 92
>sp|P24896|NU5M_CAEEL NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5
Length = 527
Score = 30.5 bits (67), Expect = 5.5
Identities = 21/52 (40%), Positives = 26/52 (49%), Gaps = 7/52 (13%)
Query: 44 EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNN 95
+YL + I+K K +L L IN K T F LSS FKNYYL +
Sbjct: 466 DYLAKNSIYKMKNLKFMDLFLNNINSKGYTLF-------LSSGMFKNYYLKS 510
>sp|P25648|SRB8_YEAST SUPPRESSOR OF RNA POLYMERASE B SRB8
Length = 1427
Score = 30.1 bits (66), Expect = 7.2
Identities = 22/89 (24%), Positives = 44/89 (48%), Gaps = 10/89 (11%)
Query: 21 IPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGV--- 77
+PP + S F++ + Y EE ++ E F NLG + ++ I H+ + K+ +
Sbjct: 1314 LPPFQVSSFVKETKLHSGDYGEEEDADQEESFSLNLG----IGIVEIAHENEQKWLIYDK 1369
Query: 78 --NKFADLSSDEFKNYYLNNKEAIFTDDL 104
+K+ S E ++++N +TDD+
Sbjct: 1370 KDHKYVCTFSME-PYHFISNYNTKYTDDM 1397
>sp|Q04723|PEPC_LACLC AMINOPEPTIDASE C
Length = 436
Score = 30.1 bits (66), Expect = 7.2
Identities = 11/20 (55%), Positives = 14/20 (70%)
Query: 311 NMPYWIVKNSWGADWGEQGY 330
N W V+NSWG D G++GY
Sbjct: 370 NSTKWKVENSWGKDAGQKGY 389
>sp|Q13867|BLMH_HUMAN BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH)
Length = 455
Score = 29.7 bits (65), Expect = 9.4
Identities = 10/17 (58%), Positives = 13/17 (75%)
Query: 315 WIVKNSWGADWGEQGYI 331
W V+NSWG D G +GY+
Sbjct: 392 WRVENSWGEDHGHKGYL 408
>sp|P87362|BLMH_CHICK BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH) (AMINOPEPTIDASE H)
Length = 455
Score = 29.7 bits (65), Expect = 9.4
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 315 WIVKNSWGADWGEQGYIYL 333
W V+NSWG D G +GY+ +
Sbjct: 392 WRVENSWGEDRGNKGYLIM 410
>sp|P70645|BLMH_RAT BLEOMYCIN HYDROLASE (BLM HYDROLASE) (BMH)
Length = 454
Score = 29.7 bits (65), Expect = 9.4
Identities = 10/17 (58%), Positives = 13/17 (75%)
Query: 315 WIVKNSWGADWGEQGYI 331
W V+NSWG D G +GY+
Sbjct: 392 WRVENSWGEDHGHKGYL 408
Database: /home/peter/blast/data/swissprot
Posted date: Oct 10, 2000 10:43 AM
Number of letters in database: 31,984,247
Number of sequences in database: 88,780
Lambda K H
0.317 0.136 0.414
Lambda K H
0.270 0.0477 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23348054
Number of Sequences: 88780
Number of extensions: 1039466
Number of successful extensions: 3135
Number of sequences better than 10.0: 162
Number of HSP's better than 10.0 without gapping: 118
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 2557
Number of HSP's gapped (non-prelim): 148
length of query: 351
length of database: 31,984,247
effective HSP length: 50
effective length of query: 301
effective length of database: 27,545,247
effective search space: 8291119347
effective search space used: 8291119347
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.6 bits)
S2: 65 (29.7 bits)