Introduction

The substrate and protease binding model and nomenclature used in this software is popularly accepted and used and was given by Schechter and Berger [1]. The substrate comes and binds to specific binding sites present on the protease. Substrate here refers to protein present in the food source and protease refers to the protein digesting enzymes present in the human body. Schematic representation of the substrate-protease complex has been provided in the figure below [Fig-1]. For this web server the nomenclature used is as follows:
The sub-sites on the protease are named "S1-S4" on the left side of the point of cleavage and “S1’-S4’” on the right side of the point of cleavage. Similarly the amino acids on the substrate were named as “P1-P4” on the N terminal end of the point of cleavage and “P1’-P4’” on the C terminal end of the point of cleavage.
Fig 1 - Schematic representation of substrate-protease complex and its nomenclature
For simplicity only substrate part is represented in the examples given below. The protease and substrate may interact at other places but to generalize and simplify only the interaction of eight amino acids has been considered with the substrate. Detailed description of the proteases used and their specific cut site is given below.

Proteases naturally present in Human Body

  1. Enzyme Name: Elastase 1

Gene: CELA1

Uniprot ID: Q9UNI1

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.36

Enzyme Category: hydrolases

PDB ID: na



Cut Site: Hydrolysis of proteins, including elastin. Preferential cleavage: Ala-|-Xaa.

Description: They belong to the family of serine proteases. Humans have six elastase genes which encode structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B. The encoded preprotein is proteolytically processed to generate the active protease. Elastase 1 is only expressed in skin keratinocytes. They are not expressed in pancreas as are other elastases.[2]
Deficiency: Elastase 1 is known to be associated with diseases like Steatorrhea and Pancreatic Acinar Cell Adenocarcinoma.


  1. Enzyme Name: Neutrophil elastase (Elastase 2)

Gene: ELANE

Uniprot ID: P08246

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.37

Enzyme Category: hydrolases

PDB ID: 2RG3, 5A0A, 2Z7F


Protein Figure:



Cut Site: Hydrolysis of proteins, including elastin. Preferential cleavage: Val-|-Xaa > Ala-|-Xaa.

Description: They belong to the family of serine proteases. Humans have six elastase genes which encode structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B.. The encoded preprotein is proteolytically processed to generate the active protease. The enzyme may play a role in degenerative and inflammatory diseases through proteolysis of collagen-IV and elastin. It also inhibits C5a-dependent neutrophil enzyme release and chemotaxis.[3]
Deficiency: Mutations in this gene are related with cyclic neutropenia and severe congenital neutropenia (SCN). Neutropenia is a condition in which the number of neutrophils in our body decreases.

  1. Enzyme Name: Chymotrypsinogen B1

Gene: CTRB1

Uniprot ID: P17538

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.1

Enzyme Category: hydrolases

PDB ID: na


Cut Site: Preferential cleavage: Tyr-|-Xaa, Trp-|-Xaa, Phe-|-Xaa, Leu-|-Xaa

Description: Chymotrypsinogen B1 is a protein from serine protease family of protease. These proteins are secreted in the cells of pancreas and are released in small intestine. They then undergo activation in small intestine. They cleave their substrate in small intestine.[4]
Deficiency: Inactivation of chymotrypsinogen will lead to digestion related problems.


  1. Enzyme Name: Chymotrypsin-C

Gene: CTRC

Uniprot ID: Q99895

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.2

Enzyme Category: hydrolases

PDB ID: 4H4F


Protein Figure:



Cut Site: Preferential cleavage: Leu-|-Xaa, Tyr-|-Xaa, Phe-|-Xaa, Met-|-Xaa, Trp-|-Xaa, Gln-|-Xaa, Asn-|-Xaa.

Description: Chymotrypsinogen C is a protein from serine protease family of protease. This protein acts as a serum calcium decreasing factor. They also regulates activation and degradation of trypsinogen and procarboxypeptidase.[5]
Deficiency: Chymotrypsin C is related with diseases like Pancreatitis.


  1. Enzyme Name: Pancreatic Endopeptidase E

Gene: CELA3A

Uniprot ID: P09093

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.70

Enzyme Category: hydrolases

PDB ID: na


Cut Site: Preferential cleavage: Ala-|-Xaa. Does not hydrolyze elastin.


Description: They belong to the family of serine proteases. Humans have six elastase genes which encode structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B. The encoded preprotein is proteolytically processed to generate the active protease. These proteins are secreted in the cells of pancreas and are released in small intestine. They then undergo activation in small intestine. They cleave their substrate in small intestine. They also help in intestinal transport and metabolism.[6]
Deficiency: Pancreatic Endopeptidase E is related with diseases like Holoprosencephaly.


  1. Enzyme Name: Enteropeptidase

Gene: TMPRSS15

Uniprot ID:P98073

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.9

Enzyme Category: hydrolases

PDB ID: 4DGJ


Protein Figure:

Cut Site: Activation of trypsinogen by selective cleavage of 6-Lys-|-Ile-7 bond.

Description: These protein belongs to the family of trypsin peptidases which is a subpart of serine proteases. They consist of two chains joined together by disulfide bond. They are responsible for converting trypsinogen to trypsin.[7]
Deficiency: Mutation in this gene causes Enterokinase deficiency and Hymenolepiasis. Enterokinase deficiency is malabsorption disorders which may lead to diarrhea and other related problems. Hymenolepiasis is small intestine related diseases which is caused by infestation by tapeworm.


    Enzyme Name: Prostasin

Gene: PRSS8

Uniprot ID: Q16651

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.-

Enzyme Category: hydrolases

PDB ID: 3DFJ, 3GYL, 3E16


Protein Figure:


Cut Site: RKRK-ISGK Possesses a trypsin-like cleavage specificity with a preference for poly-basic substrate

Description: These proteins belong to chymotrypsin family of serine protease. Its active form consist of two chains joined together by disulfide bond. The active enzyme is expressed in prostate epithelia in huge amount. It shows trypsin like substrate specificity. Apart from that it also proteolytically activates epithelial sodium channel. [8]


Deficiency: Prostatin is related to diseases like Follicular Ichthyosis and Chromophobe Renal Cell Carcinoma.


  1. Enzyme Name: Trypsin - 1

Gene: PRSS1

Uniprot ID: P07477

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.21.4

Enzyme Category: hydrolases

PDB ID: 1FXY, 2RA3, 4WWY


Protein Figure:



Cut Site: Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa.

Description: These protein belongs to the family of trypsin peptidases which is a subpart of serine proteases. These proteins are secreted in the cells of pancreas and are released in small intestine. They then undergo activation in small intestine. These genes are present on chromosome 7.[9]
Deficiency: Mutations in this gene are associated with hereditary pancreatitis.


  1. Enzyme Name: Pepsin

Gene: PGA4

Uniprot ID: P0DJD7

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.23.1

Enzyme Category: hydrolases

PDB ID: 1PSO, 1PSN, 1QRP


Protein Figure:


Cut Site: Due to broad specificity of pepsin there are two types of cut site are present.


Description: This protein belongs to peptidase A1 family of endopeptidases. The encoded precursor is secreted by gastric chief cells. These proteins converts to active form in acidic conditions of the stomach. This protein helps in the digestion of dietary proteins. Deficiency: Pepsinogen levels in serum may serve as a biomarker for atrophic gastritis and gastric cancer. So changes in its level may affect these diseases in one or the other form.[10]

  1. Enzyme Name: Gastricsin

Gene: PGC

Uniprot ID: P20142

Organism: Homo sapiens (Humans)

EC Number: EC:3.4.23.3

Enzyme Category: hydrolases

PDB ID: 1AVF, 1HTR


Protein Figure:



Cut Site: More restricted specificity than pepsin A, but shows preferential cleavage at Tyr-|-Xaa bonds. High activity on hemoglobin.

Description: This protein belongs to family of aspartic proteinase. This protein is secreted in stomach and forms an important component of gastric mucosa. This protein becomes active in acidic conditions itself. [11]
Deficiency: Mutations in this gene or inactivation of this protein are related to gastric cancers. Also its expression level in serum may serve as a biomarker for atrophic gastritis.

External Proteases

  1. Enzyme Name: Fruit Bromelain

Gene: na

Uniprot ID: O23791

Organism: Ananas comosus (Pineapple)

(Ananas ananas)

EC Number: EC:3.4.22.33

Enzyme Category: hydrolases

PDB ID: na


Cut Site: Hydrolysis of proteins with broad specificity for peptide bonds. Bz-Phe-Val-Arg-|-NHMec is a good synthetic substrate[12]

Description: It is a type of glucoprotein proteinase. It is extracted from pineapple. Fruit Bromelain has many medical as well as industrial applications. It is also used in the prevention of diarrhoea, digestive aids, antithrombotic, oedema treatment and osteoarthritis and promotes the absorption of antibiotic drugs. It also has immunological effect by regulating and activating immune cells and their cytokine production. It is also related to blood coagulation, fibrinolysis, and platelet functions.[13]


  1. Enzyme Name: Stem Bromelain

Gene: na

Uniprot ID: P14518

Organism: Ananas comosus (Pineapple)

(Ananas ananas)

EC Number: EC:3.4.22.32

Enzyme Category: hydrolases

PDB ID: na

Cut Site: Broad specificity for cleavage of proteins, but strong preference for Z-Arg-Arg-|-NHMec among small molecule substrates.[14]


Description: It is a part of bromelain family and belongs to the class of cysteine endopeptidase. It is extracted from pineapple and forms major part of pineapple proteolytic enzyme. It has great therapeutic value. It is responsible for reversible inhibition of platelet aggregation, bronchitis, sinusitis, surgical traumas, and thrombophlebitis. It also causes enhanced absorption of drugs. It also shows anti-metastasis and anti-inflammatory activities.


  1. Enzyme Name: Ananain

Gene: AN1

Uniprot ID: P80884

Organism: Ananas comosus (Pineapple)

(Ananas ananas)

EC Number: EC:3.4.22.31

Enzyme Category: hydrolases

PDB ID: na

Cut Site: Hydrolysis of proteins with broad specificity for peptide bonds. Best reported small molecule substrate Bz-Phe-Val-Arg-|-NHMec, but broader specificity than fruit bromelain[15]

Description: Ananain is a part of cysteine protease and comes from stem of a pineapple. It contains around seven cysteine residues and has high structural similarity with caricain and papain. It is strongly inhibited by chicken egg-white cystatin. For full activation of the enzyme the presence of a reducing agent required. Due to its high activity, ananain can also be considered as a debriding agent for burn injuries.


  1. Enzyme Name: Papaya proteinase 4

Gene: na

Uniprot ID: P05994

Organism: Carica papaya (Papaya)

EC Number: EC:3.4.22.25

Enzyme Category: hydrolases

PDB ID: 1GEC


Protein Figure:

Cut Site: Preferential cleavage: Gly-|-Xaa, in proteins and in small molecule substrates.[16]

Description: It belongs to the cysteine family of proteases. But unlike other members of cysteine family protease, it does not get inhibited by chicken cystatin. This is attributed to the presence of conserved Gly residue. Similarly there are other amino acid substitution which are responsible for its unusual activity and makes it different from other family members.


  1. Enzyme Name: Chymopapain

Gene: na

Uniprot ID: P14080

Organism: Carica papaya (Papaya)

EC Number: EC:3.4.22.6

Enzyme Category: hydrolases

PDB ID: 1YAL


Protein Figure:

Cut Site: Specificity similar to papain[17]

Description: It is extracted from the latex of papaya. It belongs to the family of cysteine peptidase. It has great medicinal use. Chymopapain was also taken in form of injections. It was injected directly into a herniated disk in the spine to dissolve part of the disk and relieve the pain and other problems caused by the disk pressing on a nerve. But these injections also had some side effects. So they were discontinued.


  1. Enzyme Name: Caricain

Gene: na

Uniprot ID: P10056

Organism: Carica papaya (Papaya)

EC Number: EC:3.4.22.30

Enzyme Category: hydrolases

PDB ID: 1MEG, 1PCI, 1PP0


Protein Figure:

Cut Site: Specificity similar to papain[18]

Description: Caricain is obtained from papaya plant,Carica papaya. It is an example of enzyme therapy. Caricain has the ability to detoxify gliadin in whole wheat flour by proteolytically cleaving it. Thus caricain can be used in developing bread suitable for coeliacs and gluten intolerant individuals.


  1. Enzyme Name: Chymosin

Gene: CYM

Uniprot ID: P00794

Organism: Bos taurus (Bovine)

EC Number: EC:3.4.23.4

Enzyme Category: hydrolases

PDB ID: 4AUC, 3CMS, 1CZI


Protein Figure:

Cut Site: Broad specificity similar to that of pepsin A. Clots milk by cleavage of a single 104-Ser-Phe-|-Met-Ala-107 bond in kappa-chain of casein.[19]


Description: Chymosin is also known as renin. Though it is mainly found in young animals, it is also found in human infants for a very short period of time. Its main function is to coagulate milk in stomach. It is also used in the production of cheese in industries in the form of cheese.

References

  1. Schechter I., Berger A; On the size of the active site in proteases. I. Papain;Biochemical and Biophysical Research Communication (1967) 27: 157.
  2. Boros, Eszter, et al. "Overlapping specificity of duplicated human pancreatic elastase 3 isoforms and archetypal porcine elastase 1 provides clues to evolution of digestive enzymes." Journal of Biological Chemistry 292.7 (2017): 2690-2702.
  3. Reis, Rui L., and Julio San Román, eds. Biodegradable systems in tissue engineering and regenerative medicine. Crc Press, 2004.
  4. Kohlmeier, Martin. Nutrient Metabolism: Structures, Functions, and Genes. Academic Press, 2015.
  5. Rawlings, Neil D., and Guy Salvesen. "Handbook of proteolytic enzymes, vol. 3." (2013)
  6. Strauss, A. W., et al. "Characterization of an endopeptidase involved in pre-protein processing." Proceedings of the National Academy of Sciences 76.9 (1979): 4225-4229
  7. Boyer, Paul D., ed. Hydrolysis: peptide bonds. Vol. 3. Academic press, 1971.
  8. Zhenyue Tong, Beate Illek, Vikash J. Bhagwandin, George M. Verghese, and George H. Caughey. “Prostasin, a membrane-anchored serine peptidase, regulates sodium currents in JME/CF15 cells, a cystic fibrosis airway epithelial cell line.” American Journal of Physiology-Lung Cellular and Molecular Physiology 2004 287:5, L928-L935
  9. Berg, Jeremy M., John L. Tymoczko, and Lubert Stryer. Biochemistry (Loose-Leaf). Macmillan, 2008.
  10. Keil, B. Specificity of proteolysis. Springer-Verlag Berlin-Heidelberg-NewYork, pp.335. (1992)
  11. Rawlings, Neil D., and Guy Salvesen. "Handbook of proteolytic enzymes, vol. 3." (2013)
  12. Silveira, E., et al. "Expanded bed adsorption of bromelain (EC 3.4. 22.33) from Ananas comosus crude extract." Brazilian Journal of Chemical Engineering 26.1 (2009): 149-157.
  13. Corzo, Carlos A., Krzysztof N. Waliszewski, and Jorge Welti-Chanes. "Pineapple fruit bromelain affinity to different protein substrates." Food Chemistry 133.3 (2012): 631-635.
  14. Silveira, E., et al. "Expanded bed adsorption of bromelain (EC 3.4. 22.33) from Ananas comosus crude extract." Brazilian Journal of Chemical Engineering 26.1 (2009): 149-157.
  15. Rowan, Andrew D., and David J. Buttle. "[38] Pineapple cysteine endopeptidases." Methods in Enzymology. Vol. 244. Academic Press, 1994. 555-568.
  16. Risør, Michael W., et al. "Enzymatic and structural characterization of the major endopeptidase in the Venus flytrap digestion fluid." Journal of Biological Chemistry 291.5 (2016): 2271-2287.
  17. Zucker, Stanley, et al. "The proteolytic activities of chymopapain, papain, and papaya proteinase III." Biochimica et Biophysica Acta (BBA)-Protein Structure and Molecular Enzymology 828.2 (1985): 196-204.
  18. Elsamadony, M., et al. "Use of Carica papaya enzymes for enhancement of H2 production and degradation of glucose, protein, and lipids." Energy Procedia 75 (2015): 975-980.
  19. Henschel, Michael J., Michael J. Newport, and Veena Parmar. "Gastric proteases in the human infant." Neonatology 52.5 (1987): 268-272.