Computer-aided proteolysis design – Protease activity modeling for cleavage site prediction

The Challenge

The growing bioactive peptide market offers vast opportunities to create health-promoting ingredients from underused protein side streams. However, discovering and optimising these peptides remains challenging due to the complexity of protease–substrate interactions and the lack of predictive tools linking enzyme activity to peptide release.

The challenge aimed to develop an intelligent computational pipeline capable of predicting optimal, naturally occurring protease–substrate pairings to guide in silico proteolysis design. This would enable the efficient, scalable, and regulatory-compliant upcycling of low-cost protein sources into high-value ingredients with biofunctional properties such as antioxidant or anti-inflammatory activity.

The Approach

The team built an AI-driven discovery framework designed to model protease efficiency and predict cleavage site specificity across diverse protein substrates. Drawing on the MEROPS database of over 110,000 proteases and the UniProt database covering thousands of proteomes, the pipeline integrated enzyme–substrate information with bioactive peptide datasets from known crop species.

Using machine learning tools such as PanCleave alongside molecular docking software including HADDOCK and AutoDock, the model predicted protease–substrate interactions and prioritised likely bioactive peptide release sites. The workflow mined protein sequences to identify promising substrates, refined cleavage predictions through ML-based scoring, and applied molecular docking to assess enzyme–peptide binding stability. Together, these components formed a scalable proteolysis prediction pipeline that can accelerate discovery cycles and reduce experimental trial-and-error.

The Outcome

The resulting computational workflow demonstrated the feasibility of computer-aided proteolysis design for targeted peptide discovery. It provides a faster, data-driven alternative to conventional enzymatic screening, identifying novel proteolytic pathways with high commercial potential.

By combining protease activity modelling with bioinformatics and molecular simulation, the platform supports the upcycling of industrial protein by-products into functional, health-promoting peptides. The approach aligns with EU non-GMO regulations and offers a promising path to bring bioactive peptide innovation closer to scalable, sustainable production for food and nutraceutical applications.

The Challenge

The Approach

The Outcome

Impact & Next Steps