Decoding the dark proteome: Deep learning-enabled discovery of druggable enzymes in Wuchereria bancrofti
2510.07337v1
q-bio.QM, cs.LG, q-bio.MN
2025-10-12
Authors:
Shawnak Shivakumar, Jefferson Hernandez
Abstract
Wuchereria bancrofti, the parasitic roundworm responsible for lymphatic
filariasis, permanently disables over 36 million people and places 657 million
at risk across 39 countries. A major bottleneck for drug discovery is the lack
of functional annotation for more than 90 percent of the W. bancrofti dark
proteome, leaving many potential targets unidentified. In this work, we present
a novel computational pipeline that converts W. bancrofti's unannotated amino
acid sequence data into precise four-level Enzyme Commission (EC) numbers and
drug candidates. We utilized a DEtection TRansformer to estimate the
probability of enzymatic function, fine-tuned a hierarchical nearest neighbor
EC predictor on 4,476 labeled parasite proteins, and applied rejection sampling
to retain only four-level EC classifications at 100 percent confidence. This
pipeline assigned precise EC numbers to 14,772 previously uncharacterized
proteins and discovered 543 EC classes not previously known in W. bancrofti. A
qualitative triage emphasizing parasite-specific targets, chemical
tractability, biochemical importance, and biological plausibility prioritized
six enzymes across five separate strategies: anti-Wolbachia cell-wall
inhibition, proteolysis blockade, transmission disruption, purinergic immune
interference, and cGMP-signaling destabilization. We curated a 43-compound
library from ChEMBL and BindingDB and co-folded each compound with its target
across multiple protein conformers using Boltz-2. All six targets exhibited at least moderately strong
predicted binding affinities below 1 micromolar, with moenomycin analogs
against peptidoglycan glycosyltransferase and NTPase inhibitors showing
promising nanomolar hits and well-defined binding pockets. While experimental
validation remains essential, our results provide the first large-scale
functional map of the W. bancrofti dark proteome and accelerate early-stage
drug development for the species.
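
The retention step described above (keeping only complete four-level EC classifications at 100 percent confidence) can be illustrated with a minimal sketch. It is not the authors' implementation: the record layout `(protein_id, ec, confidence)`, the function name `retain_confident`, and the example records are all hypothetical, and the abstract does not specify how the rejection sampling is actually carried out. The sketch simply treats it as a filter on predicted EC strings and confidence scores.

```python
import re

# A complete EC number has four numeric levels, e.g. "2.4.1.129".
# Partial calls such as "3.4.-.-" do not match.
FULL_EC = re.compile(r"^\d+\.\d+\.\d+\.\d+$")


def retain_confident(predictions, min_confidence=1.0):
    """Keep (protein_id, ec) pairs that have a complete four-level EC number
    and a confidence at or above min_confidence.

    The (protein_id, ec, confidence) layout is an assumption made for
    illustration; it is not taken from the paper.
    """
    kept = []
    for protein_id, ec, confidence in predictions:
        if FULL_EC.match(ec) and confidence >= min_confidence:
            kept.append((protein_id, ec))
    return kept


# Hypothetical records, not results from the paper.
example = [
    ("prot_A", "2.4.1.129", 1.0),   # complete EC, full confidence: kept
    ("prot_B", "3.4.-.-", 1.0),     # only two levels resolved: dropped
    ("prot_C", "3.6.1.15", 0.82),   # complete EC, lower confidence: dropped
]

print(retain_confident(example))
```

With these made-up records, only `prot_A` survives the filter. A strict threshold of this kind trades coverage for precision, which matches the abstract's emphasis on assigning only precise EC numbers.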