Abstract
Background
The growing global health burden necessitates precise therapeutics to mitigate pathogenesis and severe complications, which are increasing daily. Recently emerging and re-emerging viral infectious diseases, along with other ongoing complications from diverse infections and pathogens, contribute to global outbreaks. To address this, immediate and accurate therapeutic developments that can help manage this crisis are needed. Concerning the development of therapeutics, the conventional method-based drug design is time-consuming and requires a substantial investment of time to develop a drug against the pathogen successfully.
Main body of the abstract
To overcome these present obstacles, artificial intelligence (AI) came as a hope of revolutionizing the detection and advancement of pioneering, precise, cost- and time-effective drugs. AI uses advanced algorithms to improve the accuracy regarding target identification and further inhibitor selection. The pathogens were re-emerging daily, simultaneously, generating a huge amount of data with various specific properties and other essential details. Among them, some data can be helpful for therapeutic development. Using AI-based pipelines, tools, servers, databases, and useful resources to aid drug discovery, and employing different algorithms to examine the data, it was possible to identify a potential target that could aid therapeutic development; similarly, it also helped revolutionize the clinical aspects of drug discovery and the pharmaceutical industry by enabling more specific data handling. Moreover, it can help utilize available drugs and their significant details to address emerging and ongoing diseases through a drug repurposing-based approach using advanced AI-based computational analysis.
Short conclusion
Herein, this study offers the AI insight toward the drug discovery and development, how these approaches were utilized, and their advancements and challenges.
Similar content being viewed by others
Background
Artificial intelligence (AI) is one of the leading and authentic methods in drug discovery, employing various AI-assisted algorithms that help to reduce time and error rates with high accuracy. Drug discovery has faced challenges due to conventional methodologies that have not effectively addressed multiple diseases at a time and were specific. The successful drug design will take an average of 15 years, as this process utilizes various technologies and expertise for therapeutic development [1, 2]. Therefore, researchers developed AI-assisted methods to revolutionize the drug discovery process. AI is one of the most useful applications and applicable in resources, such as apps ranging from recommendations to classifications and diagnostics [3]. The implementation and acclaim of AI-driven developments in computer-aided drug design (CADD) and drug discovery have been acknowledged as the most innovative approach to reshaping conventional strategies in the pharmaceutical sector. CADD is the beginning of the computer-assisted approaches for developing and identifying the most promising compound methodology [2, 4,5,6]. In the process of discovering the drug, utilizing AI-assisted techniques helps mitigate failures among preclinical lead molecules. Before finalizing the most promising one, various properties such as molecular activity, physicochemical properties, pharmacokinetics, and bioactivity are applied and screened at the in silico level. In the AI-assisted drug discovery process, high-tech computer software programs and applications are essential as they can help evaluate and reveal the most promising one using the advanced machine learning (ML) algorithm, including its various fundamental and crucial aspects such as dataset preparation, model generation, and model evaluation etc. [7,8,9,10]. Over the years, in the drug development process, a significant surge in data digitization has occurred within the domain of the development of drugs and primarily utilized in the pharmaceutical sectors the beginning with high accuracy for the novel findings, as it can easily handle bulk data information with enhanced automation [11,12,13]. For successful drug discovery, the identified drug-like compounds must have a promising profile based on the validation; the AI-assisted algorithm can quickly identify the most promising hits and help understand their activity on the drug target by optimizing the structure [1, 14,15,16]. In the present situation, researchers are utilizing several AI-based applications for the drug discovery process. The basic representation of drug discovery is shown in Fig. 1, highlighting the key steps and the integration of AI, which can aid various stages of the process. Each step involves more precise analyses aimed at supporting the development of therapeutics. As AI-assisted studies increase daily, it is essential to understand the basic mechanisms of AI’s involvement and its application in drug design, and how these applications are beneficial. So, this study aims to explore AI in drug discovery and provide a brief idea about the available resources that assist the drug discovery process. Moreover, the pharmaceutical and clinical aspects, along with the critical insight, were discussed. Additionally, the more in-depth insight via a study based on AI-assisted drug design was also discussed, along with the advancement and challenges.
The representation of basic steps involved in AI-assisted drug discovery
AI-assisted application in drug discovery
In the present situation, AI is involved in several domains, including data collection, classification, dataset generation, and validation, among others, using various algorithms to maintain AI-assisted automation [15, 16]. Based on the different sets of applications, it can easily handle the bulk datasets with enhanced automation. Through consideration of drug exploration, several applications were developed to maintain accuracy and easily identify novel inhibitors for target pathogens. When it comes to drug discovery, things like ML, artificial neural networks (ANNs), deep learning (DL), random forest, support vector machines (SVMs), and deep neural networks (DNNs) are of particular interest to scientists. These algorithms aid in various tasks, such as bioactivity analysis, drug–target prediction, binding affinity calculation, and many more [1, 5]. The ML is a set of AI that can help analyze patterns in data, which is then classified to enhance the accuracy of a task. ANN is a subset of ML that can help solve biological problems regarding data and accelerate and optimize the drug discovery process [17, 18]. Deep learning is one of the most useful subsets of ML in the drug discovery field as it can help recognize the data pattern, easily handle complex data and further classification, and expedite and improve the drug development procedure [16, 19]. Random forest is also a part of ML, which can help to find and predict accurate results in drug design as it utilizes complex data and classifies it as a decision tree at training time, which is helpful to obtain the accuracy level based on the classification [20]. SVM is one of the most important ML-based applications that can help design a predictive model, as it can easily separate data points into different classes for better understanding [21]. Deep neural networks (DNNs) are one of the leading drug design applications. They can easily handle unlabeled and unstructured data based on their multiple hidden layers to represent complex datasets successfully and speed up the drug development procedure while improving the accomplishment rate [5, 22]. Apart from that, these applications have a more significant role in drug discovery, such as small molecule screening, virtual screening, model development, feature selection, quantitative structure–activity relationship (QSAR) modeling, pharmacophore modeling, predicting drug–drug interactions, absorption, distribution, metabolism, and excretion analysis (ADME/T), molecule contact within the target and ligands, the bioactivity and toxicity of analysis of the target compound and inhibitors, data integration and many [5, 16, 23,24,25]. Utilizing the different applications to deal with the drug discovery process, the researchers have applied various applications and successfully performed the study (Refer Table 2; Case study) and found that AI-assisted studies are more impactful due to advanced automation and can accelerate the method of drug development in the coming years for more effective outcomes. Moreover, Fig. 2 summarizes the stages involved, highlighting how different combinations were applied at various steps from raw data collection to final analysis to manage complexity and achieve accurate outcomes effectively.
Overview of the involved strategy in drug design through the integration of advanced applications
Tools and databases in drug discovery
At present, scientific researchers are keenly interested in drug development using advanced approaches that are easy to understand and offer high accuracy, leading to successful therapeutic outcomes. However, to retrieve, analyze, and understand the data, various tools and databases have been developed to aid therapeutic development. These resources are freely available and can help to retrieve the chemical compound and target sequence and their essential information, as Fig. 3 illustrates the involvement of tools, databases, servers, and various applications that support therapeutics design by generating diverse outputs, which collectively aid the drug discovery process. Also, it is helpful for the different types of biological property analysis, crucial in drug design. The useful tools and databases are represented in Table 1 with their corresponding URLs and remarks.
Overview of the various approaches in drug design. This figure illustrates how the raw data lead to the significant section, which is eventually helpful for the drug discovery process
To aid and accelerate the drug discovery process, several resources are available that are highly useful and lead to significant outcomes. The navigation, collection, analysis, and interpretation of data rely on these resources, which help researchers overcome various challenges. Advancing the approach through the integration of AI, ML, and other subsets of algorithms is a major part; however, to achieve more accurate results, researchers frequently update versions and enhance datasets to understand complex biological problems. Moreover, these resources were commonly utilized in different steps of the research, which aid the drug discovery process. Concerning drug discovery aspects, several resources are available for sequences, ligands, drug-likeness, and other areas. For example, IMPPAT 2.0, LOTOUS, DrugBank, ChEMBL, and COCONUT can be utilized for compound library generation, aiding in inhibitor design. Deep-PK and ADMETlab 2.0 can be useful for pharmacokinetic aspects, and several other tools are available for use based on the initial input data and study aims. Furthermore, concerning the molecular dynamics simulation, which is vital to analyze the stability of the complex, researchers are exploring insights by integrating ML. Brownless et al. demonstrate the implementation of ML models to MD simulation data [83], and Sodaei et al. highlight in their study the integration of MD simulations with ML [84]. Subsequently, to gain more impactful insight, enhancing resources by integrating AI will help revolutionize the discovery process.
Involvement of AI in drug development: a clinical perspective
AI has emerged as a revolutionary drug development method, streamlining every process from initial screening to experimental trials and post-market investigation. Incorporating AI into clinical settings can significantly accelerate drug development, lower costs, and enhance safety and efficacy. The clinical perspective of AI in drug development is particularly crucial as it directly impacts patient care, regulatory approvals, and the overall healthcare system. With advancements in ML, DL, and natural language processing (NLP), AI-driven approaches have been increasingly applied to enhance decision-making in clinical trials, optimize drug formulations, and personalise treatments based on patient-specific factors. Additionally, AI can integrate huge quantities of biological, pharmacological, and clinical data, enabling a more systematic drug development method [85,86,87]. AI-driven patient recruitment and stratification are transforming clinical trial design by addressing inefficiencies in traditional methods. A prominent example is IBM Watson for clinical trial matching, which leverages natural language processing (NLP) to investigate electronic health records (EHRs) and match patients to trials based on eligibility criteria [88]. This system integrates structured data (e.g., laboratory results, demographics) with genomic biomarkers, such as tumor mutation profiles, to stratify patients into molecularly defined subgroups. ML algorithms, including random forest models, predict patient eligibility and optimize cohort allocation [89]. These advancements emphasize AI’s capacity to harmonize multi-modal data streams, accelerating precision medicine initiatives.
Clinical trial optimization
Clinical trial optimization increasingly relies on predictive analytics and imaging-driven AI tools to enhance efficiency and precision. Like random survival forests and Cox proportional hazards algorithms, ML models enable dynamic risk stratification by analyzing time-to-event data, including adverse drug reactions or disease progression [90]. Adaptive trial designs like the I-SPY 2 trial in breast cancer employ Bayesian ML models to predict treatment response, enabling real-time arm adjustments. This approach reduced required sample sizes by 50% while maintaining statistical power [91]. Similarly, reinforcement learning frameworks optimize dosing schedules by simulating patient responses [92]. Imaging analysis tools like PathAI leverage convolutional neural networks (CNNs) to quantify tumor microenvironment features (e.g., immune cell infiltration, necrosis) from histopathology slides, enabling objective assessment of treatment efficacy. PathAI predicted immunotherapy response in melanoma with 85% accuracy by analyzing spatial patterns of PD-L1 expression [93]. Similarly, QuantX, an FDA-cleared AI platform, standardizes MRI-based tumor volume measurements in glioblastoma trials, achieving 92% inter-rater reliability compared to manual radiologist assessments [94]. These tools integrate with NLP pipelines to extract latent biomarkers from radiology reports, enhancing predictive models for patient stratification. By automating endpoint quantification and reducing inter-observer variability, AI-driven imaging accelerates trial timelines while improving reproducibility.
Drug repurposing and formulation
AI significantly enhances drug repurposing efforts by systematically analyzing molecular interactions within extensive biomedical datasets. An illustration is BenevolentAI’s discovery of baricitinib, a Janus kinase-signal transducer and activator of transcription (JAK-STAT) inhibitor originally licensed for rheumatoid arthritis as a potential COVID-19 treatment. By constructing a knowledge graph that integrated SARS-CoV-2 viral entry mechanisms with host inflammatory pathways, their algorithm identified baricitinib’s dual mechanism: suppression of viral endocytosis through AP2-associated protein kinase-1 (AAK1) inhibition and attenuation of cytokine storm pathology. Subsequent clinical evaluation in the Adaptive COVID-19 Treatment Trial-2 (ACTT-2) demonstrated a 38% mortality reduction [95,96,97].
Pharmacovigilance and post-market surveillance
AI-driven pharmacovigilance utilizes natural language processing (NLP) to extract real-world evidence (RWE) from electronic health records (EHRs) and digital patient communications. The FDA Sentinel Initiative exemplifies this approach, employing transformer-based architectures such as bidirectional encoder representations from transformers (BERT) to identify adverse drug events (ADEs) from unstructured clinical documentation. This methodology has demonstrated a 40% improvement in detection rates compared with traditional manual surveillance systems [98, 99]. Social media platforms are concurrently analyzed through sentiment recognition algorithms to detect patient-reported symptoms. However, notable limitations persist: A 2021 evaluation revealed that 30% of AI-flagged Twitter posts contained non-actionable content due to sarcasm or irrelevance, underscoring the necessity for human verification [100].
Personalized medicine and biomarker discovery
AI platforms such as DeepOmics are revolutionizing personalized medicine by integrating various omics-related data (genomics, transcriptomics, proteomics, metabolomics) to identify biomarkers that can predict patient response to immunotherapy. Moreover, Fig. 4 illustrates the navigation through omics insights, which further leads to personalized plans. These platforms utilize advanced ML algorithms to investigate bulk datasets and reveal synchronicity that human researchers might miss [101,102,103]. By identifying specific biomarkers associated with treatment response, healthcare personnel can fit treatments to specific patients, increasing the likelihood of successful outcomes. Digital twins, in retrospect, play a crucial role in simulating treatment outcomes by creating virtual replicas of individual patients based on their unique biological characteristics. By simulating different scenarios and treatment strategies, digital twins enable personalized treatment plans tailored to each individual’s precise requirements and characteristics [104]. This innovative approach upholds a great possibility for refining the patient’s outcome and helping revolutionize personalized medicine.
Personalized medicine and biomarker discovery. The figure illustrates the insight through the multi-omics based on their involvement in the data and biomarker finding and leads to personalized plans
Involvement of AI in drug development: a pharmaceutical sector perspective
AI is changing medicine development, which will revolutionize the pharmaceutical industry. From discovery to market approval, the normal drug development pipeline takes over 10 years and costs over $2 billion. With just 10% of new chemical entities receiving regulatory clearance, their success rate is low [105,106,107]. Drug development may be streamlined using AI to discover novel targets, enhance lead compounds, and accelerate clinical trials. AI-driven ML algorithms may identify synchronicity and linkage in large datasets that human researchers cannot, speeding up drug discovery. The effectiveness and safety of small chemicals can be predicted by reducing late-stage trial failures [107,108,109]. AI’s involvement in the development of drugs and its potential and limitations must be assessed as the pharmaceutical industry evolves. Significant limitations, including high failure rates, protracted timelines, and substantial costs, hinder the traditional drug development process [1, 86]. To address these challenges, innovative solutions are required to improve the efficiency and effectiveness of drug development. AI recently surfaced as a promising technological advancement that could revolutionize the drug development paradigm. By leveraging AI, researchers can work with bulk data, identify synchronicity, and potentially predict the outcomes, thereby speeding the target identification process, lead optimization, and clinical trial design [105, 110, 111]. Ultimately, integrating AI in drug development can facilitate the discovery, development, and delivery of novel therapeutics, improving patient outcomes and transforming the future of healthcare [110, 112]. Moreover, Fig. 5 demonstrates the insight from the process to solution, followed by the other aspect, which is essential to understand the drug development process precisely.
Traditional drug development (process, limitations, and challenges) and innovative solutions. The figure illustrates the major insight based on the individual aspects of classification of the traditional drug design
AI applications perspective
Integrating AI in drug discovery and development has transformed the pharmaceutical sector, enabling the rapid identification and creation of innovative therapeutic solutions. Among the diverse AI methodologies employed, ML is pivotal in analyzing vast datasets, recognizing outlines, and producing predictive models that accelerate drug discovery [7, 110, 113]. For example, ML algorithms can assess the binding affinities of small molecules to specific targets, thereby enabling the identification of lead compounds [114, 115]. Building on this, DL, a subdivision of ML, has been established as particularly applicable in deciphering complex biological information, like genomic and transcriptomic profiles, enhancing the precision of drug–target interactions [22, 89, 116]. Subsequently, NLP is emerging as a critical tool for extracting relevant data from extensive literature and experimental data. This capability aids in identifying novel drug targets, therapeutic applications, and potential safety concerns, thereby streamlining the drug development pipeline [117, 118]. Furthermore, computer vision has been increasingly used to analyze data from clinical imaging, such as X-rays and MRI, to identify specific biomarkers and disease characteristics. This approach supports the progress of therapies tailored to patient’s needs [119, 120]. These AI methodologies, ML, DL, NLP, and computer vision, have reportedly improved drug development’s efficacy, accuracy, and consistency. By integrating these advanced tools, the pharmaceutical industry is poised to expedite the discovery and formulation of innovative, effective, and safe treatments. These transformative potentials emphasize the profound impact of AI on advancing human health and improving quality of life.
AI-driven innovations from target identification to regulatory submission
The utilization of AI has notably transformed the medical sector, optimizing and improving many drug screening and development phases. Target identification and validation are crucial domains where AI has shown substantial influence. ML algorithms examine large datasets, encompassing genomic, proteomic, and disease pathway data, to discern prospective treatment targets and forecast their effectiveness [112, 114, 121]. AI-driven genomic data analysis has identified new targets for complex illnesses, including cancer and Alzheimer’s [96, 122]. Furthermore, AI models estimate the binding affinity of tiny molecules to various targets, hence aiding in identifying potential lead compounds [113, 114]. In lead optimization and drug design, AI has proven invaluable. Deep learning (DL) algorithms generate novel compounds with enhanced efficacy and safety profiles, significantly reducing development timelines [19, 116]. For instance, AI-optimized lead compounds have been developed for diseases such as MPOX virus and malaria, exhibiting improved potency and reduced toxicity [112, 123]. Furthermore, AI predicts these compounds’ pharmacokinetic and pharmacodynamic properties, aiding in their optimization and selection for further development. In preclinical testing, AI models analyze large datasets to predict compound toxicity, minimizing dependence on animal testing and speeding the development of safer therapeutics [96, 109, 112]. AI also has a major part in experimental trial design and patient recruitment. ML algorithms evaluate EHR and genetic data to identify suitable participants, optimize trial protocols and improve recruitment efficiency [111, 113]. Moreover, AI predicts clinical trial outcomes and patient responses, enabling personalized treatment strategies. In regulatory processes, natural language processing (NLP) tools streamline the analysis and generation of regulatory documents, reducing administrative burdens and expediting approvals [96, 117, 118]. Notable examples of AI-driven success in drug development include in silico Medicine’s identification of a novel fibrosis target, which led to the discovery of a potential therapeutic candidate [124]. Exscientia’s AI-designed molecule DSP-1181, a serotonin 5-HT1A receptor agonist for obsessive–compulsive disorder, entered clinical trials in record time [97, 125]. Additionally, recursion pharmaceuticals leveraged AI to identify drug repurposing opportunities for rare diseases [96]. These examples underscore AI’s transformative potential in accelerating drug development, improving safety, and delivering targeted therapies for unmet medical needs.
Pharmaceutical sector perspective: benefits, challenges, and future scope
Adopting AI in the pharmaceutical sector offers transformative benefits but presents significant challenges (Fig. 6), highlighting the potential impact. AI boosts efficiency and productivity by automating tasks like data analysis, target identification, and compound screening [5, 112, 113]. AI platforms like BenevolentAI have cut drug discovery time from years to just months. AI models improve decision-making and risk assessment by analyzing complex data to predict drug effectiveness, toxicity, and clinical trial results, helping to reduce expensive late-stage failures [112, 123, 126]. AI speeds up finding and creating new treatments by spotting new drug targets and improving lead compounds. A great example is Exscientia’s AI-designed molecule DSP-1181, which moved into clinical trials faster than ever. AI makes healthcare more personal by assessing an individual’s data to customize therapy, as shown by platforms like recursion pharmaceuticals [125, 127]. However, incorporating AI in the medical sector faces several obstacles (Fig. 6). Notably, the scarcity of high-quality and large-scale data, which are often fragmented or proprietary, poses a significant barrier to AI model development [107, 108]. Furthermore, the swift evolution of AI technologies outperforms the progress of homogenous regulatory outlines and validation guidelines, creating additional challenges for AI adoption in drug development. Intellectual property, data security, and talent acquisition/retention concerns impede AI integration [107, 112, 128]. To overcome these hurdles, industry leaders emphasize the importance of cross-industry collaboration and strategic investment in AI capabilities. Partnerships between pharmaceutical companies and technology giants (e.g., Microsoft) demonstrate the value of collaborative efforts in integrating AI into drug discovery processes. Similarly, investments in AI-driven platforms (Pfizer, Chemistry42, PandaOmics) highlight the potential of AI as an enhancer in investigational trials and patient enrollment. Ultimately, while AI adoption in the pharmaceutical sector presents challenges, its transformative potential for drug development and patient care necessitates continued focus and investment [96, 101, 102, 129].
AI adoption in the pharmaceutical sector. The figure illustrates how the involvement of AI has enhanced the pharmaceutical sector and has limitations
Case studies: AI involvement stories in pharmaceutical companies
Prominent case studies, such as Exscientia and BenevolentAI, illustrate the effective incorporation of AI in drug development. Exscientia, an innovative pharmaceutical firm, has utilized ML and DL algorithms to improve drug candidate selection and refine drug development. Exscientia significantly reduced the duration of preclinical drug discovery from years to months by employing AI-driven platforms [96, 101, 102, 130]. The company’s partnership with Sumitomo Dainippon Pharma led to the inaugural AI-designed drug candidate commencing clinical trials after 12 months, illustrating AI’s capacity to expedite drug development schedules. BenevolentAI has effectively incorporated AI into its drug development pipeline via approaches such as NLP and ML. The company’s technology evaluates extensive biological data, encompassing scholarly literature and clinical trial information, to discover new drug candidates and repurpose current medications for alternative uses [112, 123, 126]. In an exemplary situation, baricitinib is recognized as a prospective treatment for severe COVID-19, according to its capacity to obstruct viral entrance into host cells [95]. This finding resulted in clinical trials and eventual approval by regulatory bodies, underscoring the significance of AI in expediting medication repurposing initiatives. The insights derived from these case studies highlight the necessity of amalgamating domain expertise with AI technology, promoting multidisciplinary cooperation, and guaranteeing rigorous validation of AI-generated conclusions. Pharmaceutical businesses must define explicit objectives for AI deployment, invest in robust data infrastructure, and provide openness in algorithmic decision-making processes [130]. By using these best practices, organizations may reduce the risks linked to AI adoption and leverage its revolutionary potential. AI-driven methodologies have effectively improved drug development efficiency, precision, and creativity, presenting a hopeful outlook for the medical division. As the industry progresses, AI is anticipated to have a more substantial role in influencing the impending drug development process. The strategic amalgamation of AI can transform the healthcare division, facilitating the fast creation of innovative, effective, and safe treatments that enhance human health and quality of life. AI and future technologies can improve drug development. AI and blockchain technology can improve clinical trial data quality and traceability, enabling secure and transparent sharing of sensitive data [107, 131]. The Internet of Things (IoT) allows real-time patient health nursing, improving AI models and tailoring treatment techniques. Oncology and uncommon disorders are promising areas for AI use. AI can analyze complex information and find subtle patterns to speed up the development of new therapies for these difficult ailments [96, 126]. In drug development, explainable and transparent AI models that build pharmaceutical industry trust are needed. The lack of interpretability in modern black-box AI systems inhibits regulatory and healthcare practitioner acceptability [97, 108, 132]. Thus, interpretable AI models like symbolic reasoning or rule-based systems are essential for accountability and human supervision. AI can expedite pharmaceutical manufacturing and supply chain management, cut costs, and improve product quality. Automation and predictive analytics improve procedures, reduce waste, and deliver drugs on time. To use AI’s disruptive potential in drug discovery, pharmaceutical companies should invest in robust data infrastructure and diverse knowledge and collaborate with technological and academic institutions (Fig. 7). AI-driven innovations should be integrated into regulatory frameworks, stressing openness and ethics in algorithmic decision-making. Researchers should address data privacy, bias reduction, and model validation to make AI systems reliable and fair. By working together, pharmaceutical ecosystem stakeholders may improve medicine research, development, and delivery for global patients. AI, combined with new technology, might revolutionize the pharmaceutical industry by enabling the quick creation of creative, effective, and safe medications that improve human health and quality of life. AI is expected to become more important in drug development, necessitating stakeholder engagement to harness its promise and overcome new difficulties.
Recommendations for pharmaceutical companies, regulatory agencies, and researchers to harness the potential of AI in drug development
AI-assisted case study in drug discovery
Due to advanced automation based on computer-assisted drug design, worldwide researchers are keenly interested in drug discovery. Several steps, such as data retrieval, dataset generation, descriptor calculation model development, optimization, validation, and finding a potential hit through the virtual screening and molecular docking analysis to comprehend the molecular activity with the target and identified molecules, are the essential steps that are most employed in the process to discover potential key molecules[133,134,135]. Also, to confirm whether the identified molecules have promising profiles, the researchers are performing the drug-likeliness analysis, so it will be helpful to determine whether the identified compound has a significant function and can inhibit the corresponding target [5, 136]. This section will help us understand how worldwide researchers are utilizing different algorithms to successfully identify potential compounds that can be helpful for therapeutic development, as shown in Table 2.
Exploring and utilizing the advanced computational steps, along with the integration of AI, is revolutionizing the drug discovery process. Subsequently, the researchers are employing these approaches, yielding significant outcomes in different emerging, re-emerging, and ongoing pathogens and associated complications.
Zhang et al. employed various ML methods to formulate a classification models that assist in acetyl-CoA carboxylase inhibitor (ACC) identification, and based on the subsequent steps and overall findings, suggest that the employed strategy is significant based on the identification of promising hits [137]. Similarly, Che et al. conducted a study on Interleukin-1 receptor-associated kinase-1 (IRAK1), which is crucial in inflammation and autoimmune diseases, to identify novel inhibitors. Moreover, with the integration of AI and other applications, this study aims to improve the IRAK1-VS protocol, and based on the overall findings, it suggests that the involvement of AI can yield in the VS strategy [23].
Subsequently, the African swine fever virus (ASFV) attacks domestic pigs of all ages and causes a fatal viral disease. No vaccines or antiviral medications are available, and there have been no reports of human damage. Concerning this, Choi et al. employed docking and ML models, along with subsequent steps, to find ASFV inhibitors and also conducted in vitro investigations on the final top complex. The overall employed strategy in this study can be a promising step toward the ASFV antiviral agent identification [138]. Based on recent emerging and re-emerging viral infections, researchers found that repurposing approved drugs can lead to therapeutic development and help reduce infection. Concerning this, Delijewski et al. employed AI-based drug discovery and identified the best promising repurposing candidates by examining FDA-approved compounds. Based on the employed subsequent steps and validation, the study suggests zafirlukast as a promising candidate [24].
Moreover, an unbalanced expression may lead to severe complications; therefore, Narendra et al. targeted Aldehyde dehydrogenases (ALDHs) by employing different ML models and subsequently took a step to find the selective ALDH1A1 inhibitors. Further study suggests that the identified diverse scaffolds-based inhibitors are promising and can serve as novel scaffolds for ALDH1A1 inhibitors [139]. Similarly, Tang B et al. employed a deep Q-learning network with a fragment-based pharmacological design approach to detect innovative covalent inhibitors targeting SARS-CoV-2, considering the other essential steps, and also suggested that the ADQN–FBDD and related pipeline can be utilized for other emerging diseases through structure-based drug discoveries [140]. Focusing on the different ML models concept algorithm, Raju et al. performed a study to identify selective CYP1B1 inhibitors via different ML models, along with other subsequent steps [14]. Similarly, another study by Nguyen et al. employed an ML-based model to analyze the effects of O. basilicum essential oils on the breast cancer cell line MCF-7, assessing their inhibitory properties. Based on the employed different steps and validation, the author suggests that the designed model can potentially screen the bioactive compounds, and the identified compounds can lead to breast cancer treatment [141]. A prevalent metabolic disorder among people who have both type 1 and 2 diabetes, diabetic neuropathy affects approximately 463 million adults in 2019, and the cases were increasing day by day; to overcome this, Kashyap et al. built the 3D CNN prediction models to evaluate the Aldose reductase (ALR2) inhibitors, considering the various steps [142]. Moreover, Patel et al. employed an AI-based DeepRepurpose framework to identify potential inhibitors of the Monkeypox virus (MPXV), which is a recently emerged viral infection. Based on the subsequent steps employed, the overall finding shows that Elvitegravir is a promising inhibitor; however, experimental validation is needed to ensure efficacy [143]. Following the recent trend, Chikhale et al. focused on generative AI and subsequent steps to explore new EthR ligands. In this study, the authors utilized the REINVENT4, which is a GenAI tool; moreover, the authors suggest that the designed framework can be applied to other drug resistance-based complications [135]. Similarly, concerning the drug resistance mechanism, Garg et al. employed the AI-ML along with an in silico approach to find anti-TB compounds. In this study, the authors initially employed various in silico steps and further integrated the AI-ML to validate the findings. Moreover, the authors suggest that further experimental validations are required [144]. Another study by Suliman et al. [145] focused on the derivative insights of pyrazolone, employing AI-generated derivatives to overcome fungal infections. Based on various validation steps, the study suggested a therapeutic candidate that could be used to address the infection, and also emphasized the need for future experimental validation to confirm the outcomes [145]. Colorectal cancer (CRC) is one of the ongoing complications in infected individuals, and to make an effort toward the therapeutic design, Ali et al. targeted RPS20 to identify the plant-derived compound as a promising inhibitor. In this study, the author initially identified Indirubin as a promising candidate and, further, through AI integration, enhanced its activity in 13 different ligands and further evaluated its activity with the docking and subsequent steps. The overall study shows the AI-enhanced Indirubin derivative has superior binding affinity and also suggests that further experimental evaluation is required to ensure efficacy and safety [146]. As researchers in different therapeutic design steps are utilizing the integration of AI, Vaidya et al. have taken a further extension and framed an AI/ML-based predictive algorithm “Anti-EBV” to identify promising antivirals against the Epstein-Barr virus (EBV). In this investigation, the author designed the “Anti-EBV” web server, which is the first platform dedicated to identifying antivirals against EBV [147].
At present, the different algorithms of AI, ML, and DL integrating with the in silico approach are being utilized by researchers worldwide to address ongoing severe complications and improve therapeutic efficacy and novel drug identification. The employed different algorithms in successful design studies suggest and revealed the other significant roles of AI, such as to deal with the bulk dataset, design pipeline and framework to screen the promising compounds, design models to evaluate the efficacy, using framework for antiviral against the viral infection, employing combined algorithms that can be helpful to validate the compounds which may assist in the more accurate finding and many more. Although the use of AI is beneficial, it still has certain limitations. Further improvements are needed in model accuracy, streamlined steps and pipelines for handling bulk datasets, accurate compound prioritization, and the generation of more meaningful hypotheses, ultimately enhancing the drug development process.
Advancement and challenges
Compared to the conventional method, advancements in drug design using AI and algorithms are being utilized by researchers worldwide to make outcomes more accurate. The developed drug must be specific and bind to the target to successfully treat the infection. With the advancement of AI, researchers can easily find potential drugs among the bulk dataset, which can easily bind to the target proteins and genes associated with specific diseases. Based on that information, the researchers can quickly begin the drug discovery, leading to successful inhibitor identification [2, 6, 148]. Although complications from many diseases and pathogens continue to emerge, significant advancements have been made in therapeutics, with several drugs already available and others currently undergoing clinical trials. Using the advanced AI drug repurposing approach, researchers can easily find the existing approved drugs that may be useful for treating emerging and re-emerging diseases [143, 149, 150]. To transform and accelerate the drug discovery process, numerous databases containing chemical compounds, natural products, and approved drugs comprising millions of entries are freely available. By integrating AI-assisted models, researchers can rapidly identify the most promising candidates from these large datasets and analyze their biological activities using various computational models and pipelines. Compared to conventional methods, AI-assisted drug design facilitates the identification of the most promising inhibitors through high-throughput virtual screening, allowing researchers to rapidly prioritize candidates based on generated molecular attributes and key biological information [10, 17, 151]. Furthermore, the introduction of different AI-based algorithms and pipelines has made structure modeling easier, thereby facilitating de novo structure design. This has specifically aided the field of personalized medicine, where drugs can be tailored with preferred characteristics [152, 153]. To manage vast datasets, various cloud-based storage systems have also been introduced, which not only facilitate data storage but also support efficient data analysis [87, 154]. Subsequently, the introduction of AI has made drug discovery more efficient as it can integrate the data from different fields, enabling comprehensive analysis of complex biological data and improving the accuracy of predictions, thereby enhancing the efficacy of the drugs [155].
Although various algorithm-based advancements have significantly aided drug discovery, several challenges remain that need to be addressed to further improve therapeutic development. An AI-assisted study requires the most advanced high-tech computer system as it needs to deal with the bulk dataset, which generates the need for more money for the successful development of drugs; also, at the same time, it requires a person who is skilled with the generated data for further leads [7, 116]. However, the promising and successful treatments of the infected patient’s dataset generate concerns about the privacy aspects of the patients. Also, AI-assisted drug development requires a massive amount of data, but the retrieved data is often incomplete, which poses challenges for successful model development. The transparency of the model is another major challenge, as most AI models lack interpretability, making it difficult to understand the reasoning behind the generated results. Various models have also been shown to be biased and known to produce incorrect predictions [156, 157]. Subsequently, there remains a gap between in silico and in vivo studies, which may not align with computational findings [158]. Additionally, the models are often limited to specific datasets, and when provided with unfamiliar data, they may fail to generate accurate results, thereby hindering analysis [133, 159]. Lastly, ethical and regulatory concerns must also be addressed, as most models do not clearly define ownership and accountability for their applications [134]. Despite the challenges, the integration of AI in drug discovery continues to advance rapidly. Researchers worldwide are enhancing interpretation, validation, and the quality and quantity aspects, as these are essential for successful therapeutics development.
Critical analysis of challenges and limitations in AI-assisted drug discovery
AI, along with its subsets ML and DL, provides substantial efficiency in drug discovery. However, the broader implementation and successful translation of these technologies face multiple interrelated challenges that require thorough analysis.
Data scarcity, quality, and generalizability
The efficacy of any deep learning or machine learning model is fundamentally contingent upon the quality, quantity, and consistency of the training data. Biomedical data are often fragmented, proprietary, or inconsistent among various laboratories and institutions. This constraint frequently compels models to be trained on limited datasets, resulting in high effectiveness for specific tasks (e.g., predicting toxicity for a restricted chemical class) but increasing the likelihood of failure in generalization when applied to new chemical spaces or complex, polygenic diseases. The absence of standardized data protocols undermines the robustness and reproducibility of insights derived from AI. Addressing this necessitates a collective commitment across the industry to ensure high-quality data curation, promote open-science initiatives, and establish standardized metadata practices.
The interpretability and “Black-Box” dilemma
When very complicated models like deep neural networks (DNNs) are used successfully, they often provide “black-box” outputs that are hard to understand. Although these models attain considerable predictive accuracy, researchers encounter substantial challenges in data interpretation due to the complexity of the generated outputs, which complicates the identification of the mechanical explanation underlying a prediction. This absence of explainable AI (XAI) constitutes a significant impediment, not only for scientists requiring mechanistic reasons to inform synthesis and optimization but also for regulatory entities, like the Food and Drug Administration (FDA). Regulators usually want explicit, mechanistic proof before they approve a drug. This makes it hard for clinical and market translation to use AI models that can’t be understood.
Economic and regulatory hurdles
As the current publication emphasizes, AI-assisted research necessitates sophisticated high-tech computer systems and proficient individuals to manage extensive datasets, hence increasing the overall cost demands for medication development. Also, the rapid growth of AI technology is now exceeding the growth of unified regulatory frameworks. This delay in regulation makes it hard for businesses to invest in the industry. In addition to concerns about patient data privacy and the challenges of owning intellectual property (IP) for newly created compounds, these issues require immediate global action to establish clear ethical, legal, and operational standards for fully harnessing AI in the process.
Conclusion
Integrating AI in drug discovery is revolutionizing the process in terms of cost and time. As several emerging and re-emerging infections occur daily, enormous progress has been made employing AI-assisted automation in the drug development method to find potential leads. Also, due to their accuracy, several tools and databases play a vital role in drug discovery, from data retrieval to optimization. Advancements in these tools and databases can significantly reduce time and cost, as manual methods are tedious and time-consuming. Moreover, incorporating AI in clinical processes leads to improved accuracy through the various stages. However, the AI in drug design still has some limitations and challenges, such as data privacy and ethical concerns; advancements in drug discovery can address these issues effectively. This study compiles numerous details of AI in drug discovery. It offers understandings into its key steps, real-time applications through case studies, involvement in clinical trials, pharmaceutical aspects, and associated limitations. This comprehensive approach aims to help researchers understand AI’s critical stages and contributions to drug development.
Data Availability
No datasets were generated or analysed during the current study.
References
Paul D, Sanap G, Shenoy S, Kalyane D, Kalia K, Tekade RK (2021) Artificial intelligence in drug discovery and development. Drug Discov Today 26(1):80. https://doi.org/10.1016/j.drudis.2020.10.010
Vemula D, Jayasurya P, Sushmitha V, Kumar YN, Bhandari V (2023) CADD, AI and ML in drug discovery: a comprehensive review. Eur J Pharm Sci 181:106324. https://doi.org/10.1016/j.ejps.2022.106324
Lou P, Fang A, Zhao W, Yao K, Yang Y, Hu J (2023) Potential target discovery and drug repurposing for coronaviruses: study involving a knowledge graph-based approach. J Med Internet Res 25:e45225. https://doi.org/10.2196/45225
Shafiq N, Jannat A, Munir H, Rashid M, Parveen S (2023) Exploring the potential of FDA approved anti-diabetic drugs for repurposing against COVID-19: a core combination of multiple computational strategies and integrated artificial intelligence. J Biomol Struct Dyn. https://doi.org/10.1080/07391102.2023.2234993
Blanco-Gonzalez A, Cabezon A, Seco-Gonzalez A, Conde-Torres D, Antelo-Riveiro P, Pineiro A, Garcia-Fandino R (2023) The role of AI in drug discovery: challenges, opportunities, and strategies. Pharmaceuticals (Basel). https://doi.org/10.3390/ph16060891
Patel CN, Georrge JJ, Modi KM, Narechania MB, Patel DP, Gonzalez FJ, Pandya HA (2018) Pharmacophore-based virtual screening of catechol-o-methyltransferase (COMT) inhibitors to combat Alzheimer’s disease. J Biomol Struct Dyn 36(15):3938–3957. https://doi.org/10.1080/07391102.2017.1404931
Mak KK, Pichika MR (2019) Artificial intelligence in drug development: present status and future prospects. Drug Discov Today 24(3):773–780. https://doi.org/10.1016/j.drudis.2018.11.014
Wang Y, Guo Y, Kuang Q, Pu X, Ji Y, Zhang Z, Li M (2015) A comparative study of family-specific protein-ligand complex affinity prediction based on random forest approach. J Comput Aided Mol Des 29(4):349–360. https://doi.org/10.1007/s10822-014-9827-y
Andrianov AM, Shuldau MA, Furs KV, Yushkevich AM, Tuzikov AV (2023) AI-driven de novo design and molecular modeling for discovery of small-molecule compounds as potential drug candidates targeting SARS-CoV-2 main protease. Int J Mol Sci. https://doi.org/10.3390/ijms24098083
Giordano D, Biancaniello C, Argenio MA, Facchiano A (2022) Drug design by pharmacophore and virtual screening approach. Pharmaceuticals (Basel). https://doi.org/10.3390/ph15050646
Jarab AS, Abu Heshmeh SR, Al Meslamani AZ (2023) Artificial intelligence (AI) in pharmacy: an overview of innovations. J Med Econ 26(1):1261–1265. https://doi.org/10.1080/13696998.2023.2265245
Ranchon F, Chanoine S, Lambert-Lacroix S, Bosson JL, Moreau-Gaudry A, Bedouch P (2023) Development of artificial intelligence powered apps and tools for clinical pharmacy services: a systematic review. Int J Med Inform 172:104983. https://doi.org/10.1016/j.ijmedinf.2022.104983
Zhu C, Cai T, Jin Y, Chen J, Liu G, Xu N, Shen R, Chen Y, Han L, Wang S, Wu C, Zhu M (2020) Artificial intelligence and network pharmacology based investigation of pharmacological mechanism and substance basis of Xiaokewan in treating diabetes. Pharmacol Res 159:104935. https://doi.org/10.1016/j.phrs.2020.104935
Raju B, Verma H, Narendra G, Sapra B, Silakari O (2022) Multiple machine learning, molecular docking, and ADMET screening approach for identification of selective inhibitors of CYP1B1. J Biomol Struct Dyn 40(17):7975–7990. https://doi.org/10.1080/07391102.2021.1905552
Sharma A, Virmani T, Pathak V, Sharma A, Pathak K, Kumar G, Pathak D (2022) Artificial intelligence-based data-driven strategy to accelerate research, development, and clinical trials of COVID vaccine. BioMed Res Int 2022:7205241. https://doi.org/10.1155/2022/7205241
Qureshi R, Irfan M, Gondal TM, Khan S, Wu J, Hadi MU, Heymach J, Le X, Yan H, Alam T (2023) AI in drug discovery and its clinical relevance. Heliyon 9(7):e17575. https://doi.org/10.1016/j.heliyon.2023.e17575
Priya S, Tripathi G, Singh DB, Jain P, Kumar A (2022) Machine learning approaches and their applications in drug discovery and design. Chem Biol Drug Des 100(1):136–153. https://doi.org/10.1111/cbdd.14057
Sadybekov AV, Katritch V (2023) Computational approaches streamlining drug discovery. Nature 616(7958):673–685. https://doi.org/10.1038/s41586-023-05905-z
Ramsundar B, Eastman P, Walters P, Pande V (2019) Deep learning for the life sciences: applying deep learning to genomics, microscopy, drug discovery, and more. O’Reilly Media
Cano G, Garcia-Rodriguez J, Garcia-Garcia A, Perez-Sanchez H, Benediktsson JA, Thapa A, Barr A (2017) Automatic selection of molecular descriptors using random forest: application to drug discovery. Expert Syst Appl 72:151–159. https://doi.org/10.1016/j.eswa.2016.12.008
Janairo JIB (2023) Support vector machine in drug design. In: Cheminformatics, QSAR and Machine Learning Applications for Novel Drug Development. Elsevier, pp 161–179. https://doi.org/10.1016/B978-0-443-18638-7.00021-9
Askr H, Elgeldawi E, Aboul Ella H, Elshaier Y, Gomaa MM, Hassanien AE (2023) Deep learning in drug discovery: an integrative review and future challenges. Artif Intell Rev 56(7):5975–6037. https://doi.org/10.1007/s10462-022-10306-1
Che J, Feng R, Gao J, Yu H, Weng Q, He Q, Dong X, Wu J, Yang B (2020) Evaluation of artificial intelligence in participating structure-based virtual screening for identifying novel interleukin-1 receptor associated kinase-1 inhibitors. Front Oncol 10:1769. https://doi.org/10.3389/fonc.2020.01769
Delijewski M, Haneczok J (2021) AI drug discovery screening for COVID-19 reveals zafirlukast as a repurposing candidate. Med Drug Discov 9:100077. https://doi.org/10.1016/j.medidd.2020.100077
Parwez S, Panigrahi L, Ahmed S, Siddiqi MI (2023) Machine learning-based predictive modeling, virtual screening and biological evaluation studies for identification of potential inhibitors of MMP-13. J Biomol Struct Dyn 41(15):7190–7203. https://doi.org/10.1080/07391102.2022.2117738
Sayers EW, Cavanaugh M, Clark K, Pruitt KD, Sherry ST, Yankie L, Karsch-Mizrachi I (2023) GenBank 2024 update. Nucleic Acids Res. https://doi.org/10.1093/nar/gkad903
UniProt C (2023) UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res 51(D1):D523–D531. https://doi.org/10.1093/nar/gkac1052
Gilson MK, Liu T, Baitaluk M, Nicola G, Hwang L, Chong J (2016) BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res 44(D1):D1045-1053. https://doi.org/10.1093/nar/gkv1072
Mendez D, Gaulton A, Bento AP, Chambers J, De Veij M, Felix E, Magarinos MP, Mosquera JF, Mutowo P, Nowotka M, Gordillo-Maranon M, Hunter F, Junco L, Mugumbate G, Rodriguez-Lopez M, Atkinson F, Bosc N, Radoux CJ, Segura-Cabrera A, Hersey A, Leach AR (2019) ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res 47(D1):D930–D940. https://doi.org/10.1093/nar/gky1075
Sorokina M, Merseburger P, Rajan K, Yirik MA, Steinbeck C (2021) COCONUT online: collection of open natural products database. J Cheminform 13(1):2. https://doi.org/10.1186/s13321-020-00478-9
Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, Sajed T, Johnson D, Li C, Sayeeda Z, Assempour N, Iynkkaran I, Liu Y, Maciejewski A, Gale N, Wilson A, Chin L, Cummings R, Le D, Pon A, Knox C, Wilson M (2018) DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res 46(D1):D1074–D1082. https://doi.org/10.1093/nar/gkx1037
Yang W, Soares J, Greninger P, Edelman EJ, Lightfoot H, Forbes S, Bindal N, Beare D, Smith JA, Thompson IR, Ramaswamy S, Futreal PA, Haber DA, Stratton MR, Benes C, McDermott U, Garnett MJ (2013) Genomics of drug sensitivity in cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res 41(Database issue):D955-961. https://doi.org/10.1093/nar/gks1111
Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, Kuhn M, Bork P, Jensen LJ, von Mering C (2015) STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res 43(Database issue):D447-452. https://doi.org/10.1093/nar/gku1003
Kuhn M, Letunic I, Jensen LJ, Bork P (2016) The SIDER database of drugs and side effects. Nucleic Acids Res 44(D1):D1075-1079. https://doi.org/10.1093/nar/gkv1075
Daina A, Michielin O, Zoete V (2017) SwissADME: a free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules. Sci Rep 7:42717. https://doi.org/10.1038/srep42717
Pires DE, Blundell TL, Ascher DB (2015) pkCSM: predicting small-molecule pharmacokinetic and toxicity properties using graph-based signatures. J Med Chem 58(9):4066–4072. https://doi.org/10.1021/acs.jmedchem.5b00104
Yang H, Lou C, Sun L, Li J, Cai Y, Wang Z, Li W, Liu G, Tang Y (2019) admetSAR 2.0: web-service for prediction and optimization of chemical ADMET properties. Bioinformatics 35(6):1067–1069. https://doi.org/10.1093/bioinformatics/bty707
Wang R, Fang X, Lu Y, Wang S (2004) The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures. J Med Chem 47(12):2977–2980. https://doi.org/10.1021/jm030580l
Rutz A, Sorokina M, Galgonek J, Mietchen D, Willighagen E, Gaudry A, Graham JG, Stephan R, Page R, Vondrasek J, Steinbeck C, Pauli GF, Wolfender JL, Bisson J, Allard PM (2022) The LOTUS initiative for open knowledge management in natural products research. Elife 11:e70780. https://doi.org/10.7554/eLife.70780
Irwin JJ, Tang KG, Young J, Dandarchuluun C, Wong BR, Khurelbaatar M, Moroz YS, Mayfield J, Sayle RA (2020) ZINC20-A free ultralarge-scale chemical database for ligand discovery. J Chem Inf Model 60(12):6065–6073. https://doi.org/10.1021/acs.jcim.0c00675
Kim S, Chen J, Cheng T, Gindulyte A, He J, He S, Li Q, Shoemaker BA, Thiessen PA, Yu B, Zaslavsky L, Zhang J, Bolton EE (2023) PubChem 2023 update. Nucleic Acids Res 51(D1):D1373–D1380. https://doi.org/10.1093/nar/gkac956
Vivek-Ananth RP, Mohanraj K, Sahoo AK, Samal A (2023) IMPPAT 2.0: an enhanced and expanded phytochemical atlas of Indian medicinal plants. ACS Omega 8(9):8827–8845. https://doi.org/10.1021/acsomega.3c00156
Townshend RJ, Vögele M, Suriana P, Derry A, Powers A, Laloudakis Y, Balachandar S, Jing B, Anderson B, Eismann S (2020) Atom3d: Tasks on molecules in three dimensions. arXiv preprint arXiv:201204035
Varadi M, Bertoni D, Magana P, Paramval U, Pidruchna I, Radhakrishnan M, Tsenkov M, Nair S, Mirdita M, Yeo J, Kovalevskiy O, Tunyasuvunakool K, Laydon A, Zidek A, Tomlinson H, Hariharan D, Abrahamson J, Green T, Jumper J, Birney E, Steinegger M, Hassabis D, Velankar S (2023) AlphaFold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res. https://doi.org/10.1093/nar/gkad1011
Alipanahi B, Delong A, Weirauch MT, Frey BJ (2015) Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat Biotechnol 33(8):831–838. https://doi.org/10.1038/nbt.3300
Ding X, Zhang B (2021) DeepBAR: a fast and exact method for binding free energy computation. J Phys Chem Lett 12(10):2509–2515. https://doi.org/10.1021/acs.jpclett.1c00189
Liu Z, Du J, Fang J, Yin Y, Xu G, Xie L (2019) DeepScreening: a deep learning-based screening web server for accelerating drug discovery. Database (Oxford) 2019:baz104. https://doi.org/10.1093/database/baz104
Rifaioglu AS, Nalbat E, Atalay V, Martin MJ, Cetin-Atalay R, Dogan T (2020) DEEPScreen: high performance drug-target interaction prediction with convolutional neural networks using 2-D structural compound representations. Chem Sci 11(9):2531–2557. https://doi.org/10.1039/c9sc03414e
Lee I, Keum J, Nam H (2019) DeepConv-DTI: prediction of drug-target interactions via deep learning with convolution on protein sequences. PLoS Comput Biol 15(6):e1007129. https://doi.org/10.1371/journal.pcbi.1007129
Huang K, Fu T, Glass LM, Zitnik M, Xiao C, Sun J (2021) DeepPurpose: a deep learning library for drug-target interaction prediction. Bioinformatics 36(22–23):5545–5547. https://doi.org/10.1093/bioinformatics/btaa1005
Mayr A, Klambauer G, Unterthiner T, Hochreiter S (2016) DeepTox: toxicity prediction using deep learning. Front Environ Sci 3:80. https://doi.org/10.3389/fenvs.2015.00080
Wang C, Zhang Y (2017) Improving scoring-docking-screening powers of protein-ligand scoring functions using random forest. J Comput Chem 38(3):169–177. https://doi.org/10.1002/jcc.24667
Xu Y, Ma J, Liaw A, Sheridan RP, Svetnik V (2017) Demystifying multitask deep neural networks for quantitative structure-activity relationships. J Chem Inf Model 57(10):2490–2504. https://doi.org/10.1021/acs.jcim.7b00087
Sanchez-Lengeling B, Outeiral C, Guimaraes GL, Aspuru-Guzik A (2017) Optimizing distributions over molecular space. An objective-reinforced generative adversarial network for inverse-design chemistry (ORGANIC). https://doi.org/10.26434/chemrxiv.5309668.v3
Tang YC, Gottlieb A (2021) Explainable drug sensitivity prediction through cancer pathway enrichment. Sci Rep 11(1):3128. https://doi.org/10.1038/s41598-021-82612-7
Gomez-Bombarelli R, Wei JN, Duvenaud D, Hernandez-Lobato JM, Sanchez-Lengeling B, Sheberla D, Aguilera-Iparraguirre J, Hirzel TD, Adams RP, Aspuru-Guzik A (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent Sci 4(2):268–276. https://doi.org/10.1021/acscentsci.7b00572
Yash K, O’Hagan Steve SS, Neil S, Roberts T-t, Danushka B, Kell Douglas B (2020) DeepGraphMol, a multi-objective, computational strategy for generating molecules with desirable properties: a graph convolution and reinforcement learning approach. bioRxiv
Zhu Z, Shi C, Zhang Z, Liu S, Xu M, Yuan X, Zhang Y, Chen J, Cai H, Lu J (2022) Torchdrug: A powerful and flexible machine learning platform for drug discovery. arXiv preprint arXiv:220208320
Steiner S, Wolf J, Glatzel S, Andreou A, Granda JM, Keenan G, Hinkley T, Aragon-Camarasa G, Kitson PJ, Angelone D, Cronin L (2019) Organic synthesis in a modular robotic system driven by a chemical programming language. Science. https://doi.org/10.1126/science.aav2211
Łapińska N, Pacławski A, Szlek J, Mendyk A (2024) SerotoninAI: serotonergic system focused, artificial intelligence-based application for drug discovery. J Chem Inf Model 64(7):2150–2157. https://doi.org/10.1021/acs.jcim.3c01517
Bai Q, Tan S, Xu T, Liu H, Huang J, Yao X (2021) MolAICal: a soft tool for 3D drug design of protein targets by artificial intelligence and classical algorithm. Brief Bioinform 22(3):bbaa161
Yang X, Zhang B, Wang S, Lu Y, Chen K, Luo C, Sun A, Zhang H (2023) OTTM: an automated classification tool for translational drug discovery from omics data. Brief Bioinform 24(5):bbad301
Seo H, Tkachuk D, Ho C, Mammoliti A, Rezaie A, Madani Tonekaboni SA, Haibe-Kains B (2020) SYNERGxDB: an integrative pharmacogenomic portal to identify synergistic drug combinations for precision oncology. Nucleic Acids Res 48(W1):W494–W501
Li YH, Yu CY, Li XX, Zhang P, Tang J, Yang Q, Fu T, Zhang X, Cui X, Tu G (2018) Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics. Nucleic Acids Res 46(D1):D1121–D1127
Hussain A, Afzal O, Yasmin S, Haider N, Altamimi ASA, Martinez F, Acree WE Jr, Ramzan M (2023) Preferential solvation study of rosuvastatin in the {PEG400 (1)+ Water (2)} cosolvent mixture and GastroPlus software-based in vivo predictions. ACS Omega 8(14):12761–12772
Hu Q, Wang Z, Meng J, Li W, Guo J, Mu Y, Wang S, Zheng L, Wei Y (2024) OpenDock: a PyTorch-based open-source framework for protein–ligand docking and modelling. Bioinformatics 40(11):btae628
Eberhard P, Kern M, Aichem M, Borlinghaus H, Klein K, Delp J, Suciu I, Moser B, Dietrich D, Leist M (2024) PathwayNexus: a tool for interactive metabolic data analysis. Bioinformatics 40(6):btae310
Ullah M, Akbar S, Raza A, Zou Q (2024) DeepAVP-TPPred: identification of antiviral peptides using transformed image-based localized descriptors and binary tree growth algorithm. Bioinformatics 40(5):btae305
Zhu Y, Zhao L, Wen N, Wang J, Wang C (2023) DataDTA: a multi-feature and dual-interaction aggregation framework for drug–target binding affinity prediction. Bioinformatics 39(9):btad560
Zhao H, Zhang X, Zhao Q, Li Y, Wang J (2023) MSDRP: a deep learning model based on multisource data for predicting drug response. Bioinformatics 39(9):btad514
Amar D, Vizel A, Levy C, Shamir R (2018) ADEPTUS: a discovery tool for disease prediction, enrichment and network analysis based on profiles from many diseases. Bioinformatics 34(11):1959–1961
Kanehisa M, Sato Y (2020) KEGG Mapper for inferring cellular functions from protein sequences. Protein Sci 29(1):28–35
Sunseri J, Koes DR (2016) Pharmit: interactive exploration of chemical space. Nucleic Acids Res 44(W1):W442–W448
Myung Y, de Sá AG, Ascher DB (2024) Deep-PK: deep learning for small molecule pharmacokinetic and toxicity prediction. Nucleic Acids Res 52(W1):W469–W475
Li X, Zhan M, Fang J, Liu G, Tang Y, Li W (2025) MMPK: a multimodal deep learning framework to predict human oral pharmacokinetic parameters. J Med Chem 68(15):16678–16690
Minnich AJ, McLoughlin K, Tse M, Deng J, Weber A, Murad N, Madej BD, Ramsundar B, Rush T, Calad-Thomson S (2020) AMPL: a data-driven modeling pipeline for drug discovery. J Chem Inf Model 60(4):1955–1968
Perazzolo S (2024) SAAM II: a general mathematical modeling rapid prototyping environment. CPT Pharmacomet Syst Pharmacol 13(7):1088–1102
Xiong G, Wu Z, Yi J, Fu L, Yang Z, Hsieh C, Yin M, Zeng X, Wu C, Lu A, Chen X, Hou T, Cao D (2021) ADMETlab 2.0: an integrated online platform for accurate and comprehensive predictions of ADMET properties. Nucleic Acids Res 49(W1):W5–W14. https://doi.org/10.1093/nar/gkab255
Cihan Sorkun M, Mullaj D, Koelman JVA, Er S (2022) ChemPlot, a Python library for chemical space visualization. Chem-Methods 2(7):e202200005
Gautam V, Gupta R, Gupta D, Ruhela A, Mittal A, Mohanty SK, Arora S, Gupta R, Saini C, Sengupta D (2022) deepGraphh: AI-driven web service for graph-based quantitative structure–activity relationship analysis. Brief Bioinform 23(5):bbac288
Kadan A, Ryczko K, Lloyd E, Roitberg A, Yamazaki T Guided Multi-objective Generative AI for Structure-based Drug Design. Chem Sci 16(29):13196-210. https://doi.org/10.1039/D5SC01778E
Correia J, Capela J, Rocha M (2024) Deepmol: an automated machine and deep learning framework for computational chemistry. J Cheminform 16(1):136
Brownless A-LR, Rheaume E, Kuo KM, Kamerlin SC, Gumbart JC (2025) Using machine learning to analyze molecular dynamics simulations of biomolecules. J Phys Chem B. https://doi.org/10.1021/acs.jpcb.4c08824
Sodaei Z, Ekrami S, Hashemianzadeh SM (2025) Machine learning analysis of molecular dynamics properties influencing drug solubility. Sci Rep 15(1):26955. https://doi.org/10.1038/s41598-025-11392-1
Shah P, Kendall F, Khozin S, Goosen R, Hu J, Laramie J, Ringel M, Schork N (2019) Artificial intelligence and machine learning in clinical development: a translational perspective. NPJ Digit Med 2(1):69
Niazi SK (2023) The coming of age of AI/ML in drug discovery, development, clinical testing, and manufacturing: the FDA perspectives. Drug Des Devel Ther. https://doi.org/10.2147/dddt.s424991
Qureshi R, Irfan M, Gondal TM, Khan S, Wu J, Hadi MU, Heymach J, Le X, Yan H, Alam T (2023) AI in drug discovery and its clinical relevance. Heliyon 9(7):e17575
Kantor K, Morzy M (2024) Machine learning and natural language processing in clinical trial eligibility criteria parsing: a scoping review. Drug Discov Today. https://doi.org/10.1016/j.drudis.2024.104139
Abraham J, Heimberger AB, Marshall J, Heath E, Drabick J, Helmstetter A, Xiu J, Magee D, Stafford P, Nabhan C (2021) Machine learning analysis using 77,044 genomic and transcriptomic profiles to accurately predict tumor type. Transl Oncol 14(3):101016
Qiu X, Gao J, Yang J, Hu J, Hu W, Kong L, Lu JJ (2020) A comparison study of machine learning (random survival forest) and classic statistic (Cox proportional hazards) for predicting progression in high-grade glioma after proton and carbon ion radiotherapy. Front Oncol 10:551420
Lin R, Lee JJ (2020) Novel bayesian adaptive designs and their applications in cancer clinical trials. Computational and Methodological Statistics and Biostatistics: Contemporary Essays in Advancement. Springer International Publishing, Cham, pp 395–426
Ekundayo F (2024) Reinforcement learning in treatment pathway optimization: a case study in oncology. Int J Sci Res Arch 13(02):2187–2205. https://doi.org/10.30574/ijsra.2024.13.2.2450
Salo I, Nordlund L, Eklund L, Ho J, Soini M, Kumar D, Yeong J, Guan F, Metsälä E (2024) Advancements and applications of AI technologies in pathology: a scoping review. Comput Methods Biomech Biomed Eng Imaging Vis 12(1):2396595
Cè M, Caloro E, Pellegrino ME, Basile M, Sorce A, Fazzini D, Oliva G, Cellina M (2022) Artificial intelligence in breast cancer imaging: risk stratification, lesion detection and classification, treatment planning and prognosis—a narrative review. Explor Target Anti-tumor Ther 3(6):795
Favalli EG, Biggioggero M, Maioli G, Caporali R (2020) Baricitinib for COVID-19: a suitable treatment? Lancet Infect Dis 20(9):1012–1013. https://doi.org/10.1016/S1473-3099(20)30262-0
Gangwal A, Lavecchia A (2024) AI-driven drug discovery for rare diseases. J Chem Inf Model. https://doi.org/10.1021/acs.jcim.4c01966
Yingngam B, Sethabouppha B (2024) Advanced AI applications for drug discovery. In: Advances in Computational Intelligence for the Healthcare Industry 4.0. IGI Global, pp 42–86
Walsh CG, Wilimitis D, Chen Q, Wright A, Kolli J, Robinson K, Ripperger MA, Johnson KB, Carrell D, Desai RJ (2024) Scalable incident detection via natural language processing and probabilistic language models. Sci Rep 14(1):23429
Zhao Y, Yu Y, Wang H, Li Y, Deng Y, Jiang G, Luo Y (2022) Machine learning in causal inference: application in pharmacovigilance. Drug Saf 45(5):459–476
Kim J, Neumann L, Paul P, Day ME, Aratow M, Bell DS, Doctor JN, Hinske LC, Jiang X, Kim KK (2021) Privacy-protecting, reliable response data discovery using COVID-19 patient observations. J Am Med Inform Assoc 28(8):1765–1776
Ivanenkov YA, Polykovskiy D, Bezrukov D, Zagribelnyy B, Aladinskiy V, Kamya P, Aliper A, Ren F, Zhavoronkov A (2023) Chemistry42: an AI-driven platform for molecular design and optimization. J Chem Inf Model 63(3):695–701
Kamya P, Ozerov IV, Pun FW, Tretina K, Fokina T, Chen S, Naumov V, Long X, Lin S, Korzinkin M (2024) PandaOmics: an AI-driven platform for therapeutic target and biomarker discovery. J Chem Inf Model 64(10):3961–3969
Tran NH, Zhang X, Li M (2018) Deep omics. Proteomics 18(2):1700319. https://doi.org/10.1002/pmic.201700319
Laubenbacher R, Mehrad B, Shmulevich I, Trayanova N (2024) Digital twins in medicine. Nat Comput Sci 4(3):184–191
DiMasi JA, Grabowski HG, Hansen RW (2016) Innovation in the pharmaceutical industry: new estimates of R&D costs. J Health Econ 47:20–33
Hay M, Thomas DW, Craighead JL, Economides C, Rosenthal J (2014) Clinical development success rates for investigational drugs. Nat Biotechnol 32(1):40–51
Gupta S, Kumar V, Gupta P (2025) Role of artificial intelligence in health sector. In: Challenges in Information, Communication and Computing Technology. CRC Press, pp 690–695
Jiménez-Luna J, Grisoni F, Schneider G (2020) Drug discovery with explainable artificial intelligence. Nature Machine Intelligence 2(10):573–584
Mullowney MW, Duncan KR, Elsayed SS, Garg N, van der Hooft JJ, Martin NI, Meijer D, Terlouw BR, Biermann F, Blin K (2023) Artificial intelligence for natural product drug discovery. Nat Rev Drug Discovery 22(11):895–916
Hanna MG, Pantanowitz L, Dash R, Harrison JH, Deebajah M, Pantanowitz J, Rashidi HH (2025) Future of artificial intelligence -machine learning trends in pathology and medicine. Modern Pathol 38:100705. https://doi.org/10.1016/j.modpat.2025.100705
Ahire Y, Patil J, Chordiya H, Deore R, Bairagi V (2024) Advanced applications of artificial intelligence in pharmacovigilance: current trends and future perspectives. J Pharm Res 23(1):23–33
Yang Y, Cheng F (2025) Artificial intelligence streamlines scientific discovery of drug–target interactions. Br J Pharmacol. https://doi.org/10.1111/bph.17427
Al Hagbani T, Alamoudi JA, Bajaber MA, Alsayed HI, Al-Fanhrawi HJ (2025) Theoretical investigations on analysis and optimization of freeze drying of pharmaceutical powder using machine learning modeling of temperature distribution. Sci Rep 15(1):948
Ballester PJ, Mitchell JB (2010) A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking. Bioinformatics 26(9):1169–1175
Ramaprasad ASE, Smith MT, McCoy D, Hubbard AE, La Merrill MA, Durkin KA (2022) Predicting the binding of small molecules to nuclear receptors using machine learning. Brief Bioinform 23(3):bbac114
Lavecchia A (2019) Deep learning in drug discovery: opportunities, challenges and future prospects. Drug Discov Today 24(10):2017–2032. https://doi.org/10.1016/j.drudis.2019.07.006
McNicholas BA, Madden MG, Laffey JG (2025) Natural language processing in critical care: opportunities, challenges, and future directions. Intensive Care Med. https://doi.org/10.1007/s00134-024-07776-y
Kumar V, Iqbal MI, Rathore R (2025) Natural Language Processing (NLP) in Disease Detection—A Discussion of How NLP Techniques Can Be Used to Analyze and Classify Medical Text Data for Disease Diagnosis. AI in Disease Detection: Advancements and Applications:53–75
Esteva A, Chou K, Yeung S, Naik N, Madani A, Mottaghi A, Liu Y, Topol E, Dean J, Socher R (2021) Deep learning-enabled medical computer vision. NPJ digital medicine 4(1):5. https://doi.org/10.1038/s41746-020-00376-2
Sharma R, Kumar N, Sharma V (2025) Computer Vision for Disease Detection—An Overview of How Computer Vision Techniques Can Be Used to Detect Diseases in Medical Images, Such as X‐Rays and MRIs. AI in Disease Detection: Advancements and Applications: 77–98. https://doi.org/10.1002/9781394278695.ch4
Yingngam B (2024) Machine Learning Applications for Drug Repurposing. Artificial Intelligence and Machine Learning in Drug Design and Development: 251–294. https://doi.org/10.1002/9781394234196.ch9
Guo K, Wu M, Soo Z, Yang Y, Zhang Y, Zhang Q, Lin H, Grosser M, Venter D, Zhang G (2023) Artificial intelligence-driven biomedical genomics. Knowl-Based Syst 279:110937. https://doi.org/10.1016/j.knosys.2023.110937
Thieme AH, Zheng Y, Machiraju G, Sadee C, Mittermaier M, Gertler M, Salinas JL, Srinivasan K, Gyawali P, Carrillo-Perez F (2023) A deep-learning algorithm to classify skin lesions from mpox virus infection. Nat Med 29(3):738–747. https://doi.org/10.1038/s41591-023-02225-7
Rahman MH, Biswas P, Dey D, Hannan MA, Sahabuddin M, Araf Y, Kwon Y, Emran TB, Ali MS, Uddin MJ (2022) An in-silico identification of potential flavonoids against kidney fibrosis targeting TGFβR-1. Life (Basel) 12(11):1764'. https://doi.org/10.3390/life12111764
Lavecchia A (2025) Transform Drug Discovery and Development With Generative Artificial Intelligence. Generative Artificial Intelligence for Biomedical and Smart Health Informatics:489–537. https://doi.org/10.1002/9781394280735.ch25
Smalley E (2017) AI-powered drug discovery captures pharma interest. Nat Biotechnol 35: 604–605. https://doi.org/10.1038/nbt0717-604
Lv Q, Zhou F, Liu X, Zhi L (2023) Artificial intelligence in small molecule drug discovery from 2018 to 2023: does it really work? Bioorg Chem 141:106894. https://doi.org/10.1016/j.bioorg.2023.106894
Aundhia C, Parmar G, Talele C, Shah N, Talele D (2025) Impact of artificial intelligence on drug development and delivery. Curr Top Med Chem. https://doi.org/10.2174/0115680266324522240725053634
Schuhmacher A, Haefner N, Honsberg K, Goldhahn J, Gassmann O (2023) The dominant logic of Big Tech in healthcare and pharma. Drug Discov Today 28(2):103457. https://doi.org/10.1016/j.drudis.2022.103457
Saini JPS, Thakur A, Yadav D (2025) AI driven innovations in pharmaceuticals: optimizing drug discovery and industry operations. RSC Pharmaceutics. https://doi.org/10.1039/d4pm00323c
Mishra SK, Jeba Praba J, Georrge JJ (2024) An emerging trends of bioinformatics and big data analytics in healthcare. Walter de Gruyter GmbH & Co KG, Berlin, Germany. https://doi.org/10.1515/9783111398549-007
Vora LK, Gholap AD, Jetha K, Thakur RRS, Solanki HK, Chavda VP (2023) Artificial intelligence in pharmaceutical technology and drug delivery design. Pharmaceutics 15(7):1916. https://doi.org/10.3390/pharmaceutics15071916
Sankar R, Alam MN (2024) Artificial intelligence in drug development and delivery: opportunities, challenges, and future directions. Journal of Angiotherapy 8(8):1–10. https://doi.org/10.25163/angiotherapy.880326
Tiwari PC, Pal R, Chaudhary MJ, Nath R (2023) Artificial intelligence revolutionizing drug development: exploring opportunities and challenges. Drug Dev Res 84(8):1652–1663. https://doi.org/10.1002/ddr.22115
Chikhale RV, Choudhary R, Eldesoky GE, Kolpe MS, Shinde O, Hossain D (2025) Generative AI, molecular docking and molecular dynamics simulations assisted identification of novel transcriptional repressor EthR inhibitors to target Mycobacterium tuberculosis. Heliyon. https://doi.org/10.1016/j.heliyon.2025.e42593
Beckers M, Sturm N, Sirockin F, Fechner N, Stiefl N (2023) Prediction of small-molecule developability using large-scale in silico ADMET models. J Med Chem 66(20):14047–14060. https://doi.org/10.1021/acs.jmedchem.3c01083
Zhang Y, Wang Y, Zhou W, Fan Y, Zhao J, Zhu L, Lu S, Lu T, Chen Y, Liu H (2019) A combined drug discovery strategy based on machine learning and molecular docking. Chem Biol Drug Des 93(5):685–699. https://doi.org/10.1111/cbdd.13494
Choi J, Yun JS, Song H, Shin YK, Kang YH, Munashingha PR, Yoon J, Kim NH, Kim HS, Yook JI, Tark D, Lim YS, Hwang SB (2021) Prediction of African Swine Fever Virus inhibitors by molecular docking-driven machine learning models. Molecules. https://doi.org/10.3390/molecules26123592
Narendra G, Raju B, Verma H, Sapra B, Silakari O (2021) Multiple machine learning models combined with virtual screening and molecular docking to identify selective human ALDH1A1 inhibitors. J Mol Graph Model 107:107950. https://doi.org/10.1016/j.jmgm.2021.107950
Tang B, He F, Liu D, He F, Wu T, Fang M, Niu Z, Wu Z, Xu D (2022) AI-Aided design of novel targeted covalent inhibitors against SARS-CoV-2. Biomolecules. https://doi.org/10.3390/biom12060746
Nguyen TK, Le Nguyen TN, Nguyen K, Nguyen HVT, Tran LTT, Ngo TXT, Pham PTV, Tran MH (2022) Machine learning-based screening of MCF-7 human breast cancer cells and molecular docking analysis of essential oils from Ocimum basilicum against breast cancer. J Mol Struct 1268:133627. https://doi.org/10.1016/j.molstruc.2022.133627
Kashyap K, Mahapatra PP, Ahmed S, Buyukbingol E, Siddiqi MI (2023) Identification of potential aldose reductase inhibitors using convolutional neural network-based in silico screening. J Chem Inf Model 63(20):6261–6282. https://doi.org/10.1021/acs.jcim.3c00547
Patel CN, Mall R, Bensmail H (2023) AI-driven drug repurposing and binding pose meta dynamics identifies novel targets for Monkeypox virus. J Infect Public Health 16(5):799–807. https://doi.org/10.1016/j.jiph.2023.03.007
Garg I, Nagarajan K, Bhardwaj S, Tripathi AK, Jain A, Saxena V (2025) Docking and AI-based classification cum recurrent neural network validation for predicting anti-tubercular leads against dprE-1 enzyme targeter. Curr Proteomics 22(2):100013. https://doi.org/10.1016/j.curpro.2025.100013
Suliman M, Laraib A, Bibi S, Mohamed SMM, Alshahrani MY, Muhammad S (2025) Exploring AI-generated pyrazalone derivatives as antifungal agents: bringing together molecular docking and quantum chemical approaches. Comput Biol Chem 118:108502. https://doi.org/10.1016/j.compbiolchem.2025.108502
Ali N, Akbar R, Saleem A, Ali A, Ali A (2025) AI based natural inhibitor targeting RPS20 for colorectal cancer treatment using integrated computational approaches. Sci Rep 15(1):24906. https://doi.org/10.1038/s41598-025-07574-6
Vaidya H, Gautam S, Kumar M (2025) Anti-EBV: Artificial intelligence driven predictive modeling for repurposing drugs as potential antivirals against Epstein-Barr virus. Comput Struct Biotechnol J. https://doi.org/10.1016/j.csbj.2025.04.042
Harren T, Matter H, Hessler G, Rarey M, Grebner C (2022) Interpretation of structure-activity relationships in real-world drug design data sets using explainable artificial intelligence. J Chem Inf Model 62(3):447–462. https://doi.org/10.1021/acs.jcim.1c01263
Sitaula C, Shahi TB (2022) Monkeypox virus detection using pre-trained deep learning-based approaches. J Med Syst 46(11):78. https://doi.org/10.1007/s10916-022-01868-2
Srisongkram T, Weerapreeyakul N (2022) Drug repurposing against KRAS mutant G12C: A machine learning, molecular docking, and molecular dynamics study. Int J Mol Sci. https://doi.org/10.3390/ijms24010669
Kamdar J, Jeba Praba J, Georrge JJ (2020) Artificial intelligence in medical diagnosis: methods, algorithms and applications. Machine Learning with Health Care Perspective: Machine Learning and Healthcare: 27–37. https://doi.org/10.1007/978-3-030-40850-3_2
Albani FG, Alghamdi SS, Almutairi MM, Alqahtani T (2025) Artificial intelligence-driven innovations in oncology drug discovery: Transforming traditional pipelines and enhancing drug design. Drug Des Devel Ther. https://doi.org/10.2147/dddt.s509769
Das U (2025) Transforming precision medicine through generative AI: advanced architectures and tailored therapeutic design for patient-specific drug discovery. ChemistrySelect 10(36):e02448. https://doi.org/10.1002/slct.202502448
Ali H (2023) Quantum computing and AI in healthcare: accelerating complex biological simulations, genomic data processing, and drug discovery innovations. World J Adv Res Rev 20(2):1466–1484. https://doi.org/10.30574/wjarr.2023.20.2.2325
Ferreira FJN, Carneiro AS (2025) AI-driven drug discovery: a comprehensive review. ACS Omega 10(23):23889–23903. https://doi.org/10.1021/acsomega.5c00549
Alizadehsani R, Oyelere SS, Hussain S, Jagatheesaperumal SK, Calixto RR, Rahouti M, Roshanzamir M, De Albuquerque VHC (2024) Explainable artificial intelligence for drug discovery and development: a comprehensive survey. IEEE Access 12:35796–35812
Hassija V, Chamola V, Mahapatra A, Singal A, Goel D, Huang K, Scardapane S, Spinelli I, Mahmud M, Hussain A (2024) Interpreting black-box models: a review on explainable artificial intelligence. Cogn Comput 16(1):45–74. https://doi.org/10.1007/s12559-023-10179-8
Mittal R, Ho A, Adivikolanu H, Sawhney M, Lemos J, Mittal M, Hirani K (2025) Exploring the potential of computer simulation models in drug testing and biomedical research: a systematic review. Front Pharmacol 16:1644907. https://doi.org/10.3389/fphar.2025.1644907
Lever J, Cheng S, Casas CQ, Liu C, Fan H, Platt R, Rakotoharisoa A, Johnson E, Li S, Shang Z (2025) Facing & mitigating common challenges when working with real-world data: the data learning paradigm. J Comput Sci 85:102523. https://doi.org/10.1016/j.jocs.2024.102523
Acknowledgements
SKM acknowledges the Department of Bioinformatics, University of North Bengal, for the facility support to complete this work.
Funding
None.
Author information
Authors and Affiliations
Contributions
SKM contributed to conceptualization and writing original draft; JPJ, HM, and AS were involved in writing original draft and revision; and JJG contributed to editing, revision, and supervision.
Corresponding author
Ethics declarations
AI Disclosure
During the preparation of this manuscript, the author(s) used ChatGPT for only language editing and grammar correction to improve readability. After using this the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.
Ethical approval and consent to participate
Not applicable.
Consent for publication
The authors declare no conflict of interest.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mishra, S.K., J, J.P., Mamoudou, H. et al. Navigation of drug discovery via artificial intelligence. Futur J Pharm Sci 12, 27 (2026). https://doi.org/10.1186/s43094-026-00954-3
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1186/s43094-026-00954-3







