Published on in Vol 7 (2026)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/80539, first published .
Systematic Mining of Bioactive Compounds for Wound Healing From Cayratia Japonica Exosome-Like Nanovesicles: A Workflow Combining LC-MS and DeepSeek Models

Systematic Mining of Bioactive Compounds for Wound Healing From Cayratia Japonica Exosome-Like Nanovesicles: A Workflow Combining LC-MS and DeepSeek Models

Systematic Mining of Bioactive Compounds for Wound Healing From Cayratia Japonica Exosome-Like Nanovesicles: A Workflow Combining LC-MS and DeepSeek Models

1School of Basic Medical Sciences, Jinggangshan University, Ji'an, China

2Jiangxi Province Key Laboratory of Organ Development and Epigenetics, Clinical Medical Research Center, Affiliated Hospital of Jinggangshan University, College of Jinggangshan University, 28 Xueyuan Road, Qingyuan District, Ji'an, China

3University of Montpellier, Montpellier, France

4Department of Epidemiology & Biostatistics, School of Public Health, Southeast University, Nanjing, China

5Division of Molecular Signaling, Department of the Advanced Biomedical Research, Interdisciplinary Graduate School of Medicine, University of Yamanashi, Chuo, Japan

6College of Traditional Chinese Medicine and Pharmacy, Jinggangshan University, Ji'an, China

Corresponding Author:

Qiao-Jing Yan, PhD


Background: Plant-derived exosome-like nanovesicles (P-ELNs) effectively deliver bioactive compounds due to their high biocompatibility and low immunogenicity. While liquid chromatography-mass spectrometry (LC-MS) profiles compounds in complex samples, its analysis of large datasets remains limited by traditional methods. Recent advances in large language models (LLMs) and domain-specific systems have enhanced Chinese biomedical data processing and cross-modal pharmaceutical research.

Objective: This study aimed to create a multimodal framework of LC-MS combined with DeepSeek models for data mining of compounds with wound-healing properties from exosome-like nanovesicles derived from Cayratia japonica (CJ-ELNs).

Methods: LC-MS identified compounds enriched in CJ (n=3) and CJ-ELNs (n=3), and then compounds specifically enriched in CJ-ELNs were filtered via a four-step filtering workflow. The CJ-ELNs-specific compounds were processed by DeepSeek models for screening naturally active compounds with targeted functions of antioxidation, anti-inflammation, anticellular damage, antiapoptosis, wound healing and tissue regeneration, and cell proliferation.

Results: A multimodal framework of LC-MS combined with the DeepSeek-DF model was created. With the assistance of artificial intelligence (AI), a total of 46 naturally active compounds derived from CJ-ELNs with targeted functions were identified.

Conclusions: A self-designed multimodal framework of LC-MS, combined with DeepSeek models, rapidly and accurately identifies naturally active compounds from CJ-ELNs. This AI-powered system innovatively integrates the traditional analytical technique with modern LLMs, thus greatly favoring data mining of active ingredients in traditional Chinese medicine herbs.

JMIR Bioinform Biotech 2026;7:e80539

doi:10.2196/80539

Keywords



Plant-derived exosome-like nanovesicles (P-ELNs) contain abundant bioactive molecules, serving as novel carriers of natural products to mediate intercellular communication and mediate physiological processes [1,2]. P-ELNs are superior to conventional mammalian-derived exosomes, possessing unique advantages such as high biocompatibility, high skin permeability, low cytotoxicity and low immunogenicity [3,4]. Multiple in vitro and in vivo studies indicate that these P-ELNs possess intrinsic therapeutic activity, offering promise for disease treatment and enhancing human health [5,6]. Cayratia japonica, a traditional Chinese medicinal herb, is widely used for the treatment of traumatic injuries such as contusions and lacerations [7]. Recent clinical studies have confirmed that topical application of CJ ointment effectively alleviates local inflammation and promotes the repair and regeneration of damaged tissue, demonstrating favorable therapeutic outcomes in the management of postoperative infectious wounds around the anus [8]. However, research and application of exosome-like nanovesicles (ELNs) derived from CJ remain incomplete. Our research team successfully extracted and characterized a novel type of P-ELNs from the traditional Chinese medicinal herb Cayratia japonica, namely Cayratia japonica exosome-like nanovesicles (CJ-ELNs). They possess efficient delivery of bioactive compounds to wound sites, thus favoring tissue regeneration from infectious wound-related disorders. Bioactive constituents encapsulated within CJ-ELNs are dominant in wound healing. Consequently, the identification and characterization of bioactive compounds responsible for wound healing are of paramount significance.

Great strides have been made in the screening of active ingredients from natural products via omics techniques [9]. Liquid chromatography–mass spectrometry (LC-MS) has emerged as a powerful tool for profiling trace-level compounds in complex samples, although its performance in processing massive data is limited by traditional manual or rule-based analytical approaches [10,11]. In recent years, large-scale pretrained language models (LLMs), such as ChatGPT, GPT-4, and domain-specific systems like DeepSeek, have significantly transformed the landscape of biomedical data analysis and knowledge discovery [11,12]. These models exhibit powerful capabilities in natural language understanding, semantic reasoning, and prompt-based knowledge retrieval [13-15]. They are promising tools to assist omics analysis. In particular, DeepSeek models have been widely adopted for optimizing Chinese-language biomedical contexts, and supporting cross-modal tasks in pharmaceutical research, such as entity recognition, document summarization, and semantic ranking [16,17].

In this study, we innovatively created a multimodal framework of LC-MS combined with DeepSeek models for data mining of compounds with wound-healing properties from CJ-ELNs. This work illustrates the potential of artificial intelligence (AI) as a computational engine in natural compound discovery and offers a scalable solution for mining multimodal biochemical data.


Preprocessing of LC-MS Data

Untargeted metabolomic profiling of CJ and CJ-ELNs was performed by LC-MS. A total of 6 samples (including 3 CJ samples and 3 CJ-ELNs samples) were analyzed using a ultra-high-performance liquid chromatography (UHPLC) system coupled to a Q Exactive HF-X mass spectrometer (Thermo Scientific). Chromatographic separation was performed on an HSS T3 column (maintained at 40°C) with a 12-minute linear gradient from 2% to 98% mobile phase B at a flow rate of 0.3 mL/min. Mass spectrometry (MS) data were acquired in both positive and negative electrospray ionization (ESI) mode (± ESI) using a data-dependent acquisition strategy (top 10 most intense ions). Raw data were first converted to the mzML format using ProteoWizard, followed by processing, using Compound Discoverer 3.3 (Thermo Fisher Scientific) for peak alignment (with maximum retention time shift of 0.5 min and mass tolerance of 10 ppm) and normalization (using the median of maximum peak areas). Compound identification was achieved by matching MS/MS spectra against the following databases: mzCloud, LipidMaps, KEGG, HMDB, and MassBank. The matching criteria were set to a mass tolerance of 10 ppm and a minimum match factor threshold of 10. A four-step filtering workflow was designed to quantitatively identify target compounds as follows (Figure 1).

  1. Filtering of match confidence: compounds with spectral match scores ≥80 were retained [18];
  2. Filtering of unique compounds of CJ-ELNs: compounds identified in CJ and CJ-ELNs were compared with isolated compounds unique to CJ-ELNs;
  3. Filtering of biological relevance: candidate compounds were screened for associations with wound healing-related signaling pathways using the DeepSeek-Bio model;
  4. Semantic recognition and prompt engineering: final candidate molecules were refined through semantic analysis and prompt-based selection.

Common and unique compounds derived from CJ and CJ-ELNs were visualized in a Venn diagram, and a word cloud analysis was conducted via Python. Functions and tools, and databases of key terms used in this study are listed in Table 1.

Figure 1. A four-step filtering workflow. CJ-ELNs: Cayratia japonica exosome-like nanovesicle; LC-MS: liquid chromatography-mass spectrometry.
Table 1. Key terms, functions, tools and databases used in this study.
Key termsFunctionsTools/databases
mzMLStandardized data storageProteoWizard
DeepSeek-BioBiological pathway association analysisDeepseek 671B Model Network Edition
KEGG database
MorganDigital characterization of molecular structuresChemoinformatics software packages
PubMedBERTLiterature feature extractionPubMed.pro
Grad-CAMVisualization of model decisionsDeep learning frameworks (eg, PyTorch)
ASRautomatic semantic recognitionThe Great Prophecy Model of Human-Computer Interaction

Construction of a Multimodal Framework of LC-MS Combined With DeepSeek Models

A multimodal framework of LC-MS combined with the DeepSeek-DF model was created, consisting of two major components of the input and output layers. The input layer integrated structural features of compounds (Morgan fingerprints), quantitative features (z score normalization), and literature-derived features (PubMedBERT embeddings). The core architecture was listed in Figure 2. Additionally, the output layer used multitask learning to simultaneously predict wound-healing activity via Sigmoid output and mechanism category via Softmax output.

Figure 2. The core architecture of the input layer.

Interpretability-Based Filtering

The Automated Semantic Recognition (ASR) module and prompt engineering techniques of DeepSeek-R1 32B, as well as web searching were used to interpret the potential biological functions of the screened candidate compound with an annotation of functional labels. A plausibility assessment was then performed based on predefined criteria, including antioxidation, anti-inflammation, anticellular damage, antiapoptosis, wound healing and tissue regeneration, and cell proliferation. Each compound was evaluated and categorized using the following scoring scheme: √ (confirmed), × (not supported), and ? (uncertain). Taking the metabolite (-)-Epicatechin 3-O-gallate as an example, its function, category and possibility in the involvement of wound healing, tissue regeneration, antioxidant, and anticellular damage were predicted via the multimodal framework (Table 2). Following this preliminary filtering, manual curation was conducted to eliminate compounds of nonplant origin and those with low abundances. Ultimately, a refined set of characteristic natural products from CJ-ELNs with potential wound-healing properties was selected.

Table 2. Functions, categories and possibility in the involvement of biological processes of representative metabolites.
CompoundFunctionsCategoriesPossibility
(-)-Epicatechin 3-O-gallateAntioxidant, anti-inflammatory, anti-cancer, cardiovascular protection, glucose and lipid metabolism regulation.Organic compound, antioxidant factor, anti-inflammatory factor, energy metabolism, phenolic factorWound healing: ×, tissue regeneration: ×, antioxidant: √, anti-cellular damage: ?
RutinAntioxidant and anti-inflammatory, maintaining vascular resilience, reducing vascular permeability and fragility, exhibiting certain antiviral and anticancer effects.Flavonoids, antioxidant, anti-inflammatoryWound healing: ×, tissue regeneration: ×, antioxidant: √, anti-cellular damage: ?
CaffeineCentral nervous system stimulants, enhance mental alertness, alleviate fatigue.Organic compounds, alkaloids, energy metabolismWound healing: ×, tissue regeneration: ×, antioxidant: ×, anti-cellular damage: ×

Acidic Compounds Are Enriched in CJ-ELNs

After conversion and normalization of the raw LC-MS data, a total of 829 and 2212 compounds were identified from CJ and CJ-ELNs. A Venn diagram visualized 1881 specific compounds in CJ-ELNs (Figure 3). “Acid,” as the most frequent term across all entries of metabolite names, was detected by a word cloud analysis (Multimedia Appendix 1). It suggested that acidic compounds were highly enriched in CJ-ELNs.

Figure 3. Enrichment of acidic compounds in CJ-ELNs. (A) A Venn diagram visualizing an intersection of compounds identified from both CJ and CJ-ELNs and unique compounds in CJ-ELNs. CJ: Cayratia japonica; CJ-ELNs: Cayratia japonica exosome-like nanovesicle.

Rapid and Accurate Data Mining of Compounds in CJ-ELNs With Functional Properties

A total of 1881 candidate compounds enriched in CJ-ELNs were functionally annotated and classified using the self-designed multimodal framework of LC-MS combined with DeepSeek models. They were categorized into 20 distinct classes, including organic compounds, alkaloids, amino acids, biomolecules, organic acids, antioxidants, anti-inflammatory agents, energy metabolism-related molecules, phenolics, cytoprotective agents, alcohols, and others. Organic compounds were the leading category of compounds enriched in CJ-ELNs (Figure 4, Multimedia Appendix 2). Functionally, 43.33% (n=39) of compounds enriched in CJ-ELNs possessed the antioxidant property. With the assistance of DeepSeek, we specifically screened compounds enriched in CJ-ELNs with targeted functions of antioxidation, anti-inflammation, anticellular damage, antiapoptosis, wound healing and tissue regeneration, and cell proliferation.

Figure 4. Rapid and accurate data mining of compounds in CJ-ELNs with functional properties. Top 20 classifications of compounds enriched in CJ-ELNs. CJ-ELN: Cayratia japonica exosome-like nanovesicle.

Bioactive Compounds of CJ-ELNs Responsible for Wound Healing and Tissue Regeneration

We estimated the overall expression levels of compounds across the six target functions derived from the DeepSeek model within this multimodal framework, visualizing the results in radar chart format after log2-transformation. (Figure 5). Notably, compounds with the antioxidant function possessed the highest expression levels, proving the antioxidant mechanism of CJ-ELNs in wound repair. Finally, a secondary filtering of compounds with targeted functions was conducted. We manually excluded nonplant–derived compounds, including those of animal origin, synthetic chemicals, and other nonbotanical sources. In addition, compounds with low expression levels in CJ-ELNs were also removed. As a result, a total of 46 naturally active compounds derived from CJ-ELNs with targeted functions were identified (Figure 6 and Multimedia Appendix 3). Citric acid was the most abundant compound with the targeted functions, which was consistent with the finding from the word cloud analysis.

Figure 5. Radar plots visualizing bioactive compounds of Cayratia japonica exosome-like nanovesicles with targeted functions.
Figure 6. Expression levels (log2-transformed) of naturally active compounds derived from Cayratia japonica exosome-like nanovesicle identified by an integration of liquid chromatography-mass spectrometry and DeepSeek models.

Principal Findings

This study innovatively integrated DeepSeek models with LC-MS to successfully predict the major natural products of CJ-ELNs responsible for wound healing. DeepSeek’s ASR semantic recognition and prompt engineering worked together to generate initial classification labels. Moreover, an automatic assessment effectively, rapidly, and accurately achieved the goal of data mining of specific compounds for targeted functions.

AI techniques, particularly LLMs, have become an unstoppable force for reshaping medical research [19,20]. Traditionally, LC-MS is a powerful analytical technique to identify and quantify active ingredients in traditional Chinese medicine (TCM) herbs. However, a rapid and accurate recognition of compounds with targeted functions, and a quantitative analysis of trace concentrations in complicated samples can be challenging [21]. We expected that an integration of LC-MS and LLMs would benefit TCM research, including the acceleration of active ingredient screen, precise targeting of interested compounds for certain diseases, and anchoring the promising candidates for developing new drugs. DeepSeek is an intelligent system based on a large-scale pre-trained language model, exhibiting strong capabilities in text understanding, knowledge reasoning, and cross-modal collaborative analysis, particularly excelling in processing information within Chinese-language contexts [22,23]. It enables rapid processing and analyzing massive volumes of both unstructured and structured data, thus digging biological insights out of complex omics datasets [24,25].

In the present study, we first created a four-step filtering workflow and quantitatively identified target compounds from CJ-ELNs by LC-MS. The cloud word analysis emphasized the term of acid among screened compounds enriched in CJ-ELNs. Acidic compounds derived from traditional Chinese herbals are established for the role of clearing heat and detoxifying [26]. Numerous studies have reported that acidic compounds in plants exert antioxidant, antibacterial, and anti-inflammatory effects through mechanisms such as scavenging free radicals, alleviating oxidative stress, modulating inflammatory factors, stimulating fibroblast proliferation, promoting collagen deposition, enhancing epithelialization, and inducing angiogenesis [27,28]. To achieve a precise data mining of compounds with relevant functions, DeepSeek models lent a hand that specifically screened compounds in CJ-ELNs with targeted functions of antioxidation, anti-inflammation, anticellular damage, antiapoptosis, wound healing and tissue regeneration, and cell proliferation. Finally, naturally active compounds in CJ-ELNs were resurfaced for their promising potentials in wound repair. For example, studies have shown that baicalin accelerates the wound healing process by downregulating the expression of pro-inflammatory cytokines (IL-6 and IL-1β) while upregulating the anti-inflammatory factor IL-10, and by promoting the secretion of various growth factors (VEGF, FGF-2, PDGF-β, and CTGF) [29]. The combination of LC-MS with DeepSeek paves the way to further analyses of therapeutic targets from traditional Chinese herbs for wound healing and tissue regeneration [30,31].

Limitations in this study should be noted. Firstly, bioactive compounds derived from CJ-ELNs were mined via LC-MS and a single LLM, namely, DeepSeek-R1. Other cutting-edge LLMs such as Claude, GPT-4 and Liama [32] can be further analyzed for the assistance of LC-MS in identifying interested compounds. Secondly, the 46 naturally active compounds derived from CJ-ELNs with targeted functions should be validated in in vivo and in vitro experiments. Lastly, the workflow we have established requires further validation on independent datasets. We shall address the aforementioned issues in subsequent work, including evaluating the efficacy of compounds through cell migration and transdermal tissue compatibility assays, verifying their efficacy via macroscopic imaging and H&E staining following animal wound modelling interventions, and validating potential pathways involved through Western blot and immunohistochemical analysis.

Conclusion

We innovatively designed a multimodal framework of LC-MS combined with DeepSeek models that rapidly and accurately identify naturally active compounds from CJ-ELNs. This AI-powered system innovatively integrates the traditional analytical technique with modern large language models, showing a huge potential in modern medicine and TCM research.

Acknowledgments

We would like to thank the National Center for Regional Technology Transfer and Commercialization in Biomedical Sciences for providing technical testing support and EVLIXIR for providing Cayratia japonica exosome-like nanovesicles (CJ-ELNs) samples. We are also truly grateful to Prof. Elvis Agbo for his comprehensive guidance and help in revising and polishing the manuscript.

Funding

This study was supported by the National Natural Science Foundation of China (Grant No. 32460913), the Natural Science Foundation Project of Jiangxi Province (Grant No. 20232BAB205009), the Science and Technology Foundation of the Education Department of Jiangxi Province (Grant No. GJJ2201603), and the National Foreign Expert Projects (Y20240165).

Data Availability

The original data used for the current study are available upon reasonable request from the corresponding authors.

Authors' Contributions

Conceptualization: MXS, QF, QJY

Data curation: YPF, WJ

Formal analysis: YPF, WJ

Funding acquisition: MXS, QF, QJY

Investigation: WJ, YPF

Methodology: WJ, YPF

Project administration: MXS, QHY

Resources: MXS, QJY

Supervision: MXS, QF, QJY

Writing-original draft: WJ, YPF

Writing-review & editing: JX, JY, MXS, QF, QJY

Conflicts of Interest

None declared.

Multimedia Appendix 1

A word cloud of common compounds identified by liquid chromatography-mass spectrometry.

PNG File, 283 KB

Multimedia Appendix 2

Distribution of the classifications of compounds enriched in CJ-ELNs, distribution of functional compounds enriched in CJ-ELNs with targeted functions of wound healing and tissue regeneration, and distribution of compounds enriched in CJ-ELNs with all functional categories.

TIF File, 1337 KB

Multimedia Appendix 3

Function of 46 compounds.

XLSX File, 17 KB

  1. Subha D, Harshnii K, Madhikiruba KG, Nandhini M, Tamilselvi KS. Plant derived exosome- like nanovesicles: an updated overview. Plant Nano Biology. Feb 2023;3:100022. [CrossRef]
  2. Mu N, Li J, Zeng L, et al. Plant-derived exosome-like nanovesicles: current progress and prospects. Int J Nanomedicine. 2023;18:4987-5009. [CrossRef] [Medline]
  3. Dad HA, Gu TW, Zhu AQ, Huang LQ, Peng LH. Plant exosome-like nanovesicles: emerging therapeutics and drug delivery nanoplatforms. Mol Ther. Jan 2021;29(1):13-31. [CrossRef]
  4. Di Gioia S, Hossain MN, Conese M. Biological properties and therapeutic effects of plant-derived nanovesicles. Open Med. Nov 21, 2020;15(1):1096-1122. [CrossRef]
  5. Lian MQ, Chng WH, Liang J, et al. Plant-derived extracellular vesicles: recent advancements and current challenges on their use for biomedical applications. J Extracell Vesicles. Dec 2022;11(12):e12283. [CrossRef] [Medline]
  6. Karamanidou T, Tsouknidas A. Plant-derived extracellular vesicles as therapeutic nanocarriers. Int J Mol Sci. Dec 24, 2021;23(1):T-epublish. [CrossRef] [Medline]
  7. Sun J, Zhao P, Ding X, et al. Cayratia japonica prevents ulcerative colitis by promoting M2 macrophage polarization through blocking the TLR4/MAPK/NF-κB pathway. Mediators Inflamm. 2022;2022:1108569. [CrossRef] [Medline]
  8. Zhao X, Dai R, Wang J, et al. Analysis of the permeable and retainable components of Cayratia japonica ointment through intact or broken skin after topical application by UPLC-Q-TOF-MS/MS combined with in vitro transdermal assay. J Pharm Biomed Anal. Jan 20, 2024;238:115853. [CrossRef] [Medline]
  9. Wolfender JL, Litaudon M, Touboul D, Queiroz EF. Innovative omics-based approaches for prioritisation and targeted isolation of natural products - new strategies for drug discovery. Nat Prod Rep. Jun 19, 2019;36(6):855-868. [CrossRef] [Medline]
  10. Gros M, Petrović M, Barceló D. Development of a multi-residue analytical methodology based on liquid chromatography-tandem mass spectrometry (LC-MS/MS) for screening and trace level determination of pharmaceuticals in surface and wastewaters. Talanta. Nov 15, 2006;70(4):678-690. [CrossRef] [Medline]
  11. Gika HG, Wilson ID, Theodoridis GA. LC–MS-based holistic metabolic profiling. Problems, limitations, advantages, and future perspectives. Journal of Chromatography B. Sep 2014;966:1-6. [CrossRef]
  12. Wang B, Xie Q, Pei J, et al. Pre-trained language models in biomedical domain: a systematic survey. ACM Comput Surv. Oct 31, 2023;56:1-52. [CrossRef]
  13. Zhong W, Liu Y, Liu Y, et al. Performance of ChatGPT-4o and four open-source large language models in generating diagnoses based on China’s rare disease catalog: comparative study. J Med Internet Res. 2025;27:e69929-e69929. [CrossRef]
  14. Liverpool S, Mota CP, Sales CMD, et al. Engaging children and young people in digital mental health interventions: systematic review of modes of delivery, facilitators, and barriers. J Med Internet Res. Jun 23, 2020;22(6):e16317. [CrossRef] [Medline]
  15. Choudhury A, Shahsavar Y, Shamszare H. User intent to use Deepseek for health care purposes and their trust in the large language model: Multinational Survey Study. JMIR Hum Factors. May 26, 2025;12:e72867. [CrossRef] [Medline]
  16. Tordjman M, Liu Z, Yuce M, et al. Comparative benchmarking of the DeepSeek large language model on medical tasks and clinical reasoning. Nat Med. Aug 2025;31(8):2550-2555. [CrossRef] [Medline]
  17. W McGee R. Leveraging DeepSeek: an AI-powered exploration of traditional chinese medicine (Tai Chi and Qigong) for medical research. AJBSR. 2025;25(5):645-654. [CrossRef]
  18. Alseekh S, Aharoni A, Brotman Y, et al. Mass spectrometry-based metabolomics: a guide for annotation, quantification and best reporting practices. Nat Methods. Jul 2021;18(7):747-756. [CrossRef] [Medline]
  19. Haleem A, Javaid M, Khan IH. Current status and applications of artificial intelligence (AI) in medical field: an overview. Current Medicine Research and Practice. Nov 2019;9(6):231-237. [CrossRef]
  20. Tang X. The role of artificial intelligence in medical imaging research. BJR Open. 2020;2(1):20190031. [CrossRef] [Medline]
  21. Pang B, Zhu Y, Lu L, Gu F, Chen H. The applications and features of liquid chromatography‐mass spectrometry in the analysis of Traditional Chinese Medicine. Evid Based Complement Alternat Med. Jan 2016;2016(1). [CrossRef]
  22. Du K, Li A, Zuo QH, et al. Comparing artificial intelligence-generated and clinician-created personalized self-management guidance for patients with knee osteoarthritis: blinded observational study. J Med Internet Res. May 7, 2025;27:e67830. [CrossRef] [Medline]
  23. Huang T, et al. TCM-3ceval: a triaxial benchmark for assessing responses from large language models in traditional chinese medicine. arXiv. Preprint posted online on Mar 10, 2025. [CrossRef]
  24. Li F, Chen J, Luo W, et al. DeepPGDB: a novel paradigm for AI-guided interactive plant genomic database. Bioinformatics. Preprint posted online on 2025. [CrossRef]
  25. Luo E, et al. Benchmarking AI scientists in Omics data-driven biological research. arXiv. Preprint posted online on May 13, 2025. [CrossRef]
  26. Muluye RA, Bian Y, Alemu PN. Anti-inflammatory and antimicrobial effects of heat-clearing chinese herbs: a current review. J Tradit Complement Med. Apr 2014;4(2):93-98. [CrossRef]
  27. Guan S, Ge D, Liu TQ, Ma XH, Cui ZF. Protocatechuic acid promotes cell proliferation and reduces basal apoptosis in cultured neural stem cells. Toxicol In Vitro. Mar 2009;23(2):201-208. [CrossRef] [Medline]
  28. Yang D, Moh S, Son D, et al. Gallic acid promotes wound healing in normal and hyperglucidic conditions. Molecules. 2016;21(7):899. [CrossRef]
  29. Kim E, Ham S, Jung BK, Park JW, Kim J, Lee JH. Effect of baicalin on wound healing in a mouse model of pressure ulcers. IJMS. 24(1):329. [CrossRef]
  30. Zhao F, Li Q, Wang M, Xiong X. An AI agent-based system for retrieving compound information in Traditional Chinese Medicine. Information. 2025;16(7):543. [CrossRef]
  31. He J, et al. OpenTCM: a graphrag-empowered LLM-based system for traditional chinese medicine knowledge retrieval and diagnosis. arXiv. Preprint posted online on Apr 28, 2025. [CrossRef]
  32. Jaleel A, Aziz U, Farid G, et al. Evaluating the potential and accuracy of ChatGPT-3.5 and 4.0 in Medical Licensing and In-Training Examinations: systematic review and meta-analysis. JMIR Med Educ. 2025;11:e68070. [CrossRef]


AI: artificial intelligence
ASR: Automated Semantic Recognition
CJ-ELN: Cayratia japonica exosome-like nanovesicle
ELN: exosome-like nanovesicles
ESI: electrospray ionization
LC-MS: liquid chromatography-mass spectrometry
LLM: large language model
MS: mass spectrometry
P-ELN: Plant-derived exosome-like nanovesicle
TCM: traditional Chinese medicine
UHPLC: ultra-high-performance liquid chromatography


Edited by Zongliang Yue; submitted 12.Jul.2025; peer-reviewed by Vasileios Alevizos, Xuejia Kang; final revised version received 14.Oct.2025; accepted 19.Oct.2025; published 08.Jan.2026.

Copyright

© Qiang Fu, Wei Ji, Yu-Ping Fan, Jian Yao, Ming-Xia Song, Qiao-Jing Yan. Originally published in JMIR Bioinformatics and Biotechnology (https://bioinform.jmir.org), 8.Jan.2026.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Bioinformatics and Biotechnology, is properly cited. The complete bibliographic information, a link to the original publication on https://bioinform.jmir.org/, as well as this copyright and license information must be included.