JMIR Bioinformatics and Biotechnology

Recent Articles

Theme Issue 2025: Artificial Intelligence in Oncology

Unpacking Genomic Biomarkers for Programmed Cell Death Receptor-1 Immunotherapy Success in Non–Small Cell Lung Cancer Using Deep Neural Networks: Quantitative Study

Background: Non-small cell lung cancer (NSCLC) is one of the leading causes of cancer-related mortality worldwide. PD-1 immunotherapy has shown promising results in the treatment of NSCLC; however, not all patients respond effectively to this treatment. Identifying predictive biomarkers for PD-1 therapy response is critical to improving patient outcomes and optimizing treatment strategies. Traditional methods of biomarker discovery often fall short in terms of accuracy and comprehensiveness, given their inability to effectively capture dependencies in multi-dimensional data. Recent advancements in deep learning provide a powerful approach to analyze complex genomic data and identify novel biomarkers that may predict therapeutic responses.

AI applications in genomics and pathology informatics, including ‘Generative AI’

Systematic Mining of Bioactive Compounds for Wound Healing From Cayratia japonica Exosome-Like Nanovesicles: A Workflow Combining LC-MS and DeepSeek Models

Plant-derived exosome-like nanovesicles (P-ELNs) effectively deliver bioactive compounds due to their high biocompatibility and low immunogenicity. Although liquid chromatography-mass spectrometry (LC-MS) can profile compounds in complex samples, analysis of the resulting large datasets remains limited by traditional methods. Recent advances in large language models (LLMs) and domain-specific systems now enhance Chinese biomedical data processing and cross-modal pharmaceutical research.

Theme Issue 2025: Artificial Intelligence in Oncology

Development and Validation of a Generative Artificial Intelligence-Based Pipeline for Automated Clinical Data Extraction From Electronic Health Records: Technical Implementation Study

Manual abstraction of unstructured clinical data is often necessary for granular clinical outcomes research but is time-consuming and can be of variable quality. Large language models (LLMs) show promise in medical data extraction, yet integrating them into research workflows remains challenging and poorly described.

Corrigenda and Addenda

Correction: Structural and Functional Impacts of SARS-CoV-2 Spike Protein Mutations: Insights From Predictive Modeling and Analytics

Bioinformatics, genomics, tools and databases

Immunogenicity of Adalimumab in Bacterial Molecular Mimicry: In Silico Analysis

Adalimumab, a monoclonal antibody targeting TNFα, treats autoimmune diseases but induces anti-drug antibodies in 30–60% of patients, reducing its efficacy.

Structural biology and molecular modeling

Structural and Functional Impacts of SARS-CoV-2 Spike Protein Mutations: Insights From Predictive Modeling and Analytics

The COVID-19 pandemic requires a deep understanding of SARS-CoV-2, particularly how mutations in the Spike Receptor Binding Domain (RBD) Chain E affect its structure and function. Current methods lack comprehensive analysis of these mutations at different structural levels.

Network biology

Protein-Protein Interactions in Papillary and Nonpapillary Urothelial Carcinoma Architectures: Comparative Study

Bladder cancer is a disease with complex perturbations in gene networks and is heterogeneous in terms of histology, mutations, and prognosis. Advances in high-throughput sequencing technologies, genome-wide association studies, and bioinformatics methods have revealed greater insights into the pathogenesis of complex diseases. Network biology-based approaches have been used to identify complex protein-protein interactions (PPIs) that can point to potential drug targets. There is a need to better understand PPIs specific to urothelial carcinoma.

Immunoinformatics and Pathology informatics

Estimating Antigen Test Sensitivity via Target Distribution Balancing: Development and Validation Study

Sensitivity, expressed as percent positive agreement (PPA) with a reference assay, is a primary metric for evaluating lateral-flow antigen tests (ATs), typically benchmarked against quantitative reverse transcription polymerase chain reaction (qRT-PCR). In SARS-CoV-2 diagnostics, ATs detect nucleocapsid protein, whereas qRT-PCR measures viral RNA copy numbers. Because observed PPA depends on the underlying viral-load distribution (proxied by cycle threshold [Ct] values, which are inversely related to load), study-specific sampling can bias sensitivity estimates. Cohort differences, such as enrichment for high- or low-Ct specimens, therefore complicate cross-test comparisons, and real-world datasets often deviate from regulatory guidance to sample across the full concentration range. Although logistic models relating test positivity to Ct are well described, they are seldom used to re-weight results to a standardized reference viral-load distribution. As a result, reported sensitivities remain difficult to compare across studies, limiting both accuracy and generalizability.
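
The re-weighting idea described above can be illustrated with a minimal sketch. It is not the study's implementation: the logistic coefficients, the Ct bins, and the three cohort distributions below are hypothetical values chosen only to show how the same test can yield different PPA estimates depending on the Ct mix of the cohort, and how averaging over a shared reference distribution puts estimates on a common footing.

```python
import math


def positivity_probability(ct, intercept, slope):
    """Logistic model for P(antigen test positive | Ct)."""
    return 1.0 / (1.0 + math.exp(-(intercept + slope * ct)))


def weighted_ppa(ct_weights, intercept, slope):
    """Average the modeled positivity over a Ct distribution.

    ct_weights maps a Ct value to its weight; weights are normalized here,
    so they do not need to sum to 1.
    """
    total = sum(ct_weights.values())
    return sum(
        weight * positivity_probability(ct, intercept, slope)
        for ct, weight in ct_weights.items()
    ) / total


# Hypothetical coefficients: positivity falls as Ct rises (viral load drops).
INTERCEPT, SLOPE = 13.5, -0.5

# Hypothetical cohorts: one enriched for low Ct, one enriched for high Ct.
low_ct_cohort = {18: 0.50, 24: 0.30, 30: 0.15, 36: 0.05}
high_ct_cohort = {18: 0.05, 24: 0.15, 30: 0.30, 36: 0.50}

# Hypothetical standardized reference distribution (uniform over the bins).
reference = {18: 0.25, 24: 0.25, 30: 0.25, 36: 0.25}

ppa_low = weighted_ppa(low_ct_cohort, INTERCEPT, SLOPE)
ppa_high = weighted_ppa(high_ct_cohort, INTERCEPT, SLOPE)
ppa_reference = weighted_ppa(reference, INTERCEPT, SLOPE)

print(f"PPA, low-Ct-enriched cohort:  {ppa_low:.3f}")
print(f"PPA, high-Ct-enriched cohort: {ppa_high:.3f}")
print(f"PPA, reference distribution:  {ppa_reference:.3f}")
```

In this sketch the fitted curve is identical for all three calculations; only the weighting changes. In practice the coefficients would be estimated from paired AT and qRT-PCR results, and the reference distribution would need to be agreed upon across studies.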

AI applications in genomics and pathology informatics, including ‘Generative AI’

Conversational Artificial Intelligence for Integrating Social Determinants, Genomics, and Clinical Data in Precision Medicine: Development and Implementation Study of the AI-HOPE-PM System

Integrating clinical, genomic, and social determinants of health (SDoH) data is essential for advancing precision medicine and addressing cancer health disparities. However, existing bioinformatics tools often lack the flexibility to perform equity-driven analyses or require significant programming expertise.

Theme Issue 2025: Artificial Intelligence in Oncology

Paired-Sample and Pathway-Anchored MLOps Framework for Robust Transcriptomic Machine Learning in Small Cohorts: Model Classification Study

Ninety percent of the 65,000 human diseases are infrequent, collectively affecting ~400 million people, substantially limiting cohort accrual. This low prevalence constrains the development of robust transcriptome-based machine learning (ML) classifiers. Standard data-driven classifiers typically require cohorts of over 100 subjects per group to achieve clinical accuracy while managing high-dimensional input (~25,000 transcripts). These requirements are infeasible for micro-cohorts of ~20 individuals, where overfitting becomes pervasive.

Theme Issue 2025: Artificial Intelligence in Oncology

Systemic Anticancer Therapy Timelines Extraction From Electronic Medical Records Text: Algorithm Development and Validation

The systemic treatment of cancer typically requires the use of multiple anticancer agents in combination and/or sequentially. Clinical narrative texts often contain extensive descriptions of the temporal sequencing of systemic anticancer therapy (SACT), making the extraction of SACT timelines an important task that may be amenable to automation.

Theme Issue 2025: Artificial Intelligence in Oncology

Lung Cancer Diagnosis From Computed Tomography Images Using Deep Learning Algorithms With Random Pixel Swap Data Augmentation: Algorithm Development and Validation Study

Deep learning (DL) shows promise for automated lung cancer diagnosis, but limited clinical data restricts performance. While data augmentation (DA) helps, existing methods struggle with chest computed tomography (CT) scans across diverse DL architectures.