JMIR Bioinformatics and Biotechnology
Methods, web-based platforms, open data and open software tools for big data analytics, machine learning-based predictive models using genomic and imaging data, and information retrieval in biology and medicine.
JMIR Bioinformatics and Biotechnology is the official journal of the MidSouth Computational Biology and Bioinformatics Society
Editor-in-Chief:
Ece D. Uzun, MS, Ph.D., FAMIA, Director of Clinical Bioinformatics, Lifespan Academic Medical Center; Associate Director, Center for Clinical Cancer Informatics and Data Science (CCIDS); and Associate Professor of Pathology and Laboratory Medicine, Alpert Medical School, Brown University, USA
CiteScore 2.2
Recent Articles



Bladder cancer is a disease with complex perturbations in gene networks and heterogeneous in terms of histology, mutations, and prognosis. Advances in high-throughput sequencing technologies, genome-wide association studies, and bioinformatics methods have revealed greater insights into the pathogenesis of complex diseases. Network biology-based approaches have been used to identify the complex protein-protein interactions (PPIs) which can lead to potential drug targets. There is a need to better understand PPIs specific to urothelial carcinoma.

Sensitivity—expressed as percent positive agreement (PPA) with a reference assay—is a primary metric for evaluating lateral-flow antigen tests (ATs), typically benchmarked against a quantitative reverse transcription polymerase chain reaction (qRT-PCR). In SARS-CoV-2 diagnostics, ATs detect nucleocapsid protein, whereas qRT-PCR detects viral RNA copy numbers. Because observed PPA depends on the underlying viral-load distribution (proxied by the number cycle thresholds or Cts, which is inversely related to load), study-specific sampling can bias sensitivity estimates. Cohort differences—such as enrichment for high- or low-Ct specimens—therefore complicate cross-test comparisons, and real-world datasets often deviate from regulatory guidance to sample across the full concentration range. Although logistic models relating test positivity to Ct are well described, they are seldom used to re-weight results to a standardized reference viral-load distribution. As a result, reported sensitivities remain difficult to compare across studies, limiting both accuracy and generalizability

Integrating clinical, genomic, and social determinants of health (SDoH) data is essential for advancing precision medicine and addressing cancer health disparities. However, existing bioinformatics tools often lack the flexibility to perform equity-driven analyses or require significant programming expertise.

Ninety percent of the 65,000 human diseases are infrequent, collectively affecting ~ 400 million peo-ple, substantially limiting cohort accrual. This low prevalence constrains the development of robust transcriptome-based machine learning (ML) classifiers. Standard data-driven classifiers typically require cohorts of over 100 subjects per group to achieve clinical accuracy while managing high-dimensional input (~25,000 transcripts). These requirements are infeasible for micro-cohorts of ~20 individuals, where overfitting becomes pervasive.

The systemic treatment of cancer typically requires the use of multiple anticancer agents in combination and/or sequentially. Clinical narrative texts often contain extensive descriptions of the temporal sequencing of systemic anticancer therapy (SACT), setting up an important task that may be amenable to automated extraction of SACT timelines.


The protein A Disintegrin and Metalloprotease Domain-Containing 17 (ADAM-17), also called TACE, generally plays an important role in the cleavage of the Pro-Leu-Ala-Gln-Ala-/-Val-Arg-Ser-Ser-Ser sequence in the membrane-bound precursor of tumor necrosis factor alpha (TNF-α). This cleavage process has significant implications for inflammatory and immune responses, and recent studies suggest a role for ADAM-17 variants in SARS-CoV-2 infection.

Cancer is one of the leading causes of disease burden globally, and early and accurate diagnosis is crucial for effective treatment. This study presents a deep learning-based model designed to classify five common types of cancer in Saudi Arabia: Breast, Colorectal, Thyroid, Non-Hodgkin Lymphoma (NHL), and Corpus Uteri.

National and Ethnic Mutation Frequency Databases (NEMDBs) play a crucial role in documenting gene variations across populations, offering invaluable insights for gene mutation research and the advancement of precision medicine. These databases provide an essential resource for understanding genetic diversity and its implications for health and disease across different ethnic groups.

Previous machine learning approaches for prostate cancer detection using gene expression data have shown remarkable classification accuracies. However, prior studies overlook the influence of racial diversity within the population and the importance of selecting outlier genes based on expression profiles.
Preprints Open for Peer-Review
Open Peer Review Period:
-






