Abstract
Background: Adalimumab, a monoclonal antibody targeting tumor necrosis factor α, treats autoimmune diseases but induces antidrug antibodies in 30% to 60% of patients, reducing its efficacy.
Objective: This study aims to investigate molecular mimicry as a mechanism behind this immunogenicity, where bacterial immunoglobulin domains structurally resemble adalimumab’s light chain, triggering immune responses.
Methods: Using PSI-BLASTp (National Center for Biotechnology Information) and PRALINE (Center for Integrative Bioinformatics), there are 40 bacterial antigens homologous to adalimumab, with 8 clinically relevant strains.
Results: Structural analysis revealed 94% amino acid identity between the immunoglobulin domain of Escherichia coli strain B1 and adalimumab’s light chain, and 89.67% similarity with Corynebacterium pyruviciproducens. Root mean square deviation values confirmed strong structural homology. Additionally, 5 cross-reactive B-cell epitopes were predicted, suggesting overlapping surfaces that may promote immune cross-reactivity and antidrug antibody development.
Conclusions: This study represents a first step toward identifying a potential microbial factor driving antiadalimumab antibody formation. The predicted cross-reactive regions provide specific candidates for further in vitro validation to confirm molecular mimicry and refine epitope mapping. Understanding these mechanisms may ultimately inform the design of less immunogenic biologics and guide clinical strategies to predict and prevent antidrug antibody formation.
doi:10.2196/83872
Keywords
Introduction
Adalimumab is a fully human monoclonal antibody targeting tumor necrosis factor-α, a protein involved in inflammation in various chronic autoimmune conditions []. The Food and Drug Administration has approved adalimumab to treat several diseases: rheumatoid arthritis, ankylosing spondylitis, Crohn disease, ulcerative colitis, hidradenitis suppurativa, juvenile idiopathic arthritis, plaque psoriasis, psoriatic arthritis, and uveitis []. Despite the humanization of adalimumab, the amino acid sequences of both the heavy and light variable chains near the epitope binding regions within the complementarity-determining regions tend to elicit a robust immune response [-]. Immune complexes formed by adalimumab and antigens can reach 4000 kDa, and despite being humanized, they may still be recognized as foreign, triggering antidrug antibodies (ADAs) that reduce its effectiveness [,].
Approximately 30% to 60% of patients on treatment with adalimumab eventually experience a reduction in the effectiveness of the treatment []. This waning efficacy is believed to be due, in part, to immunogenicity, which refers to the body’s generation of antibodies targeting the biological drug []. One potential mechanism underlying this immune response is immune cross-reactivity, where the immune system interacts with structurally similar antigens from different sources. Among these, molecular mimicry—where bacterial proteins share structural homology with therapeutic antibodies like adalimumab—has emerged as a plausible explanation for this phenomenon [,]. Four requirements must be met for an infection to be implicated in the molecular mimicry-based development of an autoimmune response: there must be epidemiological data about exposure to the environmental agent and the development of autoimmunity, structural homology between human antigens (or medication) and pathogens, autoantibodies or autoreactive lymphocytes against both human and pathogen antigens, and in vivo evidence in animal models [,].
Immunoglobulin domains, widely recognized for their role in the structure and function of key immunological proteins, are highly conserved units across evolution []. Interestingly, these domains are not confined to the immune systems of higher organisms but are also present in bacterial proteins, including those of Escherichia coli and other enterobacteria [,]. Immunoglobulin-like domains are frequently found in these microorganisms in cell surface proteins and fimbrial organelles, where they play essential roles in host cell adhesion and invasion by pathogenic strains. They serve as structural components of pilus and nonpilus fimbrial systems and are members of the intimin or invasin family of outer membrane adhesins []. This dual functionality underscores their significance as a possible evolutionary mechanism through which pathogens leverage these conserved structures to evade or modulate the host immune response, facilitating infection and colonization [].
The immune cross-reactivity highlights the need for a comprehensive understanding of the immune responses induced by microorganisms in drugs. It has important implications for the development and use of pharmacological therapies. Currently, there is no evidence of immune cross-reactivity between adalimumab and microbial antigens. However, cross-reactive immune responses have been reported with other medications and vaccines, leading to thrombocytopenia and autoimmune diseases. Furthermore, patients with underlying autoimmune conditions who develop infections are more likely to produce anti-drug antibodies [,-].
Molecular mimicry has traditionally been investigated in the context of autoimmune diseases and vaccine responses, where microbial antigens resemble host proteins. However, its potential role in the immunogenicity of therapeutic antibodies remains largely unexplored. In this study, we apply this concept to examine whether infections could trigger cross-reactive immune responses toward biological drugs. Specifically, we investigate possible cross-immunogenicity between the immunoglobulin domain of clinically relevant Gram-positive and Gram-negative bacteria and adalimumab using in silico approaches. To our knowledge, this is the first study to propose a mechanistic link between bacterial antigens and adalimumab immunogenicity supported by sequence and structural evidence. By addressing this unexplored aspect of biologic drug immunogenicity, our work provides a novel conceptual framework that may guide future experimental validation and inform strategies to improve therapeutic antibody safety.
Methods
Study Design
A workflow image of the method is shown in Figure S1 in .
Sequences Analysis
Adalimumab’s amino acid sequence, identified by its DrugBank Accession Number (DB00051), was obtained from DrugBank [,]. The complete amino acid sequences of both the heavy (α) and light (β) chains corresponding to the Fab region were used for the analyses. These sequences include the variable and constant domains, ensuring full structural representation of the monoclonal antibody during alignment and comparison. Adalimumab sequence served as the input for a PSI-BLASTp (version 2.16.0; National Center for Biotechnology Information) search targeting bacterial homologs, using the identifier Bacteria (taxid:2). The length of matched subsequences in PSI-BLASTp is influenced by statistical significance thresholds and the iterative nature of the algorithm, which together help ensure that only meaningful alignments are included in the analysis []. Default settings were applied for the general search parameters. For subsequent analyses, amino acid sequences from bacteria of clinical significance to humans were selected [].
Antigens demonstrating a similarity of ≥30% were considered for further investigation. The amino acid sequences from the chosen microorganisms were aligned with the adalimumab light and heavy chains to ascertain identity levels and pinpoint conserved regions. The PRALINE tool (version 2; Center for Integrative Bioinformatics) [] facilitated the alignments by identifying regions of similarity, which may indicate functional, structural, or evolutionary relationships among the proteins being compared. The alignment parameters were configured to use BLOSUM62 as the exchange matrix using default parameters unless otherwise specified. Specifically, we used BLOSUM62 as the substitution matrix with a gap opening penalty of 11 and a gap extension penalty of 1 (standard for PSI-BLAST alignments). For the PSI-BLASTp search, 3 iterations were performed with an E-value threshold of 0.001 to enhance sensitivity and identify distant homologs []. E-value represents the number of random matches you would expect to find with a score equal to or better than the one observed []. Similarly, antigens with a similarity of ≥30% were advanced for further analysis. Also, a high-resolution Protein Data Bank (PDB) file (ID: 3wd5) was sourced from the PDB [,], enabling structural analysis.
Modeling Based on Homology
3D models of selected antigens, for which no reports exist in the PDB, were constructed based on homology using the SWISS-MODEL server (ProMod3) [,]. These initial models underwent further refinement with UCSF Chimera (version 1.1.3) [].
Antigens with experimentally resolved 3D structures were sourced directly from the PDB. The visualization of all models was achieved using PyMOL (version 3.0; Schrödinger, Inc) []. Structural homology assessments were conducted using the Ramachandran charts, the Quantitative Model Energy Analysis index, the RMSD metric, and Global Model Quality Estimate (GMQE) values were assessed for all models. RMSD focuses on precise structural alignment, while GMQE provides a broader model quality evaluation. Using both metrics allows researchers to make informed decisions about the reliability and applicability of their homology models in biological research [,].
B-Cell Epitope Prediction
The prediction of B-cell epitopes was carried out using the ElliPro server (version 3.0; IEDB Analysis Resource), using the default parameters that are a minimum score protrusion index (PI) threshold of 0.5 and a maximum distance of 6 Å between residue centers for defining discontinuous epitopes. ElliPro identified linear and discontinuous epitopes based on the protein’s 3D structure (PDB ID: 3WD5) using the PI of residues [,]. B-cell epitope prediction methods, such as those using the Ellipro server, generally achieve accuracy rates ranging from 65% to over 70%. Still, ongoing validation and refinement are necessary due to variability in sensitivity and specificity across different prediction tools []. Furthermore, epitopes previously identified for adalimumab were retrieved from the Immune Epitope Database (IEDB). This step was essential to explore the potential molecular mimicry between antigens from bacteria and those associated with adalimumab. Epitopes conserved between adalimumab and its bacterial homologs were visualized on the 3D model of the monoclonal antibody using PyMOL version 3.0.
Major Histocompatibility Complex Class II–Dependent T-Cell Epitope Prediction
T-cell epitope prediction was performed using the NetMHCIIpan version 4.1 web server (IEDB Analysis Resource) []. This platform uses an ensemble of deep neural networks trained on large quantitative binding datasets (IC₅₀ values) derived from multiple human leukocyte antigen (HLA) class II molecules, enabling panallelic prediction of peptide-HLA interactions.
The FASTA sequences of the VH and VL chains were analyzed separately. The alleles HLA-DRB1*01:01, HLA-DQA101:01/DQB105:01, HLA-DPA101:03/DPB102:01, and HLA-DQA105:01/DQB102:01 were selected due to their frequency and immunogenetic relevance in diverse human populations commonly used in therapeutic immunogenicity assessments. Epitope scanning was performed with a 15-amino-acid window (15-mer) sliding by one residue to ensure maximal coverage of overlapping peptides. The software generated an affinity score (nM) and a percentile rank for each predicted peptide. Following the server’s guidelines, peptides with rank ≤2.0% were classified as strong binders, while those with 2.0%<rank≤10% were considered weak binders.
Filtering, Ranking, and Epitope Selection
All prediction outputs were exported in CSV format. The data were subsequently filtered to retain only peptides with an affinity score ≥0.2 and low percentile rank, indicative of stable peptide–major histocompatibility complex (MHC) II complexes. Scores and ranks were compared across both antibody chains to identify high-affinity regions and potential immunogenic hotspots.
Population Coverage Analysis
To assess the global relevance and potential immunological reach of the predicted MHC class II–restricted T-cell epitopes, a population coverage (PC) analysis was conducted using the population coverage tool available at the IEDB [].
This analysis estimates the fraction of individuals within defined human populations that are likely to present one or more of the predicted epitopes, based on the distribution frequency of HLA alleles. The tool integrates the predicted epitope–HLA binding data obtained from NetMHCIIpan 4.1 with HLA genotypic frequencies derived from the Allele Frequency Net Database, which compiles large-scale datasets from diverse ethnic and geographical groups worldwide.
All epitopes predicted as strong or weak binders (rank ≤10%) across the selected HLA-DR, HLA-DP, and HLA-DQ alleles were included as input. The analysis was performed for multiple population sets, including global coverage, South American, and European cohorts, representing regions with significant therapeutic use of adalimumab and diverse HLA genetic backgrounds.
The algorithm computes several parameters: projected PC (%), representing the cumulative percentage of individuals expected to respond to at least one of the selected epitopes. Average number of epitope–HLA combinations recognized per individual, reflecting immune response redundancy. PC90 value (PC 90%), indicating the minimal number of epitope–HLA combinations required to cover 90% of the target population.
The results were exported in CSV format and visualized as bar and cumulative distribution plots to illustrate interregional variability in potential T-cell responsiveness. This step provides an estimate of the breadth and universality of the predicted epitope set, allowing prioritization of epitopes with the highest immunological representativeness across human populations.
Conservation Analysis of Immunoglobulin Domain in Bacteria
The conservation of amino acid residues from the immunoglobulin domain across various bacterial species in relation to adalimumab was analyzed using the Rate4Site algorithm on the ConSurf server (version 1.00) []. This algorithm calculates position-specific evolutionary rates using an empirical Bayesian approach. The rates are normalized and categorized into 9 grades, with highly conserved residues assigned a score of 9 and highly variable residues receiving a score of 1. The thresholds for these categories are based on the normalized evolutionary rates calculated by Rate4Site. Residues with scores of 1‐3 are considered highly variable, reflecting higher evolutionary rates and frequent mutations or substitutions across species. Conversely, residues with scores of 7 to 9 are classified as highly conserved, indicating minimal variability and strong evolutionary pressure to maintain their structure and function across species. These conservation rates were then visualized using the structural model of adalimumab obtained from the PDB using the Chimera tool [].
Allergenicity Prediction Using AllerTOP
The potential allergenicity of the predicted T-cell epitopes derived from the light chain of the adalimumab antibody was evaluated using the AllerTOP v.2.0 server [,]. This bioinformatics tool applies an alignment-independent approach based on auto- and cross-covariance transformation of protein sequences into uniform-length vectors, followed by machine learning classification using a k-nearest neighbor algorithm trained on a curated dataset of known allergens and nonallergens.
All epitopes predicted by NetMHCIIpan 4.1 as MHC class II binders (rank ≤10%) were used as input sequences in FASTA format. Each peptide was analyzed individually to determine its probability of being classified as “Probable Allergen” or “Probable Non-Allergen,” according to the physicochemical descriptors of amino acid residues (hydrophobicity, size, flexibility, and secondary structure propensity).
The results were automatically compared to the training dataset of AllerTOP, which includes more than 2400 allergenic and 2400 nonallergenic proteins, allowing for an indirect homology-free prediction of allergenic potential. The outcomes were exported and tabulated, recording for each peptide (1) the most similar protein identified, (2) the source organism, and (3) the allergenicity classification.
Peptides classified as “Probable Allergen” were further analyzed for potential molecular mimicry with known allergens of plant, fungal, or arthropod origin to assess possible cross-reactivity risks. The combined use of NetMHCIIpan 4.1 and AllerTOP v.2.0 allowed an integrated evaluation of both T-cell immunogenicity and allergenicity potential for the light chain–derived epitopes of adalimumab.
Ethical Considerations
This study was conducted entirely using in silico methods and publicly available, deidentified data. No human participants, personal information, or identifiable records were accessed; therefore, issues of privacy and confidentiality do not apply. No compensation was provided, as no participants, either human or animal subjects, were involved. Consequently, institutional ethical approval and informed consent were not required for this research.
Results
PSI-BLASTp
To explore potential molecular similarities between the monoclonal antibody and bacterial proteins, a PSI-BLASTp search was performed. This analysis was conducted to identify possible cross-reactive epitopes that could contribute to off-target interactions or immunological cross-reactivity, which are relevant for understanding antibody specificity and safety. The search revealed 40 significant matches between the monoclonal sequence and bacterial antigens. From these, 8 sequences corresponding to bacteria of clinical relevance were selected for further analysis (). Comparative alignment between the adalimumab light chain and the identified bacterial homologs demonstrated an average amino acid identity of 64%. This identity represents amino acid residues that are identical and located in the same position when the sequences are aligned. Higher sequence similarity increases the likelihood of cross-reactivity, as it suggests that the compared molecules may share structurally conserved epitopes capable of being recognized by the same antibodies. The most conserved region was located between residues 74 and 150 (), suggesting a potential structural or functional similarity in this segment. The analysis revealed that adalimumab light chain shares sequence homology with a bacterial protein containing an immunoglobulin domain, suggesting possible evolutionary or conformational parallels. Similar results were obtained for the heavy chain, although some alignments involved hypothetical bacterial proteins. Consequently, subsequent analyses focused on the heavy chain and the homologs that could be fully annotated (Table S1 in ).
| Bacteria | Antigen | Similarity (%) | Cod genbank | RMSD |
| Escherichia coli | IDCP | 94 | HEC3531043·1 | 0.2 |
| Corynebacterium pyruviciproducens | IDCP | 89.67 | WP_280195946·1 | 0.35 |
| Klebsiella pneumoniae | IDCP | 82.86 | WP_317090695·1 | 0.3 |
| Vibrio vulnificus | IDCP | 75.00 | MCA0777086·1 | 0.5 |
| Pseudomonas sp. | IDCP | 67.80 | MCX2891938·1 | 0.6 |
| Helicobacter pylori | IDCP | 62.84 | WP_304481324.1 | 0.5 |
| Salmonella enterica | IDCP | 55.56 | MBS2599521.1 | 0.9 |
| Staphylococcus aureus | IDCP | 43.84 | WP_282719268.1 | 0.8 |
aRMSD values below 1 Å indicate an exceptionally high degree of structural similarity, often reflecting near-identical alignments. This level of similarity is particularly relevant in studies of molecular mimicry, as it suggests that the structures may share conserved functional or antigenic regions, increasing the likelihood of cross-reactivity. Such low RMSD values underscore the robustness of the alignments and the potential biological significance of the identified matches.
bIDCP: immunoglobulin domain–containing protein.
To further characterize the similarities detected through PSI-BLASTp, pairwise alignments were conducted between the adalimumab light chain and the bacterial homologs. This analysis was designed to quantify the degree of sequence conservation and to identify bacterial species exhibiting the closest resemblance to the therapeutic antibody. The results showed identity values ranging from moderate to high (43.84%‐94%) between adalimumab and the bacterial antigens. The highest sequence conservation was observed with E. coli and Corynebacterium pyruviciproducens ( and ). These findings suggest that certain bacterial proteins share notable similarity with adalimumab, which may be relevant for understanding potential cross-reactive interactions.

Modeling and Structural Analysis
To evaluate whether the sequence similarities observed translated into comparable 3D conformations, structural models were generated for the bacterial antigens listed in . This analysis aimed to determine the extent to which these bacterial proteins might adopt folds resembling those of adalimumab, thereby providing structural evidence of potential mimicry. The resulting models showed that the bacterial antigens consistently adopted the characteristic immunoglobulin-like fold (). In the cases of C. pyruviciproducens, E. coli, Staphylococcus aureus, and Vibrio vulnificus, the predicted structures were organized as dimers. The GMQE scores indicated reliable model quality, with the lowest value (0.70) corresponding to Salmonella enterica. This parameter combines information from the sequence alignment and the quality of the structural template to estimate the expected accuracy of the final model; GMQE values range from 0 to 1, and scores above 0.6 are generally considered indicative of high-confidence structural predictions []. Furthermore, RMSD analyses demonstrated a high degree of structural similarity between adalimumab and the bacterial models (). RMSD values represent the average atomic distance between 2 superimposed structures, where values below 1 Å suggest a nearly identical spatial organization []. In particular, the remarkably low RMSD observed for E. coli (0.2 Å) provides compelling evidence of molecular mimicry, as such minimal deviation indicates that both molecules share an almost indistinguishable folding pattern. This structural conservation not only supports the sequence-based similarities but also implies that the bacterial proteins could expose epitopes in a conformation highly compatible with antibody recognition, thereby favoring potential cross-reactivity or recognition from adalimumab antibodies to bacterial proteins and adalimumab.


B-Cell Epitope Prediction
To explore whether the sequence similarities between adalimumab and bacterial homologs could translate into shared antigenic regions, an in silico prediction of B-cell epitopes was performed. This computational approach aimed to identify potential linear epitopes within adalimumab that might overlap with conserved bacterial sequences and therefore represent possible cross-reactive sites. The analysis predicted 5 B-cell epitopes in adalimumab showing varying degrees of similarity to bacterial homologs (). The predicted epitopes differed in length, with epitopes 1 and 5 containing the largest number of residues, while epitope 2 comprised only 4 residues shared among the bacterial antigens examined. Structural projection of the predicted epitopes revealed that potential cross-reactive regions are distributed across different areas of the antibody surface (). Additionally, surface modeling indicated that these epitopes collectively occupy a substantial portion of the molecular surface (). Although the predicted epitopes ranged from 4 to 16 amino acids, even short conserved sequences can be relevant for antigen recognition, as complementarity-determining regions within antibodies—often only 6‐20 residues long—are primarily responsible for specific antigen binding [].
| Epitope | Sequences | Start | End | Residues | Score |
| 1 | TLSKADYEKHKV | 220 | 232 | 12 | 0.802 |
| 2 | SSLQ | 116 | 119 | 4 | 0.5 |
| 3 | SGSGTD | 105 | 110 | 6 | 0.651 |
| 4 | SVGDR | 51 | 55 | 5 | 0.653 |
| 5 | HQGLSSPVTKSFNRGE | 240 | 255 | 16 | 0.676 |
aThe scores assigned to each predicted epitope range from 0 to 1, with values closer to 1 indicating a stronger prediction. A score of 0.802 for epitope 1, for example, suggests a high level of confidence in its conservation and potential functional relevance across bacterial homologs. The biological relevance of these results suggests that, among the entire antigen sequence, the region corresponding to the predicted epitope is the most likely to be immunogenic and therefore to be recognized by antibodies.

MHC Class II–Dependent T Epitope Prediction
T-cell epitope predictions using the IEDB MHC II binding tool identified multiple adalimumab-derived peptides with strong affinity (percentile ≤2%) for HLA-DRB1 alleles previously associated with anti-adalimumab antibodies. Both heavy- and light-chain regions contained potential CD4+ T-cell epitopes presented by these risk alleles, supporting a T-cell–dependent mechanism of immunogenicity (Tables S2 and S3 in ). Screening of the clinically relevant bacterial proteomes revealed peptides with comparable high-affinity binding to the same HLA molecules and partial sequence similarity to adalimumab epitopes (not shown). This overlap suggests that microbial antigens may share HLA-restricted motifs with adalimumab, potentially enabling cross-reactive T-cell responses that contribute to antidrug antibody formation.
PC Analysis
PC analysis performed using the IEDB Population Coverage tool showed that the predicted class II epitopes from the adalimumab light chain could be presented by approximately 15.2% of the European population ( and Table S4 in ). The mean number of epitope–HLA combinations recognized per individual was 3.19 (SD ), with a PC90 value of 2.48. These results indicate that a limited yet notable fraction of the population carries HLA alleles capable of presenting these epitopes, supporting the existence of potential interindividual variability in T-cell responsiveness to adalimumab.
Conservation Analysis of Immunoglobulin Domain in Bacteria
To evaluate whether the structural similarities between adalimumab and bacterial homologs reflect conserved evolutionary features, a conservation analysis of the immunoglobulin domain was performed using the ConSurf server. This analysis aimed to identify amino acid residues that are evolutionarily conserved across bacterial antigens and the antibody, which could indicate the maintenance of structural or functional motifs important for protein stability or interaction. The results, summarized in , highlight conserved residues mapped onto the amino acid sequence and the corresponding 3D structures. The cartoon representations illustrate that several regions of the immunoglobulin domain remain highly conserved among the analyzed bacterial species. The conservation score gradient, represented by ConSurf’s color scale from cyan (variable=grade 1) to purple (highly conserved=grade 9), emphasizes that key residues within the core of the domain exhibit strong conservation, suggesting evolutionary pressure to maintain structural integrity in these regions.

Allergenicity Prediction Using AllerTOP
Allergenicity analysis performed using the AllerTOP server revealed that several of the predicted adalimumab-derived peptides share sequence similarity with known or probable allergens. Notably, matches were identified with allergenic proteins from Sarcoptes scabiei (Sarc s 1), Artemisia vulgaris (Art v 3), and Malassezia sympodialis, as well as with plant-derived allergens such as hydroxyproline-rich glycoproteins from Oryza sativa and neoxanthin synthase from Solanum tuberosum. Some peptides also showed similarity to human proteins, including ATP synthase F1 assembly factor 2 and B-cell lymphoma 6 protein, suggesting potential immunological cross-reactivity. Overall, the presence of sequences with predicted allergenic properties indicates that these epitopes may elicit immune recognition and could contribute to the immunogenic potential of adalimumab.
Discussion
Principal Findings
This study identified several bacterial antigens that share significant sequence and structural similarities with adalimumab, particularly within immunoglobulin-like domains. Using in silico analyses, we found that these bacterial proteins exhibit conserved folds and low RMSD values relative to adalimumab, supporting the hypothesis of molecular mimicry. Such a resemblance may provide a mechanistic explanation for the development of ADAs and the loss of therapeutic response observed in some patients treated with adalimumab. To our knowledge, this is the first study to propose a mechanism of cross-reactivity between adalimumab and microbial antigens supported by in silico structural and sequence evidence. Moreover, it is the first report describing specific bacterial epitopes with potential clinical relevance in the context of adalimumab immunogenicity.
Up to 60% of patients who receive adalimumab eventually show a decline in the effectiveness of the treatment []. Therapeutic failure is thought to be caused by ADAs, but the reasons for its formation are still unknown. Some past infections may have contributed to the development of ADAs []. In this setting, molecular mimicry could explain why prior infections produce these antibodies. Molecular mimicry may cause antibodies to be generated during past infections to inadvertently neutralize adalimumab by cross-reacting with its epitopes, diminishing its therapeutic effect. So, we propose using bioinformatics to investigate this phenomenon. In support of this hypothesis, our in silico T-cell epitope predictions identified adalimumab peptides with strong HLA-DRB1 binding—alleles previously linked to anti-adalimumab antibodies—and revealed bacterial peptides capable of binding the same HLA molecules, suggesting a potential cross-reactive, T-cell–mediated mechanism underlying ADA formation.
To establish a link between certain microbes and molecular mimicry, we must follow a 4-tiered evidence approach [,]. Previously, an epidemiological connection was established []. Now, using in silico methods, we have identified potential antigens with significant identity, indicating possible mimicry. Although the analyzed fragments are relatively short, they were selected from clinically relevant bacteria due to their potential involvement in molecular mimicry processes. Information regarding their immunodominant regions is currently unavailable, as this is the first study addressing these specific bacterial-antibody similarities. This finding suggests that clinically relevant bacteria, including E. coli, C. pyruviciproducens, Klebsiella pneumoniae, V. vulnificus, Pseudomonas aeruginosa, Helicobacter pylori, S. enterica, and S. aureus, may harbor molecular motifs capable of inducing cross-reactivity. These microorganisms are of major medical concern as they encompass commensals with pathogenic potential, such as E. coli []; opportunistic pathogens linked to severe infections like C. pyruviciproducens [,]; and multidrug-resistant strains such as K. pneumoniae [,]. Others, including V. vulnificus and P. aeruginosa, are associated with life-threatening conditions such as necrotizing soft-tissue infections and ventilator-associated pneumonia, respectively [,]. Moreover, H. pylori, S. enterica, and S. aureus contribute to chronic and systemic diseases ranging from gastritis and typhoid fever to osteomyelitis and sepsis [-]. In addition, population coverage analysis using the IEDB tool showed that the predicted class II epitopes from the adalimumab light chain could be presented by approximately 15% of the European population, suggesting that only a subset of individuals possess HLA alleles capable of recognizing these epitopes and may therefore be more prone to T-cell–mediated immunogenic responses.
The observed sequence similarity within immunoglobulin-like domains highlights a plausible mechanism through which antibodies targeting microbial antigens could recognize adalimumab epitopes, potentially impairing its therapeutic efficacy. This warrants further experimental validation through in vitro inhibition assays and in vivo studies to confirm the immunological and clinical significance of these mimicry events. These predictions also provide candidate regions for in vitro validation, enabling the refinement of epitope mapping through assays such as enzyme-linked immunosorbent assay or peptide-binding tests. Additionally, the complete amino acid sequences of the bacterial antigens are available for future recombinant cloning and seroprevalence studies in adalimumab-treated patients.
In addition, allergenicity prediction using the AllerTOP server indicated that several adalimumab-derived peptides share sequence similarity with known or probable allergens from diverse sources, including S. scabiei, A. vulgaris, and O. sativa. This overlap suggests that certain epitopes may possess inherent immunostimulatory properties or cross-reactivity with environmental allergens, further supporting their potential contribution to the immunogenic profile of adalimumab.
Our results have cast a new light on the intricate role of molecular mimicry in drug development, merging concepts from diverse therapeutic domains. Historically, molecular structures resembling those of microbial antigens have been exploited to elicit immunological benefits, enhancing the body’s defenses against diseases and improving vaccine efficacy, and even cancer response [-]. However, our findings underscore a more intricate reality where these structural analogies, notably those present in adalimumab, bear the potential to induce ADA development. This duality reflects broader pharmaceutical experiences, where beneficial immunogenic mimicry can, in some contexts, inadvertently lead to immune cross-reactions with conditions like thrombocytopenia and infectious and autoimmune diseases [,-]. The resulting immunological disarray, wherein the body cannot differentiate between therapeutic agents and microbial antigens, might trigger an unwarranted immune response against the host’s own tissues, manifesting in a range of adverse clinical outcomes such as autoimmune disease relapses, serum sickness, hypersensitivity reactions, or symptoms of autoimmune disease. Thus, our study bridges the existing knowledge gap by revealing how the same molecular resemblances that have been leveraged for therapeutic gain may also carry risks that must be carefully navigated in the continuum of drug research and patient care.
Unraveling the connections of shared molecular similarities between drugs and various microorganisms—pathogens and commensals alike—can illuminate potential reasons behind the unintended effects some medications might have on the immune system’s response to treatment. Specifically, within the therapeutic framework, molecular mimicry may introduce complexities in treatment modalities due to its potential to elicit aberrant immune responses. Such phenomena can attenuate the efficacy of the medication or precipitate adverse immunological reactions through ADA development. We also consider it may be beneficial to adjust the drug’s formulation and design to lessen its immunogenic potential while maintaining therapeutic effectiveness, a task that is complex yet crucial [].
In the realm of precision medicine, the significance of molecular mimicry between monoclonal antibodies and microorganisms is profound. This study highlights the presence of antigens with immunoglobulin domains in microbes such as E. coli, H. pylori, and S. aureus, common colonizers [-], as well as in those linked to severe diseases, including K. pneumoniae, V. vulnificus, S. enterica, C. pyruviciproducens, and Pseudomonas species [,,,,]. These immunoglobulin domains, which are involved in diverse binding and molecular recognition processes, have been identified across a spectrum of functional groups, including molecular transport, morphoregulation, and cell adhesion to virus receptors, shape recognition, and toxin neutralization [,]. The remarkable functional versatility of the immunoglobulin superfamily extends to cell phenotype markers and regulators of gene transcription, among others. Thus, the structural and functional parallels found in this study underscore the need for careful consideration of molecular mimicry in drug development, particularly in the design of monoclonal antibodies, due to their potential to elicit unintended immune responses or interfere with microbial commensals critical to human health.
It is important to acknowledge the limitations of our study. The actual structures may differ from the models we propose, as in silico modeling and epitope prediction analyses are not definitive. Nevertheless, bioinformatic approaches offer significant advantages by efficiently guiding research efforts []. They play a crucial role in the initial evaluation of hypotheses, helping determine whether further in vitro studies are warranted. Additionally, bacteria may retain remnants of vectors through horizontal and vertical transmission, whether occurring naturally or during biotechnological processes. However, since not all the bacteria analyzed are associated with biotechnology, any bacterial contamination, if present, is more likely to result from natural processes and evolutionary mechanisms. Moreover, given that all the bacterial antigens studied contain an immunoglobulin-like domain—a feature widely reported across various organisms and particularly in enterobacteria []—we consider it unlikely that these findings are due to contamination.
While in silico predictions provide valuable preliminary insights, we acknowledge that they cannot fully confirm the occurrence of molecular mimicry in a biological context. Nonetheless, the results provide a rational basis for the design of forthcoming serological studies, guiding the selection of candidate antigens and regions with potential cross-reactivity. This work represents a first step toward understanding the mechanisms underlying cross-reactive immune responses between microbial antigens and therapeutic antibodies. The high degree of structural similarity observed suggests a biologically plausible mechanism of cross-reactivity, which we plan to further investigate through in vitro validation assays. In addition, we acknowledge that immunogenicity is a multifactorial process influenced not only by molecular mimicry but also by protein-specific properties, manufacturing conditions, and patient-related factors. Nonetheless, molecular mimicry remains one of the least explored contributors to therapeutic antibody immunogenicity, and our findings provide the first evidence-supported hypothesis proposing this mechanism in the context of adalimumab.
As we look toward future pharmaceutical innovation, the insights gained from this inquiry advocate for an integrated approach. This approach should encompass a thorough investigation of the interplay between drugs, the human microbiome, and pathogenic microorganisms. By doing so, we can strive to harness the positive aspects of molecular mimicry while mitigating its risks, thereby advancing the field of medicine with a more informed and cautious perspective.
Conclusions
In conclusion, examining adalimumab’s structural similarities with key microorganisms such as bacteria offers a nuanced perspective on molecular mimicry’s dual role in medicine. While its utility in enhancing therapeutic benefits is established, we urge a critical reevaluation based on our findings that raise the possibility of adverse immune reactions due to ADAs. Our results also point to the need to advance in the confirmation through in vitro and in vivo tests of this cross-reactivity, because this would make it a necessary and judicious approach to drug design, incorporating an integrated analysis of drug-pathogen-microbiome interactions to safeguard therapeutic efficacy and patient health.
Funding
The research was conducted without external funding. However, the article processing charge was supported by the 2025 Paul Kayser / RRF Global Award from the Retina Research FoundationThe research was conducted without external funding. However, the article processing charge was supported by the 2025 Paul Kayser / RRF Global Award from the Retina Research Foundation
Data Availability
Additional data is available from the corresponding author on request.
Authors' Contributions
DIP-S, AD-L-T, and MM conceptualized the study. DIP-S and MM curated the data. Formal analysis was performed by DIP-S, MM, GM-S, AS, and AD-L-T. Investigation was carried out by DIP-S, MM, AS, and AD-L-T. Methodology development was conducted by DIP-S, MM, GM-S, and AD-L-T. Validation was performed by AS and OC. Visualization was prepared by DIP-S. The original draft was written by DIP-S, while DIP-S, MM, GM-S, AS, AD-L-T, and OC contributed to reviewing and editing the manuscript.
Conflicts of Interest
None declared.
Computational workflow.
PNG File, 299 KBMultiple alignment between adalimumab light chain and homologs. The multiple alignments revealed a 64% identity among the compared amino acid sequences. The highest conservation is observed between residues 71 and 159.
PNG File, 855 KBPSI-BLASTp results and predicted class II T-cell epitopes for adalimumab chains, including HLA-DRB1*11:01–restricted peptides and projected population coverage.
DOCX File, 26 KBPopulation coverage analysis for class II epitopes predicted from the adalimumab light chain using the Immune Epitope Database (IEDB) population coverage tool. The bar and cumulative distribution plots represent the percentage of individuals in the European population predicted to recognize at least 1 human leukocyte antigen (HLA)–epitope combination, indicating an estimated coverage of approximately 15%.
PNG File, 257 KBReferences
- Ellis CR, Azmat CE. Adalimumab. StatPearls Publishing; 2024. URL: http://www.ncbi.nlm.nih.gov/books/NBK557889/ [Accessed 2024-02-26]
- Thaper D, Prabha V. Molecular mimicry: an explanation for autoimmune diseases and infertility. Scand J Immunol. Aug 2018;88(2):e12697. [CrossRef] [Medline]
- Strand V, Gonçalves J, Hickling TP, Jones HE, Marshall L, Isaacs JD. Immunogenicity of biosimilars for rheumatic diseases, plaque psoriasis, and inflammatory bowel disease: a review from clinical trials and regulatory documents. BioDrugs. Feb 2020;34(1):27-37. [CrossRef] [Medline]
- Cludts I, Spinelli FR, Morello F, Hockley J, Valesini G, Wadhwa M. Anti-therapeutic antibodies and their clinical impact in patients treated with the TNF antagonist adalimumab. Cytokine. Aug 2017;96:16-23. [CrossRef] [Medline]
- Segal Y, Shoenfeld Y. Vaccine-induced autoimmunity: the role of molecular mimicry and immune crossreaction. Cell Mol Immunol. Jun 2018;15(6):586-594. [CrossRef] [Medline]
- Kanduc D. Peptide cross-reactivity: the original sin of vaccines. Front Biosci (Schol Ed). Jun 2012;4(4):1393-1401. [CrossRef] [Medline]
- Cusick MF, Libbey JE, Fujinami RS. Molecular mimicry as a mechanism of autoimmune disease. Clin Rev Allergy Immunol. Feb 2012;42(1):102-111. [CrossRef] [Medline]
- Bodelón G, Palomino C, Fernández LÁ. Immunoglobulin domains in Escherichia coli and other enterobacteria: from pathogenesis to applications in antibody technologies. FEMS Microbiol Rev. Mar 2013;37(2):204-250. [CrossRef] [Medline]
- Hüttener M, Hergueta J, Bernabeu M, et al. Roles of proteins containing immunoglobulin-like domains in the conjugation of bacterial plasmids. mSphere. Feb 2022;7(1):e0097821. [CrossRef] [Medline]
- Dotan A, Shoenfeld Y. Perspectives on vaccine induced thrombotic thrombocytopenia. J Autoimmun. Jul 2021;121:102663. [CrossRef] [Medline]
- Bozkurt B, Kamat I, Hotez PJ. Myocarditis with COVID-19 mRNA vaccines. Circulation. Aug 2021;144(6):471-484. [CrossRef] [Medline]
- Garg RK, Paliwal VK. Spectrum of neurological complications following COVID-19 vaccination. Neurol Sci. Jan 2022;43(1):3-40. [CrossRef] [Medline]
- Bragazzi NL, Bridgewood C, Sharif K, et al. HPV vaccines and lupus: current approaches towards preventing adverse immune cross-reactivity. Expert Rev Vaccines. Jan 2019;18(1):31-42. [CrossRef] [Medline]
- Hausmann OV, Seitz M, Villiger PM, Pichler WJ. The complex clinical picture of side effects to biologicals. Med Clin North Am. Jul 2010;94(4):791-804. [CrossRef] [Medline]
- Hässler S, Bachelet D, Duhaze J, et al. Clinicogenomic factors of biotherapy immunogenicity in autoimmune disease: a prospective multicohort study of the ABIRISK consortium. PLoS Med. Oct 2020;17(10):e1003348. [CrossRef] [Medline]
- Knox C, Law V, Jewison T, et al. DrugBank 3.0: a comprehensive resource for “OMICS” research on drugs. Nucleic Acids Res. Jan 2011;39(Database issue):D1035-D1041. [CrossRef] [Medline]
- Guzenko D, Burley SK, Duarte JM. Real time structural search of the Protein Data Bank. PLoS Comput Biol. Jul 2020;16(7):e1007970. [CrossRef] [Medline]
- Hu G, Kurgan L. Sequence similarity searching. Curr Protoc Protein Sci. Feb 2019;95(1):e71. [CrossRef] [Medline]
- Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL. NCBI BLAST: a better web interface. Nucleic Acids Res. Jul 1, 2008;36:W5-W9. [CrossRef] [Medline]
- PRALINE multiple sequence alignment. Center for Integrative Bioinformatics. URL: https://www.ibi.vu.nl/programs/pralinewww [Accessed 2025-04-25]
- Protein BLAST: search protein databases using a protein query. National Center for Biotechnology Information. URL: https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&BLAST_SPEC=&LINK_LOC=blasttab&LAST_PAGE=blastx [Accessed 2024-12-18]
- BLAST statistics: the expect value. National Center for Biotechnology Information. URL: https://www.nlm.nih.gov/ncbi/workshops/2023-08_BLAST_evol/e_value.html [Accessed 2024-12-12]
- Research Collaboratory for Structural Bioinformatics Protein Data Bank. URL: https://www.rcsb.org [Accessed 2025-12-02]
- Wheeler DL, Barrett T, Benson DA, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. Jan 2006;34:D173-D180. [CrossRef] [Medline]
- Mooers BHM. Simplifying and enhancing the use of PyMOL with horizontal scripts. Protein Sci. Oct 2016;25(10):1873-1882. [CrossRef] [Medline]
- Waterhouse A, Bertoni M, Bienert S, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. Jul 2, 2018;46(W1):W296-W303. [CrossRef] [Medline]
- Pettersen EF, Goddard TD, Huang CC, et al. UCSF Chimera: a visualization system for exploratory research and analysis. J Comput Chem. Oct 2004;25(13):1605-1612. [CrossRef] [Medline]
- Baker D, Sali A. Protein structure prediction and structural genomics. Science. Oct 2001;294(5540):93-96. [CrossRef] [Medline]
- Ponomarenko J, Bui HH, Li W, et al. ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics. Dec 2, 2008;9:514. [CrossRef] [Medline]
- ElliPro: antibody epitope prediction. Immune Epitope Database. URL: http://tools.iedb.org/ellipro/ [Accessed 2024-12-18]
- MHC-II binding predictions. IEDB Analysis Resource. URL: https://tools.iedb.org/mhcii/ [Accessed 2025-12-02]
- Population coverage. IEDB Analysis Resource. URL: https://tools.iedb.org/population/ [Accessed 2025-12-02]
- Ashkenazy H, Abadi S, Martz E, et al. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. Jul 8, 2016;44(W1):W344-W350. [CrossRef] [Medline]
- AllerTOP. Drug Design and Bioinformatics Lab. URL: https://www.ddg-pharmfac.net/AllerTOP [Accessed 2025-12-02]
- Dimitrov I, Bangov I, Flower DR, Doytchinova I. AllerTOP v.2—a server for in silico prediction of allergens. J Mol Model. Jun 2014;20(6). [CrossRef] [Medline]
- Holm L. Using Dali for protein structure comparison. In: Gáspári Z, editor. Structural Bioinformatics: Methods and Protocols. Humana; 2020:29-42. [CrossRef]
- Svilenov HL, Sacherl J, Protzer U, Zacharias M, Buchner J. Mechanistic principles of an ultra-long bovine CDR reveal strategies for antibody design. Nat Commun. Nov 2021;12(1):6737. [CrossRef] [Medline]
- Rendón MA, Saldaña Z, Erdem AL, et al. Commensal and pathogenic Escherichia coli use a common pilus adherence factor for epithelial cell colonization. Proc Natl Acad Sci U S A. Jun 19, 2007;104(25):10637-10642. [CrossRef] [Medline]
- Tong J, Liu C, Summanen PH, Xu H, Finegold SM. Corynebacterium pyruviciproducens sp. nov., a pyruvic acid producer. Int J Syst Evol Microbiol. May 2010;60(Pt 5):1135-1140. [CrossRef] [Medline]
- Fernandez LV, Fortuny AS, Rodriguez EF. Mastitis por Corynebacterium pyruviciproducens y Corynebacterium amycolatum en mujeres inmunocompetentes no lactantes. Rev Argent Microbiol. Jan 2021;53(1):39-42. [CrossRef] [Medline]
- Arato V, Raso MM, Gasperini G, Berlanda Scorza F, Micoli F. Prophylaxis and treatment against Klebsiella pneumoniae: current insights on this emerging anti-microbial resistant global threat. Int J Mol Sci. Apr 14, 2021;22(8):4042. [CrossRef] [Medline]
- Li D, Huang X, Rao H, et al. Klebsiella pneumoniae bacteremia mortality: a systematic review and meta-analysis. Front Cell Infect Microbiol. 2023;13:1157010. [CrossRef] [Medline]
- Yun NR, Kim DM. Vibrio vulnificus infection: a persistent threat to public health. Korean J Intern Med. Nov 2018;33(6):1070-1078. [CrossRef] [Medline]
- Reynolds D, Kollef M. The epidemiology and pathogenesis and treatment of Pseudomonas aeruginosa infections: an update. Drugs (Abingdon Engl). Dec 2021;81(18):2117-2131. [CrossRef] [Medline]
- Mladenova I. Clinical relevance of Helicobacter pylori infection. JCM. 2021;10(16):3473. [CrossRef] [Medline]
- Galán-Relaño Á, Valero Díaz A, Huerta Lorenzo B, et al. Salmonella and salmonellosis: an update on public health implications and control strategies. Animals (Basel). Nov 27, 2023;13(23):3666. [CrossRef] [Medline]
- Touaitia R, Mairi A, Ibrahim NA, Basher NS, Idres T, Touati A. Staphylococcus aureus: a review of the pathogenesis and virulence mechanisms. Antibiotics (Basel). May 6, 2025;14(5):470. [CrossRef] [Medline]
- Bondareva M, Budzinski L, Durek P, et al. Cross-regulation of antibody responses against the SARS-CoV-2 spike protein and commensal microbiota via molecular mimicry. Cell Host Microbe. Nov 8, 2023;31(11):1866-1881. [CrossRef] [Medline]
- Li N, Zhang G, Chen Y, et al. Identification of the mimotopes within the major capsid protein L1 of human papillomavirus types 18 and 45, using neutralizing monoclonal antibodies. Int J Biol Macromol. Mar 2021;174:587-595. [CrossRef] [Medline]
- Horwacik I, Golik P, Grudnik P, et al. Structural basis of GD2 ganglioside and mimetic peptide recognition by 14G2a antibody. Mol Cell Proteomics. Oct 2015;14(10):2577-2590. [CrossRef] [Medline]
- Gilchuk P, Murin CD, Milligan JC, et al. Analysis of a therapeutic antibody cocktail reveals determinants for cooperative and broad ebolavirus neutralization. Immunity. Feb 18, 2020;52(2):388-403. [CrossRef] [Medline]
- Pham NB, Meng WS. Protein aggregation and immunogenicity of biotherapeutics. Int J Pharm. Jul 2020;585:119523. [CrossRef] [Medline]
- Martinson JNV, Walk ST. Escherichia coli residency in the gut of healthy human adults. EcoSal Plus. Sep 2020;9(1):10. [CrossRef] [Medline]
- Maixner F, Krause-Kyora B, Turaev D, et al. The 5300-year-old Helicobacter pylori genome of the Iceman. Science. Jan 8, 2016;351(6269):162-165. [CrossRef] [Medline]
- Gherardi G. Staphylococcus aureus infection: pathogenesis and antimicrobial resistance. Int J Mol Sci. May 3, 2023;24(9):8182. [CrossRef] [Medline]
- Gal-Mor O. Persistent infection and long-term carriage of typhoidal and nontyphoidal salmonellae. Clin Microbiol Rev. Jan 2019;32(1):e00088-18. [CrossRef] [Medline]
- Qin S, Xiao W, Zhou C, et al. Pseudomonas aeruginosa: Pathogenesis, virulence factors, antibiotic resistance, interaction with host, technology advances and emerging therapeutics. Sig Transduct Target Ther. 2022;7(1):199. [CrossRef]
Abbreviations
| ADA: antidrug antibody |
| GMQE: Global Model Quality Estimate |
| HLA: human leukocyte antigen |
| IDCP: immunoglobulin domain-containing protein |
| IEDB: Immune Epitope Database |
| MHC: major histocompatibility complex |
| PC: population coverage |
| PDB: Protein Data Bank |
| PI: protrusion index |
| RMSD: root mean square deviation |
Edited by Gunther Eysenbach, Huihuang Yan; submitted 09.Sep.2025; peer-reviewed by Ishwerpreet Kaur Jawanda, Jonaid Ahmad Malik, Juan Urrego; final revised version received 05.Nov.2025; accepted 14.Nov.2025; published 08.Dec.2025.
Copyright© Diana Isabel Pachón-Suárez, Germán Mejía-Salgado, Oscar Correa, Andrés Sánchez, Marlon Munera, Alejandra de-la-Torre. Originally published in JMIR Bioinformatics and Biotechnology (https://bioinform.jmir.org), 8.Dec.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Bioinformatics and Biotechnology, is properly cited. The complete bibliographic information, a link to the original publication on https://bioinform.jmir.org/, as well as this copyright and license information must be included.

