Backgrounds:
Systemic lupus erythematosus (SLE) is an autoimmune disease with multi-organ involvement, and lupus nephritis (LN) is a severe manifestation. Long non-coding RNAs (lncRNAs) have been implicated in regulating immune responses in autoimmune diseases. LOC100130476, a lncRNA located on chromosome 6q23.3, has been linked to inflammation and cancer progression, but its role in SLE and LN remains unclear.
Methods:
We studied the association between the rs80213143 variant at LOC100130476 and SLE susceptibility in a Chinese Han cohort, using SNP genotyping and Bonferroni correction for multiple comparisons. Functional annotations were conducted to explore the effects of rs80213143 on transcription factor binding and gene expression. eQTL analysis was performed to assess the variant’s impact on immune cell gene expression.
Results:
Within LOC100130476, the strongest association was observed at rs80213143 (p = 2.5 × 10−7), which was successfully replicated (p = 2.64 × 10−9) in an independent cohort. The combined analysis of both discovery and replication cohorts reinforced the genetic association (pmeta = 2.04 × 10−14). The risk C allele was linked to more severe renal involvement, including higher 24-h proteinuria and serum creatinine levels. Functional annotations indicated that rs80213143 potentially influences immune cell functionality through regulatory motif alterations. The expression of LOC100130476 was abnormally upregulated in the whole blood of SLE patients, particularly in lupus nephritis patients. Moreover, the expression of LOC100130476 was significantly upregulated in the biopsy samples of lupus nephritis patients. Differentially expressed genes in whole blood between SLE patients and healthy donors, positively associated with LOC100130476 expression, were significantly enriched in pathways involving T cell receptor signaling, antigen presentation, interferon response, and apoptosis. Furthermore, LOC100130476 showed positive associations with genes differentially expressed between LN patients' renal biopsy tissues and adjacent normal renal tissues, enriched in leukocyte-mediated immunity, inflammatory responses, extracellular matrix and tissue repair pathways, and the PI3K signaling network.
Conclusion:
The rs80213143 variant in LOC100130476 is associated with SLE susceptibility and renal involvement. Its elevated expression in lupus nephritis suggests it may be an important factor in disease pathogenesis and a potential biomarker for lupus nephritis.
IntroductionSystemic lupus erythematosus (SLE) is a prototypical autoimmune disease characterized by the production of autoantibodies against nuclear antigens, leading to inflammation and multi-organ involvement. The pathogenesis of SLE, while not fully understood, is influenced by genetic, environmental, and immunological factors. Recent advances have highlighted the role of long non-coding RNAs (lncRNAs) in the pathogenesis of SLE. LncRNAs, crucial members of the non-coding RNA family, are RNA transcripts exceeding 200 nucleotides without protein-coding capability (Kapranov et al., 2007). Identified as immune regulators, their dysregulation plays a role in autoimmune diseases, including SLE (Tsai et al., 2020).
Previous studies have demonstrated abnormalities in lncRNA expression in SLE patients. In peripheral blood mononuclear cells (PBMCs), 8,868 lncRNAs showed differential expression between SLE patients and healthy donors, with 3,657 upregulated and 5,211 downregulated (Luo et al., 2018). Similarly, in monocyte-derived dendritic cells (moDCs), 163 lncRNAs exhibited differential expression between SLE and controls (Wang et al., 2018). The versatile role of lncRNA in immunity and inflammation is highlighted by its correlation with SLE disease activity. Specifically, the expression levels of ENST00000604411.1, ENST00000501122.2, NEAT1 in moDCs, and lnc7514 in PBMCs positively correlated with clinical disease activity (Wang et al., 2018; Zhang et al., 2016; Wang et al., 2019). In patients positive for anti-dsDNA, lnc7514 levels were markedly lower than in those negative for anti-dsDNA (Wang et al., 2019). Furthermore, lnc3643 expression levels were associated with C-reactive protein and erythrocyte sedimentation rates in SLE patients (Wang et al., 2019).
Evidence from genome-wide association studies (GWAS) and candidate gene analyses has highlighted the association of single nucleotide polymorphisms (SNPs) with susceptibility to SLE. The variant rs13259960, situated in an intronic enhancer region of SLEAR, has been linked to increased susceptibility to SLE (P = 1.03 × 10−11), influencing SLEAR expression that correlates positively with cell death in the peripheral blood of SLE patients (Fan et al., 2020). Similarly, the risk alleles rs205764 and rs547311, located in the promoter region of linc00513, are known to elevate linc00513 expression by enhancing promoter activity, contributing to SLE. Linc00513 acts as a positive regulator of the type I interferon pathway and correlates positively with the IFN score in SLE patients (Xue et al., 2018).
We reviewed previous GWAS data with a focus on lncRNA-coding genes (Sun et al., 2016). Among the Chinese Han population from Beijing, rs80213143 at LOC100130476 emerged as a locus significantly associated with increased susceptibility to systemic lupus erythematosus (SLE) (p = 2.5 × 10−7, OR 2.63, 95% CI 1.80–3.84). Furthermore, this study replicated the genetic association between rs80213143 and SLE susceptibility and explored the role of LOC100130476 in the pathogenesis and progression of SLE.
MethodsParticipantsWe utilized genetic association results from a previous GWAS (Sun et al., 2016) involving a Beijing cohort as our discovery set, which included 490 SLE patients and 493 controls. These GWAS summary-level association data were obtained from a publicly available, previously published study (Sun et al., 2016) and were used for secondary analysis in the present study. An independent replication cohort, consisting of 1003 SLE patients and 815 geographically matched, unrelated healthy controls, was recruited from Henan, Central China. All SLE participants fulfilled the 1982 American College of Rheumatology (ACR) classification criteria for SLE (Tan et al., 1982), as revised by Hochberg in 1997 (Hochberg, 1997). Ethical approval for the study was granted by the Medical Ethics Committee of Zhengzhou University First Hospital (2019-KY-247), ensuring compliance with the Declaration of Helsinki. Informed consent was obtained from all participants.
Polymorphism selection and genotypingIn the discovery cohort, we included all SNPs within the LOC100130476 gene region covered by the ImmunoChip, as outlined previously (Sun et al., 2016). The publicly available GWAS datasets used in this study were derived from previously published analyses in which PCA-based adjustment had already been performed. In the replication cohort, the SNP rs80213143 at LOC100130476, identified as the most significant, was genotyped using the Sequenom MassARRAY system, achieving a genotyping completion rate of over 97%.
Variant annotation and expression analysis across immune cellsWe conducted annotations of the variants' regulatory elements through HaploReg v4.2 (Ward and Kellis, 2012) and RegulomeDB v2.2 databases (Boyle et al., 2012). For assessing cell type-specific expression quantitative trait loci (eQTL) and the expression of target genes, ImmuNexUT (Immune Cell Gene Expression Atlas from the University of Tokyo) (Ota et al., 2021), which encompasses data on 28 immune cell types from patients with various immune diseases and healthy individuals, was utilized.
Gene expression analysisTotal RNA was extracted and isolated from whole blood and renal tissues (renal biopsy samples and paracancerous kidney tissues) using TRIzol Reagent (Life Technologies) following the manufacturer’s protocol. Whole genome RNA sequencing (RNA-seq) was performed with PE150 (Illumina, San Diego, CA, USA) (Qi et al., 2021). LOC100130476 has been characterized as a polyadenylated long non-coding RNA transcript (PMID: 30594489), thereby allowing for its accurate detection and quantification in poly(A)-selected RNA-seq datasets, including those utilized in our analysis. For the whole blood sequencing, the cohort comprised 99 individuals, including 57 LN patients, 18 SLE patients without renal impairment, and 24 healthy controls. In the case of renal tissue sequencing, the sample set included seven lupus nephritis renal biopsy tissues and four adjacent normal renal tissues. This cohort was previously utilized in our earlier studies (Qi et al., 2021; Singh et al., 2024), where RNA-seq data generation and processing were conducted.
Statistical analysisHardy-Weinberg Equilibrium (HWE) was evaluated using a goodness-of-fit chi-square (χ2) test. The P value for HWE in the control population of rs80213143 was 0.654, indicating adherence to HWE. Statistical significance for genetic association analysis was determined using the Chi-square test. Analysis of LOC100130476 expression in whole blood and renal tissues was conducted using independent samples t-tests or one-way ANOVA, as appropriate, with a significance threshold set at p < 0.05. Data analysis was performed using SPSS software (Version 19.0; SPSS Inc., IL).
ResultsAssociation between LOC100130476 gene polymorphisms and SLE susceptibilityLOC100130476 is located on chromosome 6, positions 138,144,807 to 138,189,370, as defined by RefSeq genes. Out of 34 SNPs within LOC100130476 spanning approximately 45 kb and covered by the ImmunoChip, 31 were successfully genotyped (Sun et al., 2016) (Supplementary Table S1). Twenty-three SNPs were found to be associated with SLE susceptibility with p-values <0.05 (Supplementary Table S1).
To adjust for multiple comparisons in our genetic association analysis, we applied the Bonferroni correction method to reduce Type I error, setting a more stringent significance level. Accordingly, a p-value of less than 1.61 × 10−3 was deemed significant after Bonferroni adjustment. Of the 31 SNPs analyzed, 20 were significantly associated with SLE susceptibility following Bonferroni correction (Figure 1).

Genetic association analysis of LOC100130476 polymorphisms with SLE susceptibility. The plot presented p-values on a -log10 scale plotted against chromosomal positions (Mb) on chromosome 6, highlighting the SNPs analyzed. The SNP rs80213143 was emphasized, exhibiting the most significant association with SLE susceptibility within the LOC100130476 locus. A horizontal line marked the Bonferroni-corrected significance threshold at p < 1.61 × 10−3. This visualization was created using LocusZoom.
The strongest association was detected at rs80213143 (p = 2.5 × 10−7, OR 2.63, 95%CI 1.80–3.84) (Sun et al., 2016), which was successfully replicated in an independent cohort from Henan, Central China (p = 2.64 × 10−9, OR 2.32, 95%CI 1.74–3.07). A meta-analysis combining the discovery and replication cohorts underscored this genetic association (pmeta = 2.04 × 10−14, OR 2.42, 95%CI 1.93–3.04) as shown in Figure 1 and Table 1.
SNPMinor alleleDiscovery population (SLE vs. control, 490/493)Replication population (SLE vs. control, 1003/815)Meta analysis (SLE vs. control, 1493/1308)MAF (%)POR (95% CI)MAF (%)POR (95% CI)POR (95% CI)rs80213143C10/4.12.50 × 10−72.63 (1.80–3.84)9.57/4.372.64 × 10−92.32 (1.74–3.07)2.04 × 10−142.42 (1.93–3.04)Association of LOC100130476 polymorphisms with systemic lupus erythematosus susceptibility.
Association between LOC100130476 rs80213143 genotypes and clinical phenotypes in SLE patientsGiven the robust genetic association of rs80213143 with SLE, we next evaluated its relationship with clinical and laboratory phenotypes (Table 2). The results revealed that the C allele, identified as the risk allele, was significantly associated with more severe renal involvement. Specifically, patients carrying the C allele (CC + CG group) exhibited higher 24-h proteinuria levels [median (IQR): 0.69 (0.05–2.12) g vs. 0.60 (0.25–1.30) g, p = 0.014] and higher serum creatinine levels [median (IQR): 64 (48–72) µmol/L vs. 55 (47–66)µmol/L, p = 0.018] compared to the GG group.
Clinical phenotypesCC + CG (n = 176)GG (n = 796)p valueDemographicsAge at onset (years)30.5 ± 11.931.6 ± 13.20.265Male gender (%)5.17.70.236MucocutaneousMalar rash (%)28.424.00.220Discoid rash (%)0.60.80.792Photosensitivity (%)4.54.00.751Oral ulcers (%)9.16.80.284MusculoskeletalNonerosive arthritis (%)27.328.10.816SerositisPleuritis/Pericarditis (%)8.07.90.819Neurologic disorder (%)4.53.30.403Renal24 h proteinuria (g)0.69 (0.05–2.12)0.60 (0.25–1.30)0.014Serum creatinine (umol/L)55 (47–66)64 (48–72)0.018HematologicLeukopenia (%)20.626.30.118Thrombocytopenia (%)19.425.60.087ImmunologicAnti-dsDNA titre (%)66.961.90.247Anti-Sm antibodies (%)17.517.80.923C3 level (g/L)0.70 + 0.360.73 + 0.360.360C4 level (g/L)0.13 + 0.130.14 + 0.130.210Association between LOC100130476 rs80213143 genotypes and clinical phenotypes in SLE patients.
Bold values indicate statistical significance (p < 0.05).
Preliminary findings further suggested that patients with the C allele tended to have earlier disease onset and higher prevalence of mucocutaneous manifestations, neurologic disorders, and serositis. Additionally, a trend toward higher anti-dsDNA positivity rates and lower serum complement levels (C3 and C4) was observed in the CC + CG group. However, these differences were not statistically significant.
Functional annotation of SLE-associated SNPsTo explore the functional relevance of the associated SNPs, we utilized RegulomeDB for annotation of known and predicted regulatory elements. According to RegulomeDB, rs80213143 was assigned a rank of five and a score of 0.005, indicating its location in 833 chromatin state regions and five motif regions (Table 3). We extended our functional annotation to include all SNPs genotyped in the discovery cohort (Table 3). The ranking and scoring system provided different metrics to evaluate the regulatory potential of a variant. Notably, other variants such as rs653520 (rank 1b) and rs73564258 (score 0.70044) exhibited even stronger evidence of regulatory potential.
SNPsRankScoreChIP dataChromatin stateAccessibilityMotifsQTL datars6535201b0.322510683388364rs6571801f0.1954908332723rs6575971f0.2227108331603rs27882881f0.554362383314804rs729807481f0.667032833812rs177798701f0.553241833002rs1118830381f0.554365583346901rs94948831f0.554368833702rs6001441f0.5532463833002rs77538731f0.5543668331104rs77672641f0.5532432833002rs119704111f0.222710833102rs77741011f0.4285811833052rs94948861f0.554361198331102rs596990631f0.5532427833002rs615934131f0.5532439833002rs596930831f0.667031358332612rs735642582b0.7004457833220rs562321063a0.551341833110rs8012677040.609065683351900rs937630340.609064833800rs937630440.60906678337700rs774677950.134540833100rs8021314350.0050833150rs5716317060.3202308330100rs67036970.513920833003rs938953670.184120833000rs691832970.184120833000Functional annotations of SNPs analyzed in LOC100130476 gene with RegulomeDB.
Allelic variation at rs80213143 (G vs. C) is predicted to alter binding affinities for several transcription factors including HAND2, PTF1A, TAL1, TCF3, TWIST1, and ZBTB18 (Figure 2A). Using HaploReg, we examined the regulatory motif alterations in greater detail. The analysis revealed that substitution of the reference G allele with the risk C allele resulted in diminished binding affinities for the Hand1_1 motif (score reduced from 12.4 to 7.6) and Hand1_2 motif (12.7–0.7). Similar reductions were observed for the RP58 motif (from 17.7 to 12.1) and the TAL1_known2 motif (from 11 to −0.9) (Figure 2B). These results suggest that the risk allele may impair transcription factor binding, potentially altering downstream gene expression.

Analysis of the regulatory potential of rs80213143 and its QTL effects. (A) Sequence motifs for transcription factors HAND2, PTF1A, TAL1/TCF3, TWIST1, and ZBTB18 were potentially impacted by the rs80213143 locus as annotated by RegulomeDB. (B) Binding affinity scores for motifs Hand1_1, Hand1_2, RP58, and TAL1_known2 demonstrated reduced binding affinity when the reference G allele was substituted with the C allele. (C) Expression levels of LOC100130476 (also known as AL357060.2 in the Immune Cell Gene Expression Atlas) across various immune cell types showed significant differences between genotypes observed in intermediate monocytes (Int Mono), non-classical monocytes (NC Mono), and neutrophils (Neu), indicating a potential cell type-specific eQTL effect of the rs80213143 locus on LOC100130476 expression. (D) Violin plots represented the normalized expression ratio of the lncRNA RP11-356I2.4 transcript in skin tissue samples from individuals with varying genotypes at rs80213143. The left panel showed data from sun-exposed lower leg skin, and the right panel showed data from non-sun-exposed (suprapubic) skin.
eQTL analysis and cell type-specific effects of rs80213143Subsequently, we focused on investigating the quantitative trait locus (QTL) effects of the rs80213143 variant. SLE is a prototypical autoimmune disease, and we subsequently investigated the impact of rs80213143 on expression across various human immune cell types, according to results from the Immune Cell Gene Expression Atlas database, developed by Ota et al. at the Department of Allergy and Rheumatology, Graduate School of Medicine, the University of Tokyo (Ota et al., 2021) (Figure 2C). Several plots show a notable difference in expression between genotypes. In “Int Mono” (intermediate monocytes), the expression level is significantly different between the genotypes, with a p-value of 4.7e-05. Similarly, “NC Mono” (non-classical monocytes) and “Neu” (neutrophils) show significant eQTL effects with p-values of 0.00042 and 0.0014, respectively, suggesting that the gene expression levels in these cell types are influenced by the genotype at rs80213143. However, most of the other cell types do not show statistically significant differences in gene expression levels across genotypes. This implies that the impact of rs80213143 on the expression of LOC100130476 may be cell type-specific.
Our examination of the GTEx database indicated a significant eQTL effect of rs80213143 on the expression of RP11-356I2.4, a transcript variant of LOC100130476 (Figure 2D). The eQTL effect was observed to be modulated by environmental exposure, particularly to sunlight, which is known to exacerbate the manifestation of SLE. The data revealed that the risk allele C at rs80213143 was associated with lower expression levels of RP11-356I2.4 in skin exposed to sunlight (lower leg) compared to skin not exposed to sunlight (suprapubic area). This finding suggested a potential interaction between the environmental factor of sunlight exposure and the genetic risk allele in the pathogenesis of SLE, with the allele C potentially conferring a greater decrease in expression under sun-exposed conditions.
LOC100130476 expression in SLE and lupus nephritisWe next compared LOC100130476 expression levels in peripheral blood and renal tissues. LOC100130476 expression was significantly higher in SLE patients than in controls (p < 0.001) (Figure 3A). Notably, lupus nephritis patients exhibited an increased level of LOC100130476 compared to both controls (p < 0.001) and SLE patients without renal impairment (p = 0.002) (Figure 3B). However, no significant difference in LOC100130476 expression was observed between controls and SLE patients without renal impairment (p = 0.646) (Figure 3B). Additionally, in renal biopsy samples, LOC100130476 expression was markedly elevated in lupus nephritis patients compared to paracancerous kidney tissue used as control (p = 0.008) (Figure 3C).

Expression analysis of LOC100130476 in SLE patients and controls, with a focus on LN. The expression of LOC100130476 was analyzed in whole blood samples from SLE patients compared to controls (A), and further stratified between SLE patients without renal impairment, lupus nephritis (LN) patients, and controls (B). Additionally, LOC100130476 expression was compared between renal biopsy samples from LN patients and paracancerous kidney tissues serving as controls (C).
Correlation of LOC100130476 with pathogenic pathways in SLE and LNTo further elucidate how LOC100130476 may contribute to the pathogenesis of SLE and LN, we conducted a correlation analysis between differentially expressed genes in SLE and LN and LOC100130476 expression (Figure 4).

Enrichment analysis of pathways correlated with LOC100130476 expression in SLE and LN (A) T Cell Receptor Signaling Pathways: This panel illustrated the enrichment of genes that were positively correlated with LOC100130476 expression in pathways related to the T cell receptor (TCR) complex and signaling, including the positive regulation of TCR signaling and T cell selection. These pathways were crucial for the activation and differentiation of T cells, which played a central role in the immune response in SLE. (B) Antigen Presentation Pathways: The figure depicted the enrichment of pathways involved in antigen presentation, including the folding, assembly, and peptide loading of class I MHC molecules. These processes were essential for the activation of the adaptive immune response and the presentation of antigens to T cells. (C) Interferon Signaling Pathways: This panel showed the enrichment of genes in pathways related to the response to interferons, including both type I and type II interferons. Interferons were critical cytokines involved in the immune response to viral infections and had been implicated in the dysregulation of immune responses in SLE. (D) Apoptosis Pathways: The figure highlighted the enrichment of pathways associated with the apoptotic process, including the intrinsic apoptotic signaling pathway and its regulation by p53 class mediators. Apoptosis was a key process in maintaining immune homeostasis, and its dysregulation could contribute to autoimmune diseases like SLE. (E) Immunoregulation and Leukocyte Activation Pathways: This panel displayed the enrichment of pathways governing immunoregulation, leukocyte activation, and cell adhesion and migration. These pathways were involved in the modulation of immune responses and the recruitment of immune cells to sites of inflammation, which were critical in the pathogenesis of LN. (F) Inflammatory Response Pathways: The figure showed the enrichment of genes in pathways related to the inflammatory response, including the acute inflammatory response and its positive regulation. Inflammation was a hallmark of LN, and the pathways highlighted here contributed to the chronic inflammation observed in this condition. (G) Extracellular Matrix and Tissue Repair Pathways: This panel illustrated the enrichment of pathways involved in the extracellular matrix (ECM) and tissue repair, including ECM organization and cell-matrix adhesion. These processes were important in the structural integrity of tissues, and their dysregulation could lead to tissue fibrosis, a common complication in LN. (H) PI3K Signaling Pathways: The figure depicted the enrichment of pathways related to the PI3K signaling network, which played a role in various cellular processes including cell survival, proliferation, and metabolism. Dysregulation of these pathways could contribute to the pathogenesis of LN.
Upon analyzing transcriptome sequencing data for SLE compared to healthy control whole blood, we identified genes differentially expressed and positively correlated with LOC100130476 expression, significantly enriched in pathways related to cell cycle regulation, DNA damage response, transcription and translation processes, immune response, cellular signaling, and the biogenesis and maintenance of organelles. Conversely, pathways enriched by genes negatively correlated with LOC100130476 predominantly pertained to erythrocyte development and heme metabolism. Given the characteristics of SLE, we noted that the positively associated pathways include T cell receptor (Figure 4A), antigen presentation (Figure 4B), interferon (Figure 4C), and apoptosis (Figure 4D) suggesting a potential role for LOC100130476 in promoting the progression of SLE.
In the analysis of transcriptome sequencing results for lupus nephritis renal biopsy tissues and adjacent normal renal tissues, genes that were differentially expressed and positively correlated with LOC100130476 expression were significantly enriched in pathways governing immunoregulation, cell adhesion and migration, extracellular matrix (ECM) dynamics, signal transduction, inflammation, and phagocytosis. These included leukocyte-mediated immunity (Figure 4E), inflammatory responses (Figure 4F), pathways related to the extracellular matrix and tissue repair (Figure 4G), and the PI3K (Phosphoinositide 3-kinases) signaling network (Figure 4H). Such pathways are recognized contributors to the pathogenesis and progression of LN. Conversely, pathways negatively correlated with LOC100130476 exp
Comments (0)