A disease potential-driven graph attention model for comorbidity risk prediction of hypertension

Zhou, Leming; Qin, Hanshu; Yang, Yanmei; Huang, Gang; Liu, Zhigang

doi:10.3389/fdata.2026.1814157

ORIGINAL RESEARCH article

Front. Big Data, 02 April 2026

Sec. Data Mining and Management

Volume 9 - 2026 | https://doi.org/10.3389/fdata.2026.1814157

A disease potential-driven graph attention model for comorbidity risk prediction of hypertension

LZ
Leming Zhou ¹
HQ
Hanshu Qin ²
YY
Yanmei Yang ³
GH
Gang Huang ⁴
ZL
Zhigang Liu ⁵^*

1. School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, China
2. The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
3. Chongqing University of Traditional Chinese Medicine, Chongqing, China
4. Department of Cardiology, The Third People's Hospital of Chengdu, Chengdu, China
5. School of Computer Science and Technology, Dongguan University of Technology, Dongguan, China

Abstract

Hypertension is associated with an increased risk of serious complications, and the hazards are very serious. However, current methods for predicting comorbidity risks face the challenge that comorbidity prediction relying solely on data driven may lead to clinically implausible associations and reduce model interpretability. Also, how to capture the fusion features of patient and identify differences among them to facilitate risk prediction needs to be addressed. To overcome these challenges, we propose a Disease Potential-Driven Graph Attention (DP-GA) model for comorbidity risk prediction of hypertension, which has 3-fold ideas: (a) Constructing a fusion mechanism for the correlation among the patients' disease features and the structural, thus integrating feature attention and structural attention effectively; (b) Introducing a similarity-difference balance mechanism to further identify the relationships among patients; and (c) Designing a disease potential-driven attention mechanism to calculate the disease potential and construct masks, thus preserving the effective associations from high-risk patients to low-risk patients. Experimental results demonstrate that our proposed DP-GA model achieves a significant improvement in comorbidity risk prediction for patients with hypertension across three comorbidity datasets collected by the research group, compared with both the baseline and state-of-the-art peer methods. We also analyze the comorbidity network to predict the risk of hypertension comorbidity, thereby improving interpretability and early prediction of such comorbidities.

1 Introduction

The World Health Organization (WHO) reports that approximately 1.1 billion adults worldwide are affected by hypertension. Commonly, patients with hypertension often have various comorbidities and complications, e.g., chronic obstructive pulmonary disease (COPD), diabetes mellitus (DM), and coronary heart disease (CHD), which pose significant risks. According to the 2019 Global Burden of Disease Study, the total number of COPD cases was 212.3 million, with 16.2 million new COPD cases reported annually. COPD becomes the third leading cause of death globally, after ischemic heart disease and stroke, with an estimated 3.324 million deaths. COPD not only affects the lungs but also often coexists with other systemic diseases. The comorbidities are not only related to the acute exacerbation of COPD (AECOPD) but also have a heavy medical and economic burden. Heubel et al. (2021) have found that AECOPD is associated with an increased risk of cardiovascular events and may be related to endothelial dysfunction. Common comorbidities related to COPD include cardiovascular disease (CVD), lung cancer, asthma, diabetes, metabolic syndrome, depression, chronic kidney disease, gastrointestinal disease, and anemia, etc. (Gong et al., 2024; Xu and Yew, 2023; Zhang and Tong, 2025; Lu et al., 2021; Goh and Hartman, 2025).

Although hypertension, CHD, DM, and COPD all have relatively mature and systematic diagnostic and treatment strategies, when the two coexist, the different specialties of first-visit physicians can easily lead to missed or misdiagnoses, resulting in delayed treatment. Therefore, clarifying the above diagnoses of comorbidities, early screening and diagnosis, and concurrent treatment of comorbidities are key to diagnosing and treating patients with comorbidities. However, the challenge of hypertension comorbidity lies in the synergistic effects among different diseases, which overlap or mask symptoms, resulting in ambiguous diagnostic clues, data presenting characteristics of high-dimensional sparsity and non-linearity, complicating the prediction of comorbidity risks, fragmenting management, and making it difficult to distinguish different complications, which are highly similar in manifestation and hard to differentiate (Bao et al., 2023; Abess et al., 2023; Wang et al., 2023; Arakelyan et al., 2023; Ni et al., 2023; Wu et al., 2026; He et al., 2025). This makes it difficult to identify comorbidities, increasing the risk of misdiagnosis and missed diagnoses. It not only increases the risks for patients but also exacerbates the contradictions in doctor-patient relationships and the allocation of medical resources. Moreover, a data-driven approach based solely on deep learning makes it difficult for doctors to trust and adopt the prediction results. These challenges urgently require interdisciplinary research in medicine and artificial intelligence to address them and have become a research hotspot in the field of hypertension comorbidity prediction.

Some studies on post-classification prediction, represented by comorbidity graph neural networks have been conducted (Tian et al., 2025; Che and Wang, 2025). For instance, Abuhantash et al. (2024) propose a prediction method for Alzheimer's disease based on graph neural networks and achieved good results. Dong et al. (2022) propose a graph convolutional network, i.e., MorbidGCN, to predict the coexistence of multiple diseases by integrating population phenotypes and disease networks. Combining disease comorbidities to extract drug and disease characteristics can reduce the time and cost of developing new drugs (Luo et al., 2024). The adoption of a pre-embedding learning method based on hypergraphs is helpful for predicting new associations between comorbidity pairs (Biswas et al., 2025). Moreover, to address interpretability issues, causal graph learning-based methods have emerged. For instance, a causal graph learning method based on information-bottleneck constraints is useful for denoising (Yuan et al., 2024). Counterfactual interpretation methods based on causal intervention can reduce false correlations (Shao et al., 2024; Våle et al., 2025). The graph attention encoder method based on causal discovery has been shown to be effective at solving hypothesis-violation problems (Liu et al., 2024).

Some scholars have also studied causal inference in the context of comorbidities. For instance, Li et al. (2025) explore the causal associations between 35 modifiable factors and cardiovascular metabolic polypathy, as well as each individual disease. Zeng et al. (2025) quantify the prevalence of depression among patients with cardiovascular diseases and its impact on mortality through meta-analysis, verifying the causal relationship between cardiovascular diseases and depression. Chen et al. (2023) analyze the association between comorbidity status and the development trajectory of comorbidity and dementia in the elderly population.

Although researchers have provided feasible methods for predicting these severe comorbidities (Fu et al., 2024; Glyde et al., 2024; Yin et al., 2024; Bonomo et al., 2022; Yang et al., 2024; Kang et al., 2024; Pikin et al., 2024; Heid and Green, 2022), the current methods for predicting comorbidity risks encounter the following difficulties: Firstly, due to the complexity of the association of hypertension comorbidities, there are non-linear and multi-factor interaction patterns in the association with other concurrent and comorbid diseases such as DM, CHD, and COPD, making it difficult to capture the characteristics and relationships of patients using traditional methods. Secondly, the traditional attention mechanism often focuses on similar nodes, leading to convergence of learned node representations and loss of individual node characteristics. Accurately modeling these individual differences is a key challenge in personalized prediction. Thirdly, the traditional approach neglects the relationships among disease risks and lacks the logical connections among diseases in clinical practice.

This study aims to identify potential diseases and mechanisms related to the development of comorbidities of hypertension with these severe comorbidities. By analyzing the comorbidity networks of COPD, DM, and CHD, the study focuses on the high-incidence diseases in the case group and their pathological mechanisms. Explore comorbidity intervention strategies based on a common pathological mechanism. To this end, the paper proposes a Disease Potential-Driven Graph Attention (DP-GA) model for hypertension comorbidity risk prediction. The modeling framework is built upon three core mechanisms: (a) integrating feature attention with structural attention to jointly capture feature correlations and graph topology; (b) modeling patient relationships through a similarity-difference balance mechanism that accounts for both similarity and distinction; and (c) estimating disease potentials and constructing masks to preserve clinically meaningful directional associations from high-risk to low-risk patients. These mechanisms collectively enhance representation learning and aware prediction for comorbidity risks.

In general, the paper aims to make the following main contributions:

A unified fusion mechanism which integrates patient's disease features with network structure information. By introducing latent position embeddings to capture network topology and computing structural attention, the model achieves dual fusion of feature and structural attention with adaptive weighting, providing a more comprehensive informational basis for comorbidity risk prediction.
Establish a similarity-difference balance mechanism that models complicated patient relationships. Using Bregman divergence as a difference metric, the mechanism jointly considers similarity and distinction between nodes, avoiding representation homogenization while preserving node-specific attributes through adaptively balanced difference information.
A disease potential-driven attention network that incorporates clinical disease correlation logic. By calculating disease risk potential from patient characteristics and constructing masks based on it, the method simulates directional disease influence and imposes constraints on attention weights, thereby enhancing interpretability and reflecting realistic risk propagation pathways.

Experimental evaluation on three hypertension comorbidity datasets shows that the proposed DP-GA model consistently outperforms existing baseline and state-of-the-art methods in predicting comorbidity risks, demonstrating that our method can identify patients at high risk of hypertension-related complications in advance. These results highlight the model's capability to support early risk identification and clinically interpretable prediction in comorbidity management.

The reminder of this paper is organized as follows. Section 2 states the data material and problem statements. Section 3 presents the DP-GA model. The experimental results are discussed in Section 4. Finally, Section 5 concludes this paper.

2 Data material and problem statements

2.1 Dataset preparation

This retrospective study is based on the electronic medical record (EMR) homepage of anonymous discharged patients from a tertiary hospital from 2019 to 2023. The inclusion and exclusion labels are based on guidelines and indicators. The study was approved for hospital scientific review and passed the ethics review. With ICD-10 disease coding classification based on relevant clinical guidelines and other related literature and expert consultation, we have developed the following inclusion and exclusion criteria: due to our research objective of hypertension combined with COPD, DM and CHD, we excluded the acute infectious disease categories in sections A and B in ICD-10 coding, as well as the tumor diseases in section C coding, to achieve preliminary screening. To demonstrate the effectiveness of our proposed method, three groups of hypertension comorbidities datasets have been designed based on inclusion and exclusion criteria. The case group is defined as a total of hospitalized patients diagnosed with hypertension, and before suffering from chronic obstructive pulmonary disease (COPD, J44), diabetes mellitus (DM) and coronary heart disease (CHD), while the control group consists of patients with only primary hypertension and other complications, excluding COPD, DM and CHD. As can be seen from Table 1, the three datasets, i.e., COPD, DM and CHD, have 630, 1,024, and 1,668 nodes, respectively, and the features all reach hundreds of dimensions. The datasets are binary datasets, with the control group and the case group each accounting for half.

Table 1

Category	COPD numbers	Disease feature	DM numbers	Disease feature	CHD numbers	Disease feature
Control group	315	355	512	435	834	500
Case group	315	355	512	435	834	500
Group	630	355	1,024	435	1,668	500

Basic information of the case group and control group datasets.

2.2 Problem statements

Assume that the patient interaction network is denoted by G_p={V, E, X}, where G_p denotes the patient graph, V={v₁, v₂, …, v_Z} is the node set of Z patient nodes, E denotes the patient edges (connections) based on the same disease, |E| represents the number of connections between the patients, XεR^{Z × F} represents the input node feature matrix of patients, and F represents the dimension of features of each patient node. The adjacency matrix of G_p is denoted by Dε{0, 1}^{Z × Z}. In the following, we use W to denote the weight matrix, H to denote the feature matrix of patient nodes. In this work, we adopt classification to predict the probability that a patient has a specific disease, such as DM, CHD, or COPD. C_L denotes the number of categories in the classification, and the prediction of hypertension comorbidity refers to the predicted outcome of the disease based on the learned patient features. We denote the set consisting of their directly adjacent nodes as . When a patient's outcome is diagnosed as the target disease, the label value is 1; otherwise, it is 0.

3 Methods

3.1 Overview of the proposed DP-GA model

DP-GA is a graph neural network framework specifically designed for predicting comorbidity, and it has constructed an end-to-end process for comorbidity risk prediction. In the process of comorbidity risk prediction, the first step is to construct a dual attention mechanism combining feature attention and structural attention to calculate the basic attention. At the same time, it integrates node position embedding information to capture the positional information between patients, and through latent position learning, it captures the positional changes of patient nodes in the latent space. The second step involves the application of Bregman divergence to capture the differences between nodes, and the calculation of the difference metric is utilized in the subsequent adjustment of attention weights to implement the mechanism of similar-differentiated information transmission. The third step involves introducing the disease correlation logic from clinical practice and proposing a disease-potential-driven attention mechanism. By using masking to simulate the reasonable direction of disease transmission, this mechanism imposes constraints on the previously obtained attention weights, thereby effectively enhancing the accuracy and interpretability of comorbidity prediction. The DP-GA framework is illustrated in Figure 1.

Figure 1

3.2 Fusion attention mechanism

To overcome the limitations of information utilization, a dual-attention mechanism combining feature and structural attention was developed (He et al., 2021, 2024). At the same time, the similarity of patients' features and their structures was calculated to more accurately measure the influence among patients.

Generally, we calculate the attention score between two patients i and j as follows:

where denotes the patient's base attention, capturing the explicit risk factors of historical diseases. a_i and a_j denote the attention distributions for the disease features of patients i and j, respectively, h_i and h_j are the original disease feature vectors of patients i and j in Equation 1. W is a weight matrix that maps patients' high-dimensional disease features.

However, the attention score defined above relies solely on transformed node features, without incorporating the topological positions of the nodes within the graph. To construct a more comprehensive attention mechanism, it is necessary to integrate structural information into the base attention formulation. To address the underutilization of topological information in comorbidity prediction, we learn node position representations in a latent space, thereby uncovering the underlying structural relationships among patient nodes. This allows the model to leverage both feature-based and structure-based information for more accurate comorbidity inference.

To capture positional relationships among patient nodes, we further introduce a structural attention mechanism that models implicit structural associations among patients. The structural attention score is calculated as follows:

where denotes the structural attention score between patients i and j, serving as a quantitative measure of the structural similarity between the two nodes, a_s is the positional attention vector, s_i and s_j denote the structural feature vectors of the respective patient nodes, which are derived by learning latent position embeddings that encode the topological structure of the patient network in Equation 2.

To enable the model to jointly leverage both nodal feature information and graph structural information, we integrate the learned positional representations into the attention mechanism. This enhances the model's capacity to reason about comorbidities by combining explicit feature affinities with implicit structural proximities. The final integrated attention score in Equation 3 is obtained by fusing the feature-based and structure-based components:

3.3 Difference perception mechanism

To overcome the limitations of conventional attention mechanisms, which primarily focus on node similarity, we propose a dual-perspective framework that models both node similarities and differences. This approach enables more effective information propagation and enhances graph representation learning by adaptively strengthening or attenuating messages based on node relations.

3.3.1 Bregman divergence for node difference

To quantify the dissimilarity between patients in both feature and structural spaces, we employ Bregman divergence as a principled measure of difference (He et al., 2021, 2024). This allows the model to support personalized comorbidity prediction by explicitly accounting for patient heterogeneity. The composite Bregman divergence between node and node is defined as

where d_ij denotes the integrated difference measure from node i to node j in Equation 4. and represent the Bregman divergence in the feature space and structural space, respectively, capturing the respective discrepancies. ωε[0,1] is a trainable weighting factor that balances the contributions of feature-level and structure-level differences.

3.3.2 Similarity-difference guided message passing

The Bregman divergence provides a differentiable difference metric that is subsequently used to modulate attention weights (He et al., 2021, 2024). This leads to a similarity-difference aware message-passing scheme, where propagation is enhanced between similar nodes and suppressed between dissimilar ones. Let the node features and position embeddings at layer be transformed as:

H^(l) denote the feature matrix of patient nodes, P^(l) denote the latent positions, W^(l) to denote the weight matrix in Equation 5. We then construct a joint relation matrix R that incorporates both similarity and difference:

where C and O are learnable matrices that compute feature-structure similarity and difference, respectively; β is a scaling coefficient, and τ is a temperature hyperparameter in Equation 6. D_ij denotes the adjacency matrix of G_p.

Messages are aggregated as:

By the combination of the joint effect of C and O, a matrix M is constructed in Equation 7.

Finally, the updated node representation combines self-information and aggregated messages:

where μ is a learnable positive irrational, H_i represents the feature vector of node i in Equation 8. The resulting propagation mechanism strengthens information flow between similar nodes while reducing influence from dissimilar nodes. The attention score is accordingly adjusted by the difference measure:

where δ is a learnable scaling parameter. The final attention weight is obtained by normalizing over all neighbors in Equation 9.

This unified similarity-difference framework enables the model to dynamically balance homophily and heterophily in the patient graph, leading to more expressive and discriminative representations for comorbidity prediction.

3.4 Disease potential-driven attention

While the previously described modules are primarily data-driven, relying solely on statistical correlations may lead to clinically implausible associations and reduce model interpretability. To ground the model in established clinical knowledge, we introduce the disease correlation logic in clinical practice, designed to quantify the influence intensity of disease status by calculating the disease potential of patient nodes, and propose a disease potential-driven attention mechanism. This component calculates the disease potential based on the patient's disease-related features, and then constructs masks based on the disease potential to impose constraints on the attention weights, thereby learning node representations that conform to disease patterns.

3.4.1 Disease potential calculation

To assess the severity or progression risk of a target disease for each patient, we compute a disease potential score based on clinically relevant nodal features. This potential represents the propensity or risk level of a patient node for the disease under consideration. The potential for patient is obtained by calculating the difference in potential between the source node and the target node, and a mask based on the direction of disease transmission is created to influence the calculation of attention weights, i.e.,

p_i represents the risk or probability of the target disease occurring in patient i, q is an index of disease-related features. X_iq represents the value of the i-th sample at the q-th feature, σ(·) is the sigmoid function, which converts the linear combination result into a probability value in Equation 10. is the set of disease features.

3.4.2 Mask construction

In order to restrict the direction of information flow and only allow nodes with high disease potential to transfer information to nodes with low disease potential, we construct a binary causal mask. This mask permits attention flow only from nodes with strictly higher disease potential to those with lower or equal potential, thereby encoding an asymmetric, risk-informed constraint. For a directed edge from source node i to target node j, the mask coefficient is defined as:

Here, ϵ is a small positive constant (e.g., 0.1) applied to connections that do not satisfy the direction in Equation 11, effectively suppressing—rather than completely eliminating—the information flow. This ensures numerical stability while strongly biasing the model toward clinically plausible pathways.

3.4.3 Integration with attention mechanism

The mask is integrated into the attention mechanism via element-wise multiplication with the unconstrained attention weights. This constrained attention weights that align with disease progression logic, i.e.,

where is the adjusted attention score from Equation 9. We denote the set consisting of their directly adjacent nodes as in Equation 12. The resulting weights are then used to perform message passing and feature aggregation in Equation 13, i.e.,

This design ensures that the node representations are updated primarily along directions consistent with clinical risk gradients, thereby enhancing the model's clinical validity and robustness against spurious correlations.

3.5 Loss function

To train the proposed model end-to-end, we formulate a composite loss function that jointly optimizes the accuracy of comorbidity prediction and the preservation of inherent patient graph structure. This ensures the model not only performs accurate classification but also learns representations that are topologically meaningful.

3.5.1 Classification loss

The aggregated node representations, refined through the proposed attention mechanisms, are passed through a final classification layer, e.g., a fully connected network with Softmax activation, to generate the predicted probability ŷ_i of comorbidity for patient i in Equation 14. The primary objective is optimized using the binary cross-entropy loss, which measures the discrepancy between predictions and ground-truth labels:

where Z is the total number of patient samples, y_i∈{0, 1} is the ground-truth comorbidity label, and ŷ_i∈(0, 1) is the predicted probability. This loss drives the model to correctly identify patients at risk of comorbidities.

3.5.2 Position preservation loss

To ensure that the learned latent position embeddings h^p faithfully reflect the topological structure of the patient graph (He et al., 2021, 2024), we introduce a position preservation loss. This is implemented as a graph Laplacian regularization term, which constrains neighboring nodes in the graph to have similar position embeddings in the latent space. The loss is defined as:

where and are the position embeddings of nodes i and j, is the set of edges in the graph, w_ij is the weight of the edge connecting them in Equation 15. Minimizing L_p enforces smoothness in the embedding space with respect to the original graph connectivity, thereby explicitly preserving structural information that is crucial for robust relational inference.

3.5.3 Total loss

The final training objective is a weighted sum of the classification loss and the position preservation loss in Equation 16, i.e.,

The hyperparameter λ ≥ 0 controls the trade-off between predictive accuracy and structural fidelity. By jointly optimizing L_total, the model learns representations that are simultaneously discriminative for the downstream prediction task and structurally coherent, enhancing both performance and interpretability in the clinical application context.

3.6 Comorbidity networks analysis

To explain the reasons for the differences in the risk prediction of comorbidity among patients from the perspective of comorbidity, we also analyzed the correlation of comorbidity networks. It can be expressed as follows:

where N_uv is the number of patients affected by the same diseases, Z is the total number of patients, Y_u and Y_v are the prevalences of diseases u and v in Equation 17, respectively (Hidalgo et al., 2009).

4 Experiments

4.1 General settings

We divided the dataset into training set, validation set and test set in the ratio of 6:2:2. The hyperparameters λ, η, hid, drop, and heads represent the regularization coefficient, learning rate, hidden layer dimension, dropout rate, and the number of heads, respectively, β and μ are the learnable parameters of the model, and no manual parameter tuning is required. The specific settings are as follows:

On the COPD dataset, the values of λ, η, hid, drop, and heads are 1, 0.0005, 64, 0.3, and 4, respectively. On the DM dataset, the values of λ, η, hid, drop, and heads are 1, 0.0005, 64, 0.3, and 4. On the CHD dataset, the values of λ, η, hid, drop, and heads are 1, 0.001, 64, 0.3, and 4. For each baseline model in the three datasets, we conducted five runs to obtain the average performance, with each run lasting for 1,000 rounds. The experiments have been conducted on a PC with a 3.20 GHz i9 CPU and 32 GB RAM. All the models are implemented in Anaconda 3.8. To demonstrate the effectiveness of our proposed DF-GA model, three groups of hypertension comorbidities datasets have been designed based on inclusion and exclusion criteria, as summarized in Table 1. Their details have been provided in Section 2.1.

We adopt several baselines and state-of-the-art GNN models, including APPNP (Klicpera et al., 2019), DGI (Veličković et al., 2019), GCN (Kipf and Welling, 2017), JKNET (Xu et al., 2018), SGC (Wu et al., 2019), and GraphSAGE (Hamilton et al., 2017) as peer methods to evaluate the effectiveness of the proposed DP-GA model by comparing it with them for comorbidity risk prediction.

4.2 Evaluation metrics

To evaluate the classification prediction performance of the model, both accuracy (ACC) and F1 score (F1) are used to evaluate the co-morbidity prediction performance. ACC is a commonly used metric for evaluating classification predictions, representing the proportion of correctly classified samples out of the total sample count. The formula is as follows:

Equation 18 refers to the proportion of correctly predicted samples among all samples. Here, XP represents true positive samples, XN represents true negative samples, EP represents false positive samples, and EN represents false negative samples.

Since our model is designed for classifying and predicting patient comorbidities, the F1 evaluation metric is also suitable for the patient classification task. Therefore, we employed the F1 metric simultaneously as follows:

Pre represents precision, Rec represents recall rate in Equation 19. First, we calculate the average of precision and recall rate, and then use the formula to obtain F1.

4.3 Comparison results and analysis

To evaluate the efficacy of the proposed method, we conducted a series of comparative experiments. A patient comorbidity network was constructed based on disease co-occurrence relationships. As shown in Table 1, the historical disease vectors of patients prior to the onset of COPD, DM, and CHD were utilized as node features. The patient nodes were classified using our DP-GA model alongside six representative graph neural network baseline models. Experimental results are summarized in Table 2. The results demonstrate that our DP-GA model achieves superior performance across all evaluation metrics. This improvement can be attributed to the model's ability to effectively integrate nodal features, topological structure, and clinically-informed constraints, thereby learning more discriminative and robust node embeddings from the graph-structured data.

Table 2

Methods	COPD		DM		CHD
	ACC	F1	ACC	F1	ACC	F1
DP-GA (our)	0.8873 ±0.0154	0.8872 ±0.0155	0.7210 ±0.0078	0.7165 ±0.0053	0.7976 ±0.0084	0.7975 ±0.0083
APPNP	0.8683 ± 0.0268	0.8680 ± 0.0266	0.6864 ± 0.0376	0.6766 ± 0.0366	0.7546 ± 0.0330	0.7519 ± 0.0346
DGI	0.8302 ± 0.0357	0.8294 ± 0.0356	0.6874 ± 0.0270	0.6871 ± 0.0271	0.7534 ± 0.0254	0.7527 ± 0.0250
GCN	0.8397 ± 0.0380	0.8379 ± 0.0385	0.6981 ± 0.0193	0.6971 ± 0.0203	0.7063 ± 0.1211	0.6762 ± 0.1801
JKNET	0.8349 ± 0.0184	0.8317 ± 0.0187	0.6816 ± 0.0193	0.6794 ± 0.0193	0.7600 ± 0.0301	0.7597 ± 0.0300
SGC	0.8698 ± 0.0381	0.8695 ± 0.0380	0.6748 ± 0.0162	0.6742 ± 0.0161	0.7755 ± 0.0158	0.7753 ± 0.0156
GraphSAGE	0.8444 ± 0.0278	0.8425 ± 0.0275	0.6864 ± 0.0237	0.6857 ± 0.0234	0.7666 ± 0.0191	0.7661 ± 0.0187

Performance comparison results of our proposed DP-GA model and the compared GNN models across ACC and F1 metrics.

The bold values indicate that our proposed model achieves the best classification evaluation results.

As shown in Table 2, on the COPD dataset, DP-GA achieves the highest ACC of 88.73%, outperforming APPNP, DGI, GCN, JKNET, SGC, and GraphSAGE by 2.19%, 6.88%, 4.76%, 3.26%, 2.01%, and 5.08%, respectively. Similarly, in F1-score, it leads by margins of 2.21%, 6.97%, 5.88%, 6.67%, 2.04%, and 5.31% over the same baselines. This robust lead, particularly over strong feature-smoothing models like SGC (ACC: 86.98%), validates the advantage of our model's integrated structural and feature-based attention.

For the DM dataset, our model attains a top ACC of 72.10%, representing improvements of 5.04%, 4.89%, 3.28%, 5.78%, 6.85%, and 5.04% over the baselines. In F1-score, the corresponding improvements are 5.90%, 4.28%, 2.78%, 5.46%, 6.27%, and 4.49%. The significant gain over all models, highlights the effectiveness of the Bregman divergence module in capturing patient heterogeneity, which is crucial for modeling complex metabolic conditions like diabetes.

The most pronounced improvement is observed on the CHD dataset. Here, DP-GA's ACC of 79.76% exceeds that of the baselines by 5.70% (APPNP), 5.87% (DGI), 12.93% (GCN), 4.95% (JKNET), 2.85% (SGC), and 4.04% (GraphSAGE). The F1-score shows a similar trend. The exceptionally large margin over GCN (over 12%) underscores a critical finding: standard message-passing neural networks are prone to learning spurious correlations in clinical graphs. In contrast, DP-GA's disease potential-driven attention successfully constrains information flow to clinically plausible pathways, which is paramount for reliable prediction in cardiovascular etiology.

In summary, the systematic outperformance of DP-GA across all tasks and metrics confirms the efficacy of its core innovations: the fusion of structural position encoding, the dual-perspective modeling via Bregman divergence, and the incorporation of clinical logic. The model demonstrates not only higher accuracy but also greater robustness, as indicated by consistently lower standard deviations, establishing a new state-of-the-art for patient-centric comorbidity prediction.

4.4 Visualization results and analysis

To further validate the discriminative capability of the learned representations, we employed t-SNE to visualize the node embeddings in a two-dimensional space. Figure 2 presents a comparative visualization of the raw feature distributions against the embeddings generated by our DP-GA model.

Figure 2

Observations from the t-SNE plots reveal a significant qualitative improvement. In the visualization of the original high-dimensional features as depicted in Figures 2a, c, e, nodes belonging to the two target classes—patients with and without the target comorbidity—exhibit substantial overlap with no clear separation boundary. This indicates that the raw clinical features, while informative, are not readily separable for the downstream prediction task when viewed through a standard non-linear dimensionality reduction technique.

In contrast, the visualization of the embeddings produced by DP-GA depicted in Figures 2b, d, f, shows a markedly distinct pattern. The two patient cohorts form more compact and well-separated clusters in the latent space. This clear structural divergence between the classes provides direct visual evidence that our model successfully projects patients into an embedding space where the semantic information relevant to comorbidity risk is effectively encoded and amplified.

4.5 Findings of comorbidity network and pathological mechanism

The above classification of comorbidities predicts which comorbidities a patient may develop in the future. However, clinical decision-making also requires an understanding of the mechanistic explanations for the reason of this patient is prone to these comorbidities. Analyzing the comorbidity patterns and exploring the pathological mechanisms can help identify the pathways between diseases, which is beneficial for identifying key risk factors to determine the priority intervention path, achieving intervention and etiological treatment. To this end, a case group of patients with hypertension combined with COPD, DM and CHD and a control group of patients with hypertension were included in the analysis model. This study included patients with primary hypertension as the control group and patients with primary hypertension the ending complicated with COPD, DM and CHD as the case group in Tables 3–5 and Figure 3.

Table 3

Case group							Control group
^*V1	V2	V1 degree	V2 degree	Phi	t-value	Number	^*V1	V2	V1 degree	V2 degree	Phi	t-value	Number
C34	C77	16	7	0.5441	2.4265	6	E83	I97	8	8	0.7428	2.7179	6
I11	I25	49	71	0.2873	2.4915	30	E83	N18	8	18	0.6546	3.4637	8
I11	I50	49	38	0.3348	2.4358	21	H25	H35	54	32	0.3393	2.6010	18
I11	I51	49	26	0.3237	2.3458	16	H25	H40	54	11	0.2754	2.0657	8
I13	I51	7	26	0.4909	2.7603	7	H34	H35	19	32	0.5735	3.8348	15
I20	I25	18	71	0.3804	3.4169	17	I97	N18	8	18	0.6546	3.4637	8
I48	I50	18	38	0.4664	3.1639	14	K76	N28	36	33	0.4571	2.9966	18

Description of the comorbidity network in the case and control group of COPD.

^*V denotes disease vertex.

Figure 3

As shown in Table 3 and Figures 3a, b, the case group has more disease nodes than the control group significantly, indicates that patients with COPD ending has multiple diseases, such as diabetes, coronary heart disease, heart failure, etc., which are closely related to COPD ending. This is consistent with the current literature (Lipton, 2018; Atkinson et al., 2023; Summers et al., 2024; Romano et al., 2023). However, if the control group has been controlled, reducing the complications of endocrine and cardiovascular diseases also helps to block the key nodes of disease progression to more serious diseases, thereby achieving the goal of preventing COPD and other more serious diseases.

As shown in Table 4 and Figures 3c, d, the case group has more disease nodes than the control group significantly, indicates that patients with DM ending has multiple diseases, such as coronary heart disease, heart failure, etc., which are closely related to the DM outcome. The disease nodes in the case group are more closely connected, and there are complex interactions among different disease nodes, especially secondary hypertension, heart failure, atrial fibrillation and flutter, complications of heart disease, etc. This is consistent with the current literature (Lipton, 2018; Atkinson et al., 2023; Summers et al., 2024; Romano et al., 2023; Wang et al., 2021; Cai, 2025; Yu et al., 2025; An et al., 2024; Recenti et al., 2021; Tang et al., 2022; Zhou et al., 2023).

Table 4

Case group							Control group
^*V1	V2	V1 degree	V2 degree	Phi	t-value	Number	^*V1	V2	V1 degree	V2 degree	Phi	t-value	Number
E21	N18	7	41	0.3990	2.7178	7	E83	N18	14	30	0.6211	4.1932	13
E83	N18	13	41	0.5471	4.0813	13	H25	H35	82	60	0.3542	3.3873	31
I11	I25	125	176	0.2588	3.5337	70	H34	H35	28	60	0.5266	4.7179	23
I11	I48	125	58	0.2559	2.9361	32	I20	I25	22	112	0.3771	4.2708	21
I11	I50	125	103	0.3840	4.6118	59	I97	N18	10	30	0.5657	3.6304	10
I11	I51	125	99	0.6197	8.7573	78	K76	N28	70	57	0.3291	2.8738	26

Description of the comorbidity network in the case and control group of DM.

^*V denotes disease vertex.

As shown in Table 5 and Figures 3e, f, we found type 2 diabetes, secondary hypertension and chronic kidney disease are the diseases with the highest prevalence in the case group. Early detection and prevention of these highly prevalent disease nodes can to a certain extent prevent patients from progressing to CHD (Wang et al., 2021; Cai, 2025; Yu et al., 2025; An et al., 2024; Recenti et al., 2021; Tang et al., 2022; Zhou et al., 2023).

Table 5

Case group							Control group
^*V1	V2	V1 degree	V2 degree	Phi	t-value	Number	^*V1	V2	V1 degree	V2 degree	Phi	t-value	Number
E11	H43	382	23	0.1391	2.7381	20	E83	N18	18	42	0.6072	4.8336	17
E11	I11	382	149	0.1429	2.8154	91	H25	H35	140	114	0.2602	3.1659	47
E11	N18	382	128	0.1761	3.4868	85	H25	H40	140	28	0.2547	3.0939	19
E21	N18	22	128	0.3243	3.8482	19	H25	H43	140	20	0.2232	2.6896	14
E83	I97	37	26	0.4305	2.8214	14	H34	H35	52	114	0.5181	6.4111	43
E83	N18	37	128	0.4737	6.0378	35	I97	N18	13	42	0.5464	4.1265	13
H25	H35	122	86	0.3729	4.4019	46	K76	N28	95	101	0.3759	4.0363	44

Hypertension comorbidity network in the case and control group of CHD.

^*V denotes disease vertex.

5 Conclusions

In this work, we proposed DP-GA, a novel graph neural network framework for predicting hypertension comorbidity risk. The core innovation lies in integrating a disease potential-driven attention mechanism with structured similarity-difference learning, enabling the model to capture the topological relationships within patient networks. This approach effectively addresses the challenge of early comorbidity risk screening by moving beyond correlation-based learning to model plausible pathogenic pathways.

Our experimental results on three major comorbidity prediction tasks (e.g., COPD, DM, and CHD) demonstrate that the proposed framework consistently and significantly outperforms existing graph learning baselines. However, the co-morbidity networks constructed based on the method of disease phenotype co-occurrence ignore the indirect associations between diseases, and can only capture the dominant co-morbidity patterns, which has certain limitations.

In our future work, there are still the following issues to be studied: (a) clustering patients within a inference-informed graph representation to uncover more precise comorbidity patterns and improve clustering accuracy; (b) developing a heterogeneous graph model that incorporates patient-specific attributes to better understand comorbidity distribution across subpopulations and support personalized healthcare; and (c) integrating multi-modal clinical data—such as laboratory results and imaging features—within a latent representation framework to enhance predictive performance and facilitate the discovery of novel biomarkers.

Statements

Data availability statement

The datasets presented in this article are not readily available because medical data is classified as private information and requires strict protection. It should be used in accordance with legal regulations. Requests to access the datasets should be directed to 203861@hospital.cqmu.edu.cn.

Ethics statement

The studies involving humans were approved by the Medical Research Ethics Review Committee of the First Affiliated Hospital of Chongqing Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants' legal guardians/next of kin because this study is a retrospective study. The requirement of obtaining an informed consent form is waived, which complies with ethical standards.

Author contributions

LZ: Conceptualization, Data curation, Formal analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. HQ: Data curation, Formal analysis, Writing – review & editing. YY: Supervision, Validation, Writing – review & editing. GH: Conceptualization, Validation, Writing – review & editing. ZL: Conceptualization, Project administration, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was partly funded by Guangdong Basic and Applied Basic Research Foundation under the Grant 2023A1515110689, partly funded by Chongqing Technological Innovation and Application Development (Major Project) in 2025 under the Grant CSTB2025TIADSTX0029, partly funded by Sichuan Science and Technology Program under the Grant 2024YFFK0282, and partly funded by Sichuan Health Commission under the Grant 23LCYJ044.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1
AbessA. T.DeinerS. G.BriggsA.WhitlockE. L.CharetteK. E.ChowV. W.et al. (2023). Association of neurocognitive disorders with morbidity and mortality in older adults undergoing major surgery in the USA: a retrospective, population-based, cohort study. Lancet Healthy Longev. 4, e608–e617. doi: 10.1016/S2666-7568(23)00194-0
2
AbuhantashF.Abu HantashM. K.AlShehhiA. (2024). Comorbidity-based framework for Alzheimer's disease classification using graph neural networks. Sci. Rep. 14;21061. doi: 10.1038/s41598-024-72321-2
3
AnT. J.LimJ.LeeH.JiS.JungH. W.BaekJ. Y.et al. (2024). Breathlessness, frailty, and sarcopenia in older adults. Chest166, 1476–1486. doi: 10.1016/j.chest.2024.07.180
4
ArakelyanS.Mikula-NobleN.HoL.LoneN.AnandA.LyallM. J.et al. (2023). Effectiveness of holistic assessment-based interventions for adults with multiple long-term conditions and frailty: an umbrella review of systematic reviews. Lancet Healthy Longev. 4, e629–e644. doi: 10.1016/S2666-7568(23)00190-3
5
AtkinsonA.EllenbergerB.PiezziV.KasparT.Salazar-VizcayaL.EndrichO.et al. (2023). Extending outbreak investigation with machine learning and graph theory: benefits of new tools with application to a nosocomial outbreak of a multidrug-resistant organism. Infect. Control Hosp. Epidemiol. 44, 246–252. doi: 10.1017/ice.2022.66
6
BaoY.LuP.WangM.ZhangX.SongA.GuX.et al. (2023). Exploring multimorbidity profiles in middle-aged inpatients: a network-based comparative study of China and the United Kingdom. BMC Med. 21:495. doi: 10.1186/s12916-023-03204-y
7
BiswasS.RanjanV.MitraP.RaoK. S. (2025). Self supervised prediction of genetic associations in comorbid diseases with masked autoencoder using hypergraph representations. IEEE Trans. Comput. Biol. Bioinform. 22, 628–639. doi: 10.1109/TCBBIO.2025.3526805
8
BonomoM.HermsenM. G.KaskovichS.HemmrichM. J.RojasJ. C.CareyK. A.et al. (2022). Using machine learning to predict likelihood and cause of readmission after hospitalization for chronic obstructive pulmonary disease exacerbation. Int. J. Chron. Obstruct. Pulmon. Dis. 17, 2701–2709. doi: 10.2147/COPD.S379700
9
CaiW. (2025). DeepSeek AI: transforming medical AI with cost-efficiency, transparency, and privacy preservation. Intell. Oncol. 1, 172–175. doi: 10.1016/j.intonc.2025.03.006
- CrossRef
- Google Scholar
10
CheY.WangY. (2025). Prediction of multimorbidity network evolution in middle-aged and elderly population based on CE-GCN. Interdiscip. Sci. 17, 424–436. doi: 10.1007/s12539-024-00685-0
11
ChenH.ZhouY.HuangL.XuX.YuanC. (2023). Multimorbidity burden and developmental trajectory in relation to later-life dementia: a prospective study. Alzheimer's Demen. 19, 2024–2033. doi: 10.1002/alz.12840
12
DongG. Zhang Z. C FengJ.Zhao XM. (2022). MorbidGCN: prediction of multimorbidity with a graph convolutional network based on integration of population phenotypes and disease network. Brief Bioinform. 23:bbac255. doi: 10.1093/bib/bbac255
13
FuY.LiuY.ZhongC.HeidariA. A.LiuL.YuS.et al. (2024). An enhanced machine learning-based prognostic prediction model for patients with AECOPD on invasive mechanical ventilation. iScience27:111230. doi: 10.1016/j.isci.2024.111230
14
GlydeH. M. G.MorganC.WilkinsonT. M. A.NabneyI. T.DoddJ. W. (2024). Remote patient monitoring and machine learning in acute exacerbations of chronic obstructive pulmonary disease: dual systematic literature review and narrative synthesis. J. Med. Internet Res. 26:e52143. doi: 10.2196/52143
15
GohS. S. N.HartmanM. (2025). From promise to practice: harnessing artificial intelligence for breast cancer screening. Intell. Oncol.1, 4–6. doi: 10.1016/j.intonc.2024.11.001
- CrossRef
- Google Scholar
16
GongY.DuF.YaoY.WangH.WangX.XiongW.et al. (2024). Clinical characteristics of overweight patients with acute exacerbation chronic obstructive pulmonary disease (AECOPD). Clin. Respir. J. 18:e70001. doi: 10.1111/crj.70001
17
HamiltonW. L.YingR.LeskovecJ. (2017). “Inductive representation learning on large graphs,” in 2017 International Conference on Neural Information Processing Systems (NIPS) (Red Hook, NY: Curran Associates, Inc.), 1025–1035. doi: 10.5555/3294771.3294869
- CrossRef
- Google Scholar
18
HeM.ZengC.WangN.YangC. (2025). ADAPTIVE motion-state estimation and feature reuse for intermittent dynamics in visual SLAM. Artif. Intell. Sci. Eng. 1, 278–293. doi: 10.23919/AISE.2025.000019
- CrossRef
- Google Scholar
19
HeT.LiuY.OngY.-S.WuX.LuoX. (2024). Polarized message-passing in graph neural networks. Artif. Intell. 331:104129. doi: 10.1016/j.artint.2024.104129
- CrossRef
- Google Scholar
20
HeT.OngY. S.BaiL. (2021). Learning conjoint attentions for graph neural nets. Adv. Neural Inf. Process. Syst.34, 2641–2653. doi: 10.48550/arXiv.2102.03147
- CrossRef
- Google Scholar
21
HeidE.GreenW. H. (2022). Machine learning of reaction properties via learned representations of the condensed graph of reaction. J. Chem. Inf. Model. 62, 2101–2110. doi: 10.1021/acs.jcim.1c00975
22
HeubelA. D.KabbachE. Z.SchafauserN. S.PhillipsS. A.Pires Di LorenzoV. A.Borghi SilvaA.et al. (2021). Noninvasive ventilation acutely improves endothelial function in exacerbated COPD patients. Respir. Med. 181:106389. doi: 10.1016/j.rmed.2021.106389
23
HidalgoC. A.BlummN.BarabásiA. L.ChristakisN. A. (2009). A dynamic network approach for the study of human phenotypes. PLoS Comput. Biol. 5:e1000353. doi: 10.1371/journal.pcbi.1000353
24
KangH. Y. J.KoM.RyuK. S. (2024). Prediction model for survival of younger patients with breast cancer using the breast cancer public staging database. Sci. Rep. 14:25723. doi: 10.1038/s41598-024-76331-y
25
KipfT. N.WellingM. (2017). “Semi-supervised classification with graph convolutional networks,” in 2017 International Conference on Learning Representations (ICLR) [Toulon: International Conference on Learning Representations (ICLR)], 1–14. doi: 10.48550/arXiv.1609.02907
- CrossRef
- Google Scholar
26
KlicperaJ.BojchevskiA.GünnemannS. (2019). “Predict then propagate: graph neural networks meet personalized PageRank,” in 2019 International Conference on Learning Representations (ICLR) [New Orleans, LA: International Conference on Learning Representations (ICLR)], 1–15. doi: 10.48550/arXiv.1810.05997
- CrossRef
- Google Scholar
27
LiD.LinJ.YangH.ZhouL.LiY.XuZ.et al. (2025). Causal association of modifiable factors with cardiometabolic multimorbidity: an exposome-wide Mendelian randomization investigation. Cardiovasc. Diabetol. 24:241. doi: 10.1186/s12933-025-02790-w
28
LiptonZ. C. (2018). The mythos of model interpretability: in machine learning, the concept of interpretability is both important and slippery. Queue16, 31–57. doi: 10.1145/3236386.3241340
- CrossRef
- Google Scholar
29
LiuS.FengY.WuK.ChengG.HuangJ.LiuZ. (2024). Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization. IEEE Trans. Cybern. 53, 2311–2324. doi: 10.1109/TCYB.2021.3116762
30
LuJ.WangY.HouL.ZuoZ.ZhangN.WeiA. (2021). Multimorbidity patterns in old adults and their associated multi-layered factors: a cross-sectional study. BMC Geriatr. 21:372. doi: 10.1186/s12877-021-02292-w
31
LuoY.ShanW.PengL.LuoL.DingP.LiangW. (2024). A computational framework for predicting novel drug indications using graph convolutional network with contrastive learning. IEEE J. Biomed. Health Inform. 28, 4503–4511. doi: 10.1109/JBHI.2024.3387937
32
NiY.ZhouY.KivimäkiM.CaiY.Carrillo-LarcoR. M.XuX.et al. (2023). Socioeconomic inequalities in physical, psychological, and cognitive multimorbidity in middle-aged and older adults in 33 countries: a cross-sectional study. Lancet Healthy Longev. 4, e618–e628. doi: 10.1016/S2666-7568(23)00195-2
33
PikinO.RyabovA.AleksandrovO.ToneevE.LarionovD.GarifullinA.et al. (2024). Predictive model for intraoperative decision-making in video-assisted thoracoscopic lobectomy: optimizing chest drain placement for high-risk patients. J. Thorac. Dis.16, 5909–5922. doi: 10.21037/jtd-24-617
34
RecentiM.RicciardiC.EdmundsK. J.GislasonM. K.SigurdssonS.CarraroU.et al. (2021). Healthy aging within an image: using muscle radiodensitometry and lifestyle factors to predict diabetes and hypertension. J. Biomed. Health Inform. 25, 2103–2112. doi: 10.1109/JBHI.2020.3044158
35
RomanoJ. D.MeiL.SennJ.MooreJ. H.MortensenH. M. (2023). Exploring genetic influences on adverse outcome pathways using heuristic simulation and graph data science. Comput Toxicol. 25:100261. doi: 10.1016/j.comtox.2023.100261
36
ShaoX.WangH.ZhuX.ZhangY.ChenX. (2024). CUBE: causal intervention-based counterfactual explanation for prediction models. IEEE Trans. Automat. Contr. 36:14. doi: 10.1109/TKDE.2023.3322126
- CrossRef
- Google Scholar
37
SummersK. L.KerutE. K.ToF.SheahanC. M.SheahanM. G. (2024). Machine learning-based prediction of abdominal aortic aneurysms for individualized patient care. J. Vasc. Surg. 79, 1057–1067.e2. doi: 10.1016/j.jvs.2023.12.046
38
TangZ.WangP.DongC.ZhangJ.WangX.PeiH. (2022). oxidative stress signaling mediated pathogenesis of diabetic cardiomyopathy. Oxid. Med. Cell. Longev.2022, 5913374. doi: 10.1155/2022/5913374
39
TianG.YangY.WenS. (2025). Time-series stock price forecasting based on neural networks: a comprehensive survey. Artif. Intell. Sci. Eng. 1, 255–277. doi: 10.23919/AISE.2025.000018
- CrossRef
- Google Scholar
40
VåleO.ZhangS.MaharjanS.KlæboeG. (2025). Exploring the interpretability of forecasting models for energy balancing market. Artif. Intell. Sci. Eng. 1, 295–306. doi: 10.23919/AISE.2025.000020
- CrossRef
- Google Scholar
41
VeličkovićP.FedusW.HamiltonW. L.LiòP.BengioY.HjelmR. D. (2019). “Deep graph infomax,” in 2019 International Conference on Learning Representations (ICLR) [New Orleans, LA: International Conference on Learning Representations (ICLR)], 1–17. doi: 10.48550/arXiv.1809.10341
- CrossRef
- Google Scholar
42
WangH.ZhangQ.ChenF. Y.LeungE. Y. M.WongE. L. Y.YeohE. K. (2021). Tensor factorization-based prediction with an application to estimate the risk of chronic diseases. IEEE Intell. Syst. 36, 53–61. doi: 10.1109/MIS.2021.3071018
- CrossRef
- Google Scholar
43
WangJ.FoxmanB.RaoK.CassoneM.GibsonK.ModyL.et al. (2023). Association of patient clinical and gut microbiota features with vancomycin-resistant enterococci environmental contamination in nursing homes: a retrospective observational study. Lancet Healthy Longev. 4, e600–e607. doi: 10.1016/S2666-7568(23)00188-5
44
WuD.LiS.HeY.LuoX.GaoX. (2026). Non-gradient hash factor learning for high-dimensional and incomplete data representation learning. IEEE Trans. Pattern Anal. Mach. Intell. doi: 10.1109/TPAMI.2026.3653780
45
WuF.ZhangT.SouzaA. H. D.FiftyC.YuT.WeinbergerK. Q. (2019). “Simplifying graph convolutional networks,” in 2019 International Conference on Machine Learning (ICML) [Long Beach, CA: Proceedings of Machine Learning Research (PMLR)], 6861–6871. doi: 10.48550/arXiv.1902.07153
- CrossRef
- Google Scholar
46
XuH.YewM. S. (2023). Visual ordinal coronary artery calcium score from non-gated chest CT predicts mortality after severe chronic obstructive pulmonary disease exacerbation. Int. J. Chron. Obstruct. Pulmon. Dis. 18, 3115–3124. doi: 10.2147/COPD.S437401
47
XuK.LiC.TianY.SonobeT.KawarabayashiK. I.JegelkaS. (2018). “Representation learning on graphs with jumping knowledge networks,” in 2018 International Conference on Machine Learning (ICML) [Stockholm: Proceedings of Machine Learning Research (PMLR)], 5453–5462. doi: 10.48550/arXiv.1806.03536
- CrossRef
- Google Scholar
48
YangZ.ZhengY.ZhangL.ZhaoJ.XuW.WuH.et al. (2024). Screening the best risk model and susceptibility SNPs for chronic obstructive pulmonary disease (copd) based on machine learning algorithms. Int. J. Chron. Obstruct. Pulmon. Dis. 19, 2397–2414. doi: 10.2147/COPD.S478634
49
YinH.WangK.YangR.TanY.LiQ.ZhuW.et al. (2024). A machine learning model for predicting acute exacerbation of in-home chronic obstructive pulmonary disease patients. Comput. Methods Programs Biomed. 246:108005. doi: 10.1016/j.cmpb.2023.108005
50
YuZ.XinC.YuY.XiaJ.HanL. (2025). AI dermatology: reviewing the frontiers of skin cancer detection technologies. Intell. Oncol. 1:89–104. doi: 10.1016/j.intonc.2025.03.002
- CrossRef
- Google Scholar
51
YuanR.TangY.XiaoY.ZhangW. (2024). IBCS: learning information bottleneck-constrained denoised causal subgraph for graph classification. IEEE Trans. Pattern Anal. Mach. Intell. 47, 1627–1643. doi: 10.1109/TPAMI.2024.3508766
52
ZengJ.QiuY.YangC.FanX.ZhouX.ZhangC.et al. (2025). Cardiovascular diseases and depression: a meta-analysis and Mendelian randomization analysis. Mol. Psychiatry30, 4234–4246. doi: 10.1038/s41380-025-03003-2
53
ZhangH.TongH. H. Y. (2025). Discovery of novel EGFR and BRAF inhibitors by machine learning approach. Intell. Oncol.1, 7–16. doi: 10.1016/j.intonc.2024.10.001
- CrossRef
- Google Scholar
54
ZhouC.QinY.ZhaoW.LiangZ.LiM.LiuD.et al. (2023). International expert consensus on diagnosis and treatment of lung cancer complicated by chronic obstructive pulmonary disease. Transl. Lung Cancer Res. 12, 1661–1701. doi: 10.21037/tlcr-23-339

Summary

Keywords

comorbidity network, disease potential, fusion attention, hypertension, risk prediction

Citation

Zhou L, Qin H, Yang Y, Huang G and Liu Z (2026) A disease potential-driven graph attention model for comorbidity risk prediction of hypertension. Front. Big Data 9:1814157. doi: 10.3389/fdata.2026.1814157

Received

20 February 2026

Revised

03 March 2026

Accepted

04 March 2026

Published

02 April 2026

Volume

9 - 2026

Edited by

Qingguo Lü, Chongqing University, China

Reviewed by

Jianhong Gan, Chengdu University of Information Technology, China

Qiangting Deng, Army Medical University, China

Yong Tang, Huazhong University of Science and Technology, China

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhigang Liu, liuzhigang@dgut.edu.cn

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL RESEARCH article

A disease potential-driven graph attention model for comorbidity risk prediction of hypertension

Abstract

1 Introduction

2 Data material and problem statements

2.1 Dataset preparation

2.2 Problem statements

3 Methods

3.1 Overview of the proposed DP-GA model

3.2 Fusion attention mechanism

3.3 Difference perception mechanism

3.3.1 Bregman divergence for node difference

3.3.2 Similarity-difference guided message passing

3.4 Disease potential-driven attention

3.4.1 Disease potential calculation

3.4.2 Mask construction

3.4.3 Integration with attention mechanism

3.5 Loss function

3.5.1 Classification loss

3.5.2 Position preservation loss

3.5.3 Total loss

3.6 Comorbidity networks analysis

4 Experiments

4.1 General settings

4.2 Evaluation metrics

4.3 Comparison results and analysis

4.4 Visualization results and analysis

4.5 Findings of comorbidity network and pathological mechanism

5 Conclusions

Statements

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Generative AI statement

Publisher’s note

References

Summary

Outline

Figures

Cite article

Share article

Article metrics