Objective To systematically review prediction models of small for gestational age (SGA) based on machine learning and provide references for the construction and optimization of such a prediction model. Methods The PubMed, EMbase, Web of Science, CBM, WanFang Data, VIP and CNKI databases were electronically searched to collect studies on SGA prediction models from database inception to August 10, 2022. Two researchers independently screened the literature, extracted data, evaluated the risk of bias of the included studies, and conducted a systematic review. Results A total of 14 studies, comprising 40 prediction models constructed using 19 methods, such as logical regression and random forest, were included. The results of the risk of bias assessment from 13 studies were high; the area under the curve of the prediction models ranged from 0.561 to 0.953. Conclusion The overall risk of bias in the prediction models for SGA was high, and the predictive performance was average. Models built using extreme gradient boosting (XGBoost) demonstrated the best predictive performance across different studies. The stacking method can improve predictive performance by integrating different models. Finally, maternal blood pressure, fetal abdominal circumference, head circumference, and estimated fetal weight were important predictors of SGA.
ObjectiveTo explore the risk factors for accompanying depression in patients with community type Ⅱ diabetes and to construct their risk prediction model. MethodsA total of 269 patients with type Ⅱ diabetes accompanied with depression and 217 patients with simple type Ⅱ diabetes from three community health service centers in two streets of Pingshan District, Shenzhen from October 2021 to April 2022 were included. The risk factors were analyzed and screened out, and a logistic regression risk prediction model was constructed. The goodness of fit and prediction ability of the model were tested by the Hosmer-Lemeshow test and the receiver operating characteristic (ROC) curve. Finally, the model was verified. ResultsLogistic regression analysis showed that smoking, diabetes complications, physical function, psychological dimension, medical coping for face, and medical coping for avoidance were independent risk factors for depressive disorder in patients with type Ⅱ diabetes. Modeling group Hosmer-Lemeshow test P=0.345, the area under the ROC curve was 0.987, sensitivity was 95.2% and specificity was 98.6%. The area under the ROC curve was 0.945, sensitivity was 89.8%, specificity was 84.8%, and accuracy was 86.8%, showing the model predictive value. ConclusionThe risk prediction model of type Ⅱ diabetes patients with depressive disorder constructed in this study has good predictive and discriminating ability.
ObjectiveTo establish a hypertension prediction model for middle-aged and elderly people in China and to use the basic public health service database for performance validation. MethodsThe literature related to hypertension was retrieved from the internet. Using meta-analysis to assess the effect value of influencing factors. Statistically significant factors, which were also combined in the database, were extracted as the predictors of the models. The predictors’ effect values were logarithmarithm-transformed as the parameters of the Logit function model and the risk score model. Participants who were never diagnosed with hypertension at the physical examination of health service project of Hongguang Town Health Center in Pidu District of Chengdu from January 1, 2017, to January 1, 2022, were considered as the external validation group. ResultsA total of 15 original studies were involved in the meta-analysis and 11 statistically significant influencing factors for hypertension were identified, including age, female, systolic blood pressure, diastolic blood pressure, BMI, central obesity, triglyceride, smoking, drinking, history of diabetes and family history of hypertension. Of 4997 qualified participants, 684 individuals were identified with hypertension during the five-years follow-up. External validation indicated an AUC of 0.571 for the Logit function model and an AUC of 0.657 for the risk score model. ConclusionIn this study, we developed two different prediction models based on the results of meta-analysis. National basic public health service database is used to verify the models. The risk score model has a better prediction performance, which may help quickly stratify the risk class of the community crowd and strengthen the primary-level assistance system.
ObjectiveTo construct and verify the nomogram prediction model of pregnant women's fear of childbirth. MethodsA convenient sampling method was used to select 675 pregnant women in tertiary hospital in Tangshan City, Hebei Province from July to September 2022 as the modeling group, and 290 pregnant women in secondary hospital in Tangshan City from October to December 2022 as the verification group. The risk factors were determined by logistic regression analysis, and the nomogram was drawn by R 4.1.2 software. ResultsSix predictors were entered into the model: prenatal education, education level, depression, pregnancy complications, anxiety and preference for delivery mode. The areas under the ROC curves of the modeling group and the verification group were 0.834 and 0.806, respectively. The optimal critical values were 0.113 and 0.200, respectively, with sensitivities of 67.2% and 77.1%, the specificities were 87.3% and 74.0%, and the Jordan indices were 0.545 and 0.511, respectively. The calibration charts of the modeling group and the verification group showed that the coincidence degree between the actual curve and the ideal curve was good. The results of Hosmer-Lemeshow goodness of fit test were χ2=6.541 (P=0.685) and χ2=5.797 (P=0.760), and Brier scores were 0.096 and 0.117, respectively. DCA in modeling group and verification group showed that when the threshold probability of fear of childbirth were 0.00 to 0.70 and 0.00 to 0.70, it had clinical practical value. ConclusionThe nomogram model has good discrimination, calibration and clinical applicability, which can effectively predict the risk of pregnant women's fear of childbirth and provide references for early clinical identification of high-risk pregnant women and targeted intervention.
ObjectiveTo analyze independent factors for treatment-requiring retinopathy of prematurity (TR-ROP) and establish a predictive nomogram model for TR-ROP. MethodA retrospective cohort study. A total of 6 998 preterm infants who were born at Guangdong Women's and Children's Hospital between January 1, 2012 and March 31, 2022 and were screened for retinopathy of prematurity (ROP) were included in the study. TR-ROP was defined as type 1 ROP and aggressive ROP; 22 independent factors including general information, maternal perinatal conditions, interventions and neonatal diseases related to ROP were collected. The infants were divided at the level at an 8:2 ratio according to clinical experience, with 5 598 in the training cohort and 1 400 in the validation cohort. t test was used for comparison of quantitative data and χ2 test was used for comparison of counting data between groups. Multivariate logistic regression analysis was carried out for the indicators with differences in the univariate analysis. The visualized regression analysis results of R software were used to obtain the histogram. The accuracy of the nomogram was verified by C-index and receiver operating characteristic curve (ROC curve). ResultsAmong the 6 998 children tested, 4 069 were males and 2 920 were females. Gestational age was (33.69±3.19) weeks; birth weight was (2 090±660) g. There were 376 cases of TR-ROP (5.4%, 376/6 998). The results of multivariate logistic regression analysis showed that gestational age [odds ratio (OR) =0.63, 95% confidence interval (CI) 0.47-0.85, P=0.002], intrauterine distress (OR=0.30, 95%CI 0.10-0.99, P=0.048), bronchopulmonary dysplasia (OR=0.23, 95%CI 0.09-0.60, P=0.003), hypoxic-ischemic encephalopathy (OR=5.40, 95%CI 1.45-20.10, P=0.012), blood transfusion history (OR=4.05, 95%CI 1.50-10.95, P=0.006) were the independent influencing factors of TR-ROP. Based on this and combined with birth weight, a nomogram prediction model was established. The C-index of the training set and validation set were 0.940 and 0.885, respectively, and the area under ROC curve were 0.945 (95%CI 0.930-0.961) and 0.931 (95%CI 0.876-0.986), respectively. The sensitivity and specificity were 86.2%, 94.0% and 83.2%, 93.3%, respectively. ConclusionsGestational age, intrauterine distress, bronchopulmonary dysplasia, hypoxic-ischemic encephalopathy and blood transfusion history are the independent factors influencing the occurrence of TR-ROP. The TR-ROP nomogram prediction model based on independent influencing factors has high sensitivity and specificity.
ObjectiveTo observe the relationship between ventilator-associated pneumonia (VAP) and changes in bronchial mucosa and sputum in critically ill patients. A prediction model for SEH score was developed according to the abnormal degrees of airway sputum , mucosal edema and mucosal hyperemia , as well as to analyze the diagnostic value of the SEH scores for VAP during bronchoscopy. MethodsA collection of general data and initial bronchoscopy results was conducted for patients admitted to the department of intensive care unit at West China Hospital from March 1, 2024, to July 1, 2024. Patients were divided into infection group (n=138) and non-infection group (n=227) according to diagnostic criteria for VAP based on the date of their first bronchoscopy. T-tests were used to compare baseline data between groups, while analysis of variance was employed to assess differences in airway mucosal and sputum lesions. A binary logistic regression model was constructed using the SEH scores for predicting VAP risk, with receiver operating characteristic curve area under the curve (AUC) utilized to evaluate model accuracy. ResultsA total of 365 patients were included in this study, among which 138 cases (37.8%) were diagnosed with VAP. The AUC for using SEH scores in diagnosing VAP was found to be 0.81 [95% confidence interval (CI) 0.76-0.85], with an optimal cutoff value set at 6.5. The sensitivity and specificity of SEH scores for diagnosing VAP were determined as 79.7% (95% CI: 72.2%-85.6%) and 73.1% (95% CI:67.0%-78.5%). Patients with SEH scores over 6.5 exhibited a significantly higher rate of VAP infection (64.3% vs.14.4%, P<0.0001), elevated white blood cell count levels (WBC) [(13.3±7.5 vs.1.8±6.2), P=0.04], as well as increased hospital mortality rates (39.8 % vs.24.2 %, P=0.002). ConclusionsThe SEH scores has a certain efficacy in the diagnosis of VAP in patients with mechanical ventilation. Compared with the traditional VAP diagnostic criteria, SEH scores is easier to obtain in clinical practice, and has certain clinical application value.
ObjectiveTo systematically evaluate postpartum depression risk prediction models in order to provide references for the construction, application and optimization of related prediction models. MethodsThe CNKI, VIP, WanFang Data, PubMed, Web of Science and EMbase were electronically searched to collect studies on predictive model for the risk of postpartum from January 2013 to April 2023. Two reviewers independently screened the literature, extracted data, and assessed the quality of the included studies based on PROBAST tool. ResultsA total of 10 studies, each study with 1 optimal model were evaluated. Common predictors included prenatal depression, age, smoking history, thyroid hormones and other factors. The area under the curve of the model was greater than 0.7, and the overall applicability was general. Overall high risk of bias and average applicability, mainly due to insufficient number of events in the analysis domain for the response variable, improper handling of missing data, screening of predictors based on univariate analysis, lack of model performance assessment, and consideration of model overfitting. ConclusionThe model is still in the development stage. The included model has good predictive performance and can help early identify people with high incidence of postpartum depression. However, the overall applicability of the model needs to be strengthened, a large sample, multi-center prospective clinical study should be carried out to construct the optimal risk prediction model of PPD, in order to identify and prevent PPD as soon as possible.
ObjectiveTo systematically review the research status of risk prediction models for gestational diabetes mellitus (GDM). MethodsThe CNKI, WanFang Data, VIP, CBM, PubMed, JBI EBP, Ovid MEDLINE, Embase, Web of Science and Cochrane Library databases were electronically searched to collect relevant literature on risk prediction models for GDM from inception to October 2022. Two researchers independently screened the literature, extracted data, and assessed the risk of bias of the included studies, and then qualitative description was performed. ResultsA total of 19 studies were included, involving 19 risk prediction models. The evaluation results showed that, in terms of the risk of bias, 18 studies were high risk, and 1 study was unclear. In terms of applicability, 14 studies were high risk, 2 studies were low risk, and 3 studies were unclear. The area under the receiver operating characteristic curve of the included models was 0.69 to 0.88. The most common predictors included age, weight, pre-pregnancy BMI, history of diabetes, family history of diabetes, and race. ConclusionThe overall performance of the risk prediction model for gestational diabetes mellitus is good, but the risk of bias of the model is high, and the clinical applicability of the model needs to be further verified.
ObjectiveConstructing a prediction model for seizures after stroke, and exploring the risk factors that lead to seizures after stroke. MethodsA retrospective analysis was conducted on 1 741 patients with stroke admitted to People's Hospital of Zhongjiang from July 2020 to September 2022 who met the inclusion and exclusion criteria. These patients were followed up for one year after the occurrence of stroke to observe whether they experienced seizures. Patient data such as gender, age, diagnosis, National Institute of Health Stroke Scale (NIHSS) score, Activity of daily living (ADL) score, laboratory tests, and imaging examination data were recorded. Taking the occurrence of seizures as the outcome, an analysis was conducted on the above data. The Least absolute shrinkage and selection operator (LASSO) regression analysis was used to screen predictive variables, and multivariate Logistic regression analysis was performed. Subsequently, the data were randomly divided into a training set and a validation set in a 7:3 ratio. Construct prediction model, calculate the C-index, draw nomogram, calibration plot, receiver operating characteristic (ROC) curve, and decision curve analysis (DCA) to evaluate the model's performance and clinical application value. ResultsThrough LASSO regression, nine non-zero coefficient predictive variables were identified: NIHSS score, homocysteine (Hcy), aspartate aminotransferase (AST), platelet count, hyperuricemia, hyponatremia, frontal lobe lesions, temporal lobe lesions, and pons lesions. Multivariate logistic regression analysis revealed that NIHSS score, Hcy, hyperuricemia, hyponatremia, and pons lesions were positively correlated with seizures after stroke, while AST and platelet count were negatively correlated with seizures after stroke. A nomogram for predicting seizures after stroke was established. The C-index of the training set and validation set were 0.854 [95%CI (0.841, 0.947)] and 0.838 [95%CI (0.800, 0.988)], respectively. The areas under the ROC curves were 0.842 [95%CI (0.777, 0.899)] and 0.829 [95%CI (0.694, 0.936)] respectively. Conclusion These nine variables can be used to predict seizures after stroke, and they provide new insights into its risk factors.
Objective To observe the correlation between the level of advanced glycosylation end products (AGE) in skin and diabetic retinopathy (DR), and establish and preliminatively verify the nomogramolumbaric model for predicting the risk of DR. MethodsA clinical case-control study. A total of 346 patients with type 2 diabetes mellitus (T2DM) who were admitted to the Department of Endocrinology and Ophthalmology of the First Affiliated Hospital of Zhengzhou University from January 2023 to June 2024 were included in the study. Among them, 198 were males and 148 were females. The mean age was (54.77±10.92). According to whether the patients were accompanied by DR, the patients were divided into the non-DR group (NDR group) and the DR group (DR group), 174 and 172 cases, respectively. All patients underwent skin AGE detection using a noninvasive diabetes detector. Diabetes duration, hemoglobin A1c (HbA1c), fasting plasma glucose, Urea, creatinine (Crea), uric acid, total cholesterol, triglyceride, estimated glomerular filtration rate (eGFR), urinary albumin concentration (UALB), and body mass index (BMI) were collected in detail. Univariate analysis and multivariate logistic regression analysis were used to determine the independent risk factors for T2DM concurrent DR, and to construct a nomogram prediction model for DR risk. Receiver operating characteristic curve (ROC curve), calibration curve and decision curve (DCA) were used to evaluate the model. ResultsHypertension prevalence rate (χ2=3.892), Diabetes duration (Z=?7.708), BMI (Z=?2.627), HbA1c (Z=?4.484), Urea (Z=?4.620), Crea (Z=?3.526), UALB (Z=?6.999), AGE (Z=?8.097) in DR group were significantly higher than those in NDR group, with statistical significance (P<0.05); eGFR was lower than that in NDR group, the difference was statistically significant (Z=?6.061, P<0.05). Logistic regression analysis showed that AGE, diabetes duration, HbA1c, UALB and eGFR were independent risk factors for DR (P<0.05). Based on the results of multi-factor regression analysis, a nomogram prediction model was constructed. The area under ROC curve of the model was 0.843, 95% confidence interval was 0.802-0.884, sensitivity and specificity were 79.1% and 75.9%, respectively. The calibration curve was basically consistent with the ideal curve. The results of DCA analysis showed that when the model predicted the risk threshold of patients with DR between 0.17 and 0.99, the clinical net benefit provided by the nomogram model was>0. ConclusionsSkin AGE level is an independent risk factor for DR. The nomogram prediction model based on AGE, diabetes duration, HbA1c, eGFR and UALB can accurately predict the risk of DR, and has good clinical practicability.