Objective By collecting the patient information in the Colon Cancer Database of Peking Union Medical College Hospital since its establishment, we aim to demonstrate the completeness of the clinicopathological characteristics and follow-up data to provide reference for single-center database establishment.
Methods We collected 33 surgical patient information entries from the single-center prospective registry database of Colon Cancer at Peking Union Medical College Hospital from January 5, 2016 to May 11, 2022, including demographic characteristics, features of the primary lesion, surgical and pathological information and postoperative complications, and calculated the missing rate of each entry as well as the follow-up data.
Results A total of 1682 patients with colon cancer were included in the analysis, in which 981 patients were male and 701 patients were female, with an average age of (62.75±11.97) years. The postoperative complication rate was 15.6% (263/1682). Among the data entries, demographic features such as sex, age, ethnic group as well as the pathologic staging were collected without missing. The relatively high-missing entries were the colonoscopy tumor site with a missing rate of 12.2%, followed by immunohistochemical information of mismatch repair with a missing rate of 8.0%. The missing rate of all 33 entries was distributed in 0-12.2%, of which 81.8% (27/33) were less than 1% and 93.9% (31/33) were less than 5%. As for follow-up data, the percentage of patients with complete missing follow-up was 1.7%, which meant 98.3% patients had available survival data. The 1-year, 2-year, 3-year, 4-year and 5-year follow-up rates of patients enrolled before July 2, 2020 (patients enrolled after this date did not have regular follow-up information in a uniform format due to data migration) were 55.02%, 70.96%, 72.02%, 65.42% and 70.91%, respectively, and the 2-year complete follow-up rate was 19.5%.
Conclusion The low rate of missing statistical entries and the high follow-up rate of our center's database has certain reference value for database establishment, but there is still room for improvement in terms of continuous follow-up.