Importance Understanding the prevalence of the pathogenic germline variants in cancer susceptibility genes within a non-selected population could be highly beneficial for evaluating the need for broader genetic testing guidelines and thereby aiding early detection of cancer.
Objective To determine the population-level prevalence of pathogenic variants in specific cancer susceptibility genes, which are typically screened for only in individuals with cancer or their at-risk relatives due to cost considerations. We will further breakdown the variant prevalence by race, ethnicity and sex
Design Cancer susceptibility genes were selected based on the Invitae Multi-Cancer panel.
Setting This population-based genomic study utilized short-read whole genome sequencing (srWGS) data from the All of Us (AoU) controlled tier database v8.
Participants A total of 414,830 individuals with srWGS data and variant information were included in the analysis.
Main Outcomes and Measures We extracted the ClinVar variant consequences for each participant and applied stringent criteria to classify variants as truly pathogenic or likely pathogenic (P/LP). A total of 3,454 unique P/LP variants were identified across 77 transcripts and 72 genes.
Results We identified 20,968 individuals with P/LP variants, representing approximately 5.05% of the total population. Pathogenic variant prevalence did not significantly differ by sex and ethnicity, while significant differences were observed across racial groups (adj p < 0.0001), with white participants having the highest prevalence (5.72%) and Asians the lowest. Among individual genes, MUTYH (1.33%) had the highest prevalence, followed by BRCA2 (0.42%) and MITF (0.37%). Pathogenic variants in TP53 showed significant sex-based differences, while a higher number of genes had significantly different carrier rates across ethnicity (7) and racial groups (14).
Conclusions and Relevance The pathogenic variant prevalence found in the general population, along with its racial variability, highlights the need to reconsider current genetic testing guidelines. Expanding screening recommendations could enhance early cancer detection and prevention efforts ultimately reducing disease burden and improving outcomes.
Questions What are the population-level prevalence rates of pathogenic germline variants in specific cancer susceptibility genes?
Findings Among 414,830 participants with short read whole genome sequencing data from the All of Us dataset, 5.05% carried a pathogenic or likely pathogenic variant in one of 84 cancer susceptibility genes identified from the Invitae Multi-Cancer Panel.
Meaning These findings suggest that pathogenic variants are sufficiently common to support broader genetic screening programs for cancer predisposition, enabling early detection and targeted prevention strategies.
Competing Interest StatementThe authors have declared no competing interest.
Funding StatementThis study did not receive any funding
Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data AvailabilityAll data produced in the present study are available upon reasonable request to the authors
Comments (0)