Functional annotation of the 2q35 breast cancer risk locus implicates a structural variant in influencing activity of a long-range enhancer element

Published on Jul 1, 2021in American Journal of Human Genetics10.502
· DOI :10.1016/J.AJHG.2021.05.013
A combination of genetic and functional approaches has identified three independent breast cancer risk loci at 2q35. A recent fine-scale mapping analysis to refine these associations resulted in 1 (signal 1), 5 (signal 2), and 42 (signal 3) credible causal variants at these loci. We used publicly available in silico DNase I and ChIP-seq data with in vitro reporter gene and CRISPR assays to annotate signals 2 and 3. We identified putative regulatory elements that enhanced cell-type-specific transcription from the IGFBP5 promoter at both signals (30- to 40-fold increased expression by the putative regulatory element at signal 2, 2- to 3-fold by the putative regulatory element at signal 3). We further identified one of the five credible causal variants at signal 2, a 1.4 kb deletion (esv3594306), as the likely causal variant; the deletion allele of this variant was associated with an average additional increase in IGFBP5 expression of 1.3-fold (MCF-7) and 2.2-fold (T-47D). We propose a model in which the deletion allele of esv3594306 juxtaposes two transcription factor binding regions (annotated by estrogen receptor alpha ChIP-seq peaks) to generate a single extended regulatory element. This regulatory element increases cell-type-specific expression of the tumor suppressor gene IGFBP5 and, thereby, reduces risk of estrogen receptor-positive breast cancer (odds ratio = 0.77, 95% CI 0.74-0.81, p = 3.1 × 10-31).
📖 Papers frequently viewed together
86 Citations
2 Citations
12 Authors (Yosr Hamdi, ..., Jacques Simard)
3 Citations
Rapidly advancing genomic technologies and cross-disciplinary partnerships are accelerating the biological and clinical interpretation of genome-wide association studies, with some therapies developed based on these findings already being tested in clinical trials. The next decade promises further progress in understanding the function of genetic variants.
4 CitationsSource
#1M. Grace Gordon (UCSF: University of California, San Francisco)H-Index: 3
#2Fumitaka Inoue (UCSF: University of California, San Francisco)H-Index: 13
Last. Nadav Ahituv (UCSF: University of California, San Francisco)H-Index: 39
view all 18 authors...
Massively parallel reporter assays (MPRAs) can simultaneously measure the function of thousands of candidate regulatory sequences (CRSs) in a quantitative manner. In this method, CRSs are cloned upstream of a minimal promoter and reporter gene, alongside a unique barcode, and introduced into cells. If the CRS is a functional regulatory element, it will lead to the transcription of the barcode sequence, which is measured via RNA sequencing and normalized for cellular integration via DNA sequencin...
13 CitationsSource
#1Haoyu Zhang (Johns Hopkins University)H-Index: 5
#2Thomas Ahearn (NIH: National Institutes of Health)H-Index: 6
Last. Manjeet K. Bolla (University of Cambridge)H-Index: 41
view all 3 authors...
Breast cancer susceptibility variants frequently show heterogeneity in associations by tumor subtype1–3. To identify novel loci, we performed a genome-wide association study including 133,384 breast cancer cases and 113,789 controls, plus 18,908 BRCA1 mutation carriers (9,414 with breast cancer) of European ancestry, using both standard and novel methodologies that account for underlying tumor heterogeneity by estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 ...
52 CitationsSource
#1Laura Fachal (University of Cambridge)H-Index: 24
#2Hugues Aschard (Harvard University)H-Index: 29
Last. Alison M. Dunning (University of Cambridge)H-Index: 8
view all 3 authors...
Genome-wide association studies have identified breast cancer risk variants in over 150 genomic regions, but the mechanisms underlying risk remain largely unknown. These regions were explored by combining association analysis with in silico genomic feature annotations. We defined 205 independent risk-associated signals with the set of credible causal variants in each one. In parallel, we used a Bayesian approach (PAINTOR) that combines genetic association, linkage disequilibrium and enriched gen...
48 CitationsSource
#1Elizabeth A Coe (University of Bath)H-Index: 2
#2Jennifer Y Tan (UNIL: University of Lausanne)H-Index: 9
Last. Keith W. Vance (University of Bath)H-Index: 14
view all 8 authors...
The MITF and SOX10 transcription factors regulate the expression of genes important for melanoma proliferation, invasion and metastasis. Despite growing evidence of the contribution of long noncoding RNAs (lncRNAs) in cancer, including melanoma, their functions within MITF-SOX10 transcriptional programmes remain poorly investigated. Here we identify 245 candidate melanoma associated lncRNAs whose loci are co-occupied by MITF-SOX10 and that are enriched at active enhancer-like regions. Our work s...
25 CitationsSource
#1Joseph S Baxter (ICR: Institute of Cancer Research)H-Index: 1
#2Olivia C. Leavy (Lond: University of London)H-Index: 2
Last. Olivia Fletcher (ICR: Institute of Cancer Research)H-Index: 60
view all 18 authors...
Genome-wide association studies (GWAS) have identified approximately 100 breast cancer risk loci. Translating these findings into a greater understanding of the mechanisms that influence disease risk requires identification of the genes or non-coding RNAs that mediate these associations. Here, we use Capture Hi-C (CHi-C) to annotate 63 loci; we identify 110 putative target genes at 33 loci. To assess the support for these target genes in other data sources we test for associations between levels...
56 CitationsSource
Breast cancer risk is influenced by rare coding variants in susceptibility genes, such as BRCA1, and many common, mostly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. Here we report the results of a genome-wide association study of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry1. We identified 65 new loci that are associated with overall breast cancer risk...
652 CitationsSource
Most common breast cancer susceptibility variants have been identified through genome-wide association studies (GWAS) of predominantly estrogen receptor (ER)-positive disease. We conducted a GWAS using 21,468 ER-negative cases and 100,594 controls combined with 18,908 BRCA1 mutation carriers (9,414 with breast cancer), all of European origin. We identified independent associations at P < 5 × 10-8 with ten variants at nine new loci. At P < 0.05, we replicated associations with 10 of 11 variants p...
182 CitationsSource
#1Joshua A. Betts (UQ: University of Queensland)H-Index: 5
#2Mahdi Moradi Marjaneh (QIMR: QIMR Berghofer Medical Research Institute)H-Index: 8
Last. Juliet D. French (QIMR: QIMR Berghofer Medical Research Institute)H-Index: 26
view all 31 authors...
Breast cancer risk is strongly associated with an intergenic region on 11q13. We have previously shown that the strongest risk-associated SNPs fall within a distal enhancer that regulates CCND1. Here, we report that, in addition to regulating CCND1, this enhancer regulates two estrogen-regulated long noncoding RNAs, CUPID1 and CUPID2. We provide evidence that the risk-associated SNPs are associated with reduced chromatin looping between the enhancer and the CUPID1 and CUPID2 bidirectional promot...
54 CitationsSource
#1Maya Ghoussaini (University of Cambridge)H-Index: 27
#2Juliet D. French (QIMR: QIMR Berghofer Medical Research Institute)H-Index: 26
Last. Stacey L. Edwards (QIMR: QIMR Berghofer Medical Research Institute)H-Index: 25
view all 123 authors...
Genome-wide association studies (GWASs) have revealed increased breast cancer risk associated with multiple genetic variants at 5p12. Here, we report the fine mapping of this locus using data from 104,660 subjects from 50 case-control studies in the Breast Cancer Association Consortium (BCAC). With data for 3,365 genotyped and imputed SNPs across a 1 Mb region (positions 44,394,495–45,364,167; NCBI build 37), we found evidence for at least three independent signals: the strongest signal, consist...
40 CitationsSource
Cited By2
#1Natasha K Tuano (DI: Discovery Institute)
#2Jonathan Beesley (QIMR: QIMR Berghofer Medical Research Institute)H-Index: 58
Last. Aaron Hatzipantelis (QIMR: QIMR Berghofer Medical Research Institute)
view all 21 authors...
Genome-wide association studies (GWAS) have identified >200 loci associated with breast cancer (BC) risk. The majority of candidate causal variants (CCVs) are in non-coding regions and are likely to modulate cancer risk by regulating gene expression. We recently developed a scoring system, INQUISIT, to predict candidate risk genes at BC-risk loci. Here, we used pooled CRISPR activation and suppression screens to validate INQUISIT predictions, and to define the cancer phenotypes they mediate. We ...
#1Joe Dennis (University of Cambridge)H-Index: 55
#2Tyrer Jp (University of Cambridge)H-Index: 2
Last. Doug Easton (University of Cambridge)H-Index: 62
view all 122 authors...
Background: Copy number variants (CNVs) are pervasive in the human genome but potential disease associations with rare CNVs have not been comprehensively assessed in large datasets. We analysed rare CNVs in genes and non-coding regions for 86,788 breast cancer cases and 76,122 controls of European ancestry with genome-wide array data. Results: Gene burden tests detected the strongest association for deletions in BRCA1 (P= 3.7E-18). Nine other genes were associated with a p-value < 0.01 including...