Performance Evaluation of SpliceAI for the Prediction of Splicing of NF1 Variants

Published on Aug 25, 2021 in Genes (impact factor: 3.759)
DOI: 10.3390/GENES12091308
Changhee Ha, Jong-Won Kim (SMC: Samsung Medical Center; estimated H-index: 38), Ja-Hyun Jang (estimated H-index: 9)
Abstract
Neurofibromatosis type 1, characterized by neurofibromas and café-au-lait macules, is one of the most common genetic disorders and is caused by pathogenic NF1 variants. Because a high proportion of NF1 mutations affect splicing, identifying variants that alter splicing is an essential task for laboratories. Here, we investigated the sensitivity and specificity of SpliceAI, a recently introduced in silico splicing prediction algorithm, in conjunction with other in silico tools. We evaluated 285 NF1 variants identified from 653 patients. The effect of each variant on splicing was confirmed by complementary DNA sequencing followed by genomic DNA sequencing. For in silico prediction of splicing effects, we used SpliceAI, MaxEntScan (MES), and Splice Site Finder-like (SSF). The sensitivity and specificity of SpliceAI were 94.5% and 94.3%, respectively, with a cut-off value of Δ Score > 0.22. The area under the curve of SpliceAI was 0.975 (p < 0.0001). Combined analysis of MES/SSF showed a sensitivity of 83.6% and a specificity of 82.5%. The concordance rate between SpliceAI and MES/SSF was 84.2%. SpliceAI outperformed MES/SSF in predicting splicing alteration for NF1 variants. As a convenient web-based tool, SpliceAI may be helpful in clinical laboratories conducting DNA-based NF1 sequencing.
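The metrics quoted in the abstract (sensitivity and specificity at a Δ Score cut-off, and the area under the ROC curve) can be illustrated with a minimal Python sketch. This is not the authors' analysis code: the function names and the eight-variant toy dataset below are hypothetical, and the sketch assumes one Δ score per variant plus a boolean label recording whether sequencing confirmed a splicing alteration.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    scores: Sequence[float], labels: Sequence[bool], cutoff: float = 0.22
) -> Tuple[float, float]:
    """Call a variant splice-altering when its delta score exceeds the cutoff.

    The default cutoff of 0.22 is the value reported in the abstract.
    """
    tp = sum(1 for s, y in zip(scores, labels) if y and s > cutoff)
    fn = sum(1 for s, y in zip(scores, labels) if y and s <= cutoff)
    tn = sum(1 for s, y in zip(scores, labels) if not y and s <= cutoff)
    fp = sum(1 for s, y in zip(scores, labels) if not y and s > cutoff)
    return tp / (tp + fn), tn / (tn + fp)


def auc(scores: Sequence[float], labels: Sequence[bool]) -> float:
    """Area under the ROC curve via the rank (Mann-Whitney) formulation:
    the fraction of positive/negative pairs in which the positive variant
    scores higher, counting ties as half."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, NOT taken from the study.
toy_scores = [0.95, 0.80, 0.30, 0.10, 0.50, 0.05, 0.02, 0.15]
toy_labels = [True, True, True, True, False, False, False, False]

sens, spec = sensitivity_specificity(toy_scores, toy_labels)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} AUC={auc(toy_scores, toy_labels):.4f}")
```

Raising the cut-off trades sensitivity for specificity; the AUC summarises performance across all possible cut-offs, which is why the abstract reports it alongside the single operating point at 0.22.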