Yum, tasty mutations...

MutationTaster - study a chromosomal position

MTQE documentation
NEVER press reload or F5 - unless you want to start from the very beginning.
input seems to be ok - now mapping the variant to the different transcripts...
found 1 transcript(s)...
Querying Taster for transcript #1: ENST00000225964
MT speed 0 s - this script 3.40198 s

Results


genesymbolpredictionprobabilitymodelprediction
problem
splicingClinVaramino acid changesvariant typedbSNP IDprotein lengthfile
COL1A1disease_causing_automatic0.999999992236817simple_aaeaffected0G221Csingle base exchangers72667037show file

Taster files

Yum, tasty mutations...

mutation t@sting

documentation

Prediction

disease causing

Model: simple_aae, prob: 0.999999992236817 (classification due to ClinVar, real probability is shown anyway)      (explain)
Summary
  • amino acid sequence changed
  • known disease mutation at this position (HGMD CM1611482)
  • known disease mutation at this position (HGMD CM920198)
  • known disease mutation: rs17320 (pathogenic)
  • protein features (might be) affected
  • splice site changes
hyperlink
analysed issue analysis result
name of alteration no title
alteration (phys. location) chr17:48275128C>AN/A show variant in all transcripts   IGV
HGNC symbol COL1A1
Ensembl transcript ID ENST00000225964
Genbank transcript ID NM_000088
UniProt peptide P02452
alteration type single base exchange
alteration region CDS
DNA changes c.661G>T
cDNA.780G>T
g.3866G>T
AA changes G221C Score: 159 explain score(s)
position(s) of altered AA
if AA alteration in CDS
221
frameshift no
known variant Reference ID: rs72667037
Allele 'A' was neither found in ExAC nor 1000G.
known disease mutation: rs17320 (pathogenic for Osteogenesis imperfecta type 1, mild) dbSNP  NCBI variation viewer
known disease mutation at this position, please check HGMD for details (HGMD ID CM1611482)

known disease mutation at this position, please check HGMD for details (HGMD ID CM1611482)
known disease mutation at this position, please check HGMD for details (HGMD ID CM920198)

known disease mutation at this position, please check HGMD for details (HGMD ID CM1611482)
known disease mutation at this position, please check HGMD for details (HGMD ID CM920198)
known disease mutation at this position, please check HGMD for details (HGMD ID CM920198)
regulatory features H3K14ac, Histone, Histone 3 Lysine 14 Acetylation
H3K18ac, Histone, Histone 3 Lysine 18 Acetylation
H3K27ac, Histone, Histone 3 Lysine 27 Acetylation
H3K27me3, Histone, Histone 3 Lysine 27 Tri-Methylation
H3K36me3, Histone, Histone 3 Lysine 36 Tri-Methylation
H3K4ac, Histone, Histone 3 Lysine 4 Acetylation
H3K4me2, Histone, Histone 3 Lysine 4 Di-Methylation
H3K4me3, Histone, Histone 3 Lysine 4 Tri-Methylation
H3K56ac, Histone, Histone 3 Lysine 56 Acetylation
H3K79me2, Histone, Histone 3 Lysine 79 di-methylation
H3K9ac, Histone, Histone 3 Lysine 9 Acetylation
H4K5ac, Histone, Histone 4 Lysine 5 Acetylation
H4K8ac, Histone, Histone 4 Lysine 8 Acetylation
H4K91ac, Histone, Histone 4 Lysine 91 Acetylation
phyloP / phastCons
PhyloPPhastCons
(flanking)5.131
5.131
(flanking)-0.1960.992
explain score(s) and/or inspect your position(s) in in UCSC Genome Browser
splice sites
effectgDNA positionscorewt detection sequence exon-intron border
Acc increased3875wt: 0.65 / mu: 0.86wt: TGGGTCCCCGAGGTCCCCCAGGTCCCCCTGGAAAGAATGGA
mu: TGGGTCCCCGATGTCCCCCAGGTCCCCCTGGAAAGAATGGA
 ccag|GTCC
Acc marginally increased3857wt: 0.9503 / mu: 0.9619 (marginal change - not scored)wt: ATCTTTTCTAGGGTCCCATGGGTCCCCGAGGTCCCCCAGGT
mu: ATCTTTTCTAGGGTCCCATGGGTCCCCGATGTCCCCCAGGT
 atgg|GTCC
Acc increased3874wt: 0.21 / mu: 0.63wt: ATGGGTCCCCGAGGTCCCCCAGGTCCCCCTGGAAAGAATGG
mu: ATGGGTCCCCGATGTCCCCCAGGTCCCCCTGGAAAGAATGG
 ccca|GGTC
Donor marginally increased3870wt: 0.8143 / mu: 0.8207 (marginal change - not scored)wt: AGGTCCCCCAGGTCC
mu: ATGTCCCCCAGGTCC
 GTCC|ccca
Donor marginally increased3861wt: 0.6188 / mu: 0.6650 (marginal change - not scored)wt: GGGTCCCCGAGGTCC
mu: GGGTCCCCGATGTCC
 GTCC|ccga
distance from splice site 19
Kozak consensus sequence altered? N/A
conservation
protein level for non-synonymous changes
speciesmatchgeneaaalignment
Human      221EPGASGPMGPRGPPGPPGKNGDDG
mutated  not conserved    221EPGASGPMGPRCPPGPPGKNGDD
Ptroglodytes  all identical  ENSPTRG00000009393  221EPGASGPMGPRGPPGPPGKNGDD
Mmulatta  all identical  ENSMMUG00000001467  221EPGASGPMGPRGPPGPPGKNGDD
Fcatus  no homologue    
Mmusculus  all identical  ENSMUSG00000001506  210EPGGSGPMGPRGPPGPPGKNGDD
Ggallus  no homologue    
Trubripes  all identical  ENSTRUG00000007520  213EPGSPGPMGPRGPSGPPGKNGDD
Drerio  all identical  ENSDARG00000012405  205EAGAPGPMGPRGAAGPPGKNGED
Dmelanogaster  no homologue    
Celegans  no homologue    
Xtropicalis  all identical  ENSXETG00000003374  207EPGSSGAMGPRGPAGPPGKNGED
protein features
start (aa)end (aa)featuredetails 
1791192REGIONTriple-helical region.lost
264266STRANDmight get lost (downstream of altered splice site)
265265MOD_RES5-hydroxylysine.might get lost (downstream of altered splice site)
265265CARBOHYDO-linked (Gal...).might get lost (downstream of altered splice site)
288288CONFLICTE -> P (in Ref. 15; AA sequence).might get lost (downstream of altered splice site)
370370CONFLICTR -> L (in Ref. 6; AAB59373).might get lost (downstream of altered splice site)
484484CONFLICTP -> L (in Ref. 19; AAA52289).might get lost (downstream of altered splice site)
595595CONFLICTA -> R (in Ref. 20; AAA51847).might get lost (downstream of altered splice site)
721721CONFLICTQ -> E (in Ref. 22; no nucleotide entry).might get lost (downstream of altered splice site)
738738CONFLICTL -> E (in Ref. 22; no nucleotide entry).might get lost (downstream of altered splice site)
745747MOTIFCell attachment site (Potential).might get lost (downstream of altered splice site)
953954SITECleavage; by collagenase (By similarity).might get lost (downstream of altered splice site)
966968STRANDmight get lost (downstream of altered splice site)
975976CONFLICTLP -> PL (in Ref. 19; AAA52291).might get lost (downstream of altered splice site)
10811081CONFLICTV -> A (in Ref. 18; AAA51995).might get lost (downstream of altered splice site)
10931095MOTIFCell attachment site (Potential).might get lost (downstream of altered splice site)
11081108CARBOHYDO-linked (Gal...) (By similarity).might get lost (downstream of altered splice site)
11081108MOD_RES5-hydroxylysine (By similarity).might get lost (downstream of altered splice site)
11641164MOD_RES3-hydroxyproline (By similarity).might get lost (downstream of altered splice site)
11931218REGIONNonhelical region (C-terminal).might get lost (downstream of altered splice site)
12081208MOD_RESAllysine (By similarity).might get lost (downstream of altered splice site)
12181219SITECleavage; by procollagen C-endopeptidase.might get lost (downstream of altered splice site)
12191464PROPEPC-terminal propeptide. /FTId=PRO_0000005721.might get lost (downstream of altered splice site)
12291464DOMAINFibrillar collagen NC1.might get lost (downstream of altered splice site)
12591259DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12591259DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12651265DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12651265DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12821282DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12821282DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12911291DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12911291DISULFIDInterchain (By similarity).might get lost (downstream of altered splice site)
12991299DISULFIDBy similarity.might get lost (downstream of altered splice site)
13291329CONFLICTS -> T (in Ref. 25; AAB27856).might get lost (downstream of altered splice site)
13651365CARBOHYDN-linked (GlcNAc...).might get lost (downstream of altered splice site)
13701370DISULFIDBy similarity.might get lost (downstream of altered splice site)
14151415DISULFIDBy similarity.might get lost (downstream of altered splice site)
14621462DISULFIDBy similarity.might get lost (downstream of altered splice site)
length of protein normal
AA sequence altered yes
position of stopcodon in wt / mu CDS 4395 / 4395
position (AA) of stopcodon in wt / mu AA sequence 1465 / 1465
position of stopcodon in wt / mu cDNA 4514 / 4514
poly(A) signal N/A
conservation
nucleotide level for all changes - no scoring up to now
N/A
position of start ATG in wt / mu cDNA 120 / 120
chromosome 17
strand -1
last intron/exon boundary 4368
theoretical NMD boundary in CDS 4198
length of CDS 4395
coding sequence (CDS) position 661
cDNA position
(for ins/del: last normal base / first normal base)
780
gDNA position
(for ins/del: last normal base / first normal base)
3866
chromosomal position
(for ins/del: last normal base / first normal base)
48275128
original gDNA sequence snippet AGGGTCCCATGGGTCCCCGAGGTCCCCCAGGTCCCCCTGGA
altered gDNA sequence snippet AGGGTCCCATGGGTCCCCGATGTCCCCCAGGTCCCCCTGGA
original cDNA sequence snippet CAGGTCCCATGGGTCCCCGAGGTCCCCCAGGTCCCCCTGGA
altered cDNA sequence snippet CAGGTCCCATGGGTCCCCGATGTCCCCCAGGTCCCCCTGGA
wildtype AA sequence MFSFVDLRLL LLLAATALLT HGQEEGQVEG QDEDIPPITC VQNGLRYHDR DVWKPEPCRI
CVCDNGKVLC DDVICDETKN CPGAEVPEGE CCPVCPDGSE SPTDQETTGV EGPKGDTGPR
GPRGPAGPPG RDGIPGQPGL PGPPGPPGPP GPPGLGGNFA PQLSYGYDEK STGGISVPGP
MGPSGPRGLP GPPGAPGPQG FQGPPGEPGE PGASGPMGPR GPPGPPGKNG DDGEAGKPGR
PGERGPPGPQ GARGLPGTAG LPGMKGHRGF SGLDGAKGDA GPAGPKGEPG SPGENGAPGQ
MGPRGLPGER GRPGAPGPAG ARGNDGATGA AGPPGPTGPA GPPGFPGAVG AKGEAGPQGP
RGSEGPQGVR GEPGPPGPAG AAGPAGNPGA DGQPGAKGAN GAPGIAGAPG FPGARGPSGP
QGPGGPPGPK GNSGEPGAPG SKGDTGAKGE PGPVGVQGPP GPAGEEGKRG ARGEPGPTGL
PGPPGERGGP GSRGFPGADG VAGPKGPAGE RGSPGPAGPK GSPGEAGRPG EAGLPGAKGL
TGSPGSPGPD GKTGPPGPAG QDGRPGPPGP PGARGQAGVM GFPGPKGAAG EPGKAGERGV
PGPPGAVGPA GKDGEAGAQG PPGPAGPAGE RGEQGPAGSP GFQGLPGPAG PPGEAGKPGE
QGVPGDLGAP GPSGARGERG FPGERGVQGP PGPAGPRGAN GAPGNDGAKG DAGAPGAPGS
QGAPGLQGMP GERGAAGLPG PKGDRGDAGP KGADGSPGKD GVRGLTGPIG PPGPAGAPGD
KGESGPSGPA GPTGARGAPG DRGEPGPPGP AGFAGPPGAD GQPGAKGEPG DAGAKGDAGP
PGPAGPAGPP GPIGNVGAPG AKGARGSAGP PGATGFPGAA GRVGPPGPSG NAGPPGPPGP
AGKEGGKGPR GETGPAGRPG EVGPPGPPGP AGEKGSPGAD GPAGAPGTPG PQGIAGQRGV
VGLPGQRGER GFPGLPGPSG EPGKQGPSGA SGERGPPGPM GPPGLAGPPG ESGREGAPGA
EGSPGRDGSP GAKGDRGETG PAGPPGAPGA PGAPGPVGPA GKSGDRGETG PAGPTGPVGP
VGARGPAGPQ GPRGDKGETG EQGDRGIKGH RGFSGLQGPP GPPGSPGEQG PSGASGPAGP
RGPPGSAGAP GKDGLNGLPG PIGPPGPRGR TGDAGPVGPP GPPGPPGPPG PPSAGFDFSF
LPQPPQEKAH DGGRYYRADD ANVVRDRDLE VDTTLKSLSQ QIENIRSPEG SRKNPARTCR
DLKMCHSDWK SGEYWIDPNQ GCNLDAIKVF CNMETGETCV YPTQPSVAQK NWYISKNPKD
KRHVWFGESM TDGFQFEYGG QGSDPADVAI QLTFLRLMST EASQNITYHC KNSVAYMDQQ
TGNLKKALLL QGSNEIEIRA EGNSRFTYSV TVDGCTSHTG AWGKTVIEYK TTKTSRLPII
DVAPLDVGAP DQEFGFDVGP VCFL*
mutated AA sequence MFSFVDLRLL LLLAATALLT HGQEEGQVEG QDEDIPPITC VQNGLRYHDR DVWKPEPCRI
CVCDNGKVLC DDVICDETKN CPGAEVPEGE CCPVCPDGSE SPTDQETTGV EGPKGDTGPR
GPRGPAGPPG RDGIPGQPGL PGPPGPPGPP GPPGLGGNFA PQLSYGYDEK STGGISVPGP
MGPSGPRGLP GPPGAPGPQG FQGPPGEPGE PGASGPMGPR CPPGPPGKNG DDGEAGKPGR
PGERGPPGPQ GARGLPGTAG LPGMKGHRGF SGLDGAKGDA GPAGPKGEPG SPGENGAPGQ
MGPRGLPGER GRPGAPGPAG ARGNDGATGA AGPPGPTGPA GPPGFPGAVG AKGEAGPQGP
RGSEGPQGVR GEPGPPGPAG AAGPAGNPGA DGQPGAKGAN GAPGIAGAPG FPGARGPSGP
QGPGGPPGPK GNSGEPGAPG SKGDTGAKGE PGPVGVQGPP GPAGEEGKRG ARGEPGPTGL
PGPPGERGGP GSRGFPGADG VAGPKGPAGE RGSPGPAGPK GSPGEAGRPG EAGLPGAKGL
TGSPGSPGPD GKTGPPGPAG QDGRPGPPGP PGARGQAGVM GFPGPKGAAG EPGKAGERGV
PGPPGAVGPA GKDGEAGAQG PPGPAGPAGE RGEQGPAGSP GFQGLPGPAG PPGEAGKPGE
QGVPGDLGAP GPSGARGERG FPGERGVQGP PGPAGPRGAN GAPGNDGAKG DAGAPGAPGS
QGAPGLQGMP GERGAAGLPG PKGDRGDAGP KGADGSPGKD GVRGLTGPIG PPGPAGAPGD
KGESGPSGPA GPTGARGAPG DRGEPGPPGP AGFAGPPGAD GQPGAKGEPG DAGAKGDAGP
PGPAGPAGPP GPIGNVGAPG AKGARGSAGP PGATGFPGAA GRVGPPGPSG NAGPPGPPGP
AGKEGGKGPR GETGPAGRPG EVGPPGPPGP AGEKGSPGAD GPAGAPGTPG PQGIAGQRGV
VGLPGQRGER GFPGLPGPSG EPGKQGPSGA SGERGPPGPM GPPGLAGPPG ESGREGAPGA
EGSPGRDGSP GAKGDRGETG PAGPPGAPGA PGAPGPVGPA GKSGDRGETG PAGPTGPVGP
VGARGPAGPQ GPRGDKGETG EQGDRGIKGH RGFSGLQGPP GPPGSPGEQG PSGASGPAGP
RGPPGSAGAP GKDGLNGLPG PIGPPGPRGR TGDAGPVGPP GPPGPPGPPG PPSAGFDFSF
LPQPPQEKAH DGGRYYRADD ANVVRDRDLE VDTTLKSLSQ QIENIRSPEG SRKNPARTCR
DLKMCHSDWK SGEYWIDPNQ GCNLDAIKVF CNMETGETCV YPTQPSVAQK NWYISKNPKD
KRHVWFGESM TDGFQFEYGG QGSDPADVAI QLTFLRLMST EASQNITYHC KNSVAYMDQQ
TGNLKKALLL QGSNEIEIRA EGNSRFTYSV TVDGCTSHTG AWGKTVIEYK TTKTSRLPII
DVAPLDVGAP DQEFGFDVGP VCFL*
speed 1.28 s
All positions are in basepairs (bp) if not explicitly stated differently.
AA/aa: amino acid; CDS: coding sequence; mu: mutated; NMD: nonsense-mediated mRNA decay; nt: nucleotide; wt: wildtype; TGP: 1000 Genomes Project
back to results table

Problems