HsaINT0038487 @ hg38
Intron Retention
Gene
ENSG00000101203 | COL20A1
Description
collagen type XX alpha 1 chain [Source:HGNC Symbol;Acc:HGNC:14670]
Coordinates
chr20:63322058-63325494:+
Coord C1 exon
chr20:63322058-63322111
Coord A exon
chr20:63322112-63325440
Coord C2 exon
chr20:63325441-63325494
Length
3329 bp
Sequences
Splice sites
5' ss Seq
AGGGTAAGT
5' ss Score
10.45
3' ss Seq
AGCCCATCTTCCCCCTCCAGGGT
3' ss Score
9.58
Exon sequences
Seq C1 exon
GGCCTCCCTGGGAGGAATGGCACCCCAGGAGAGCAGGGCTTCCCAGGGCCCAGG
Seq A exon
GTAAGTTTTGGGGAGCCCTGGGAGGTGAGGGGCCTGGATCGTACCTACCAGAAGAGAACACCCCTCCCAGTGCATTCCTGAGAGTGCAGGGAGCTGCACGCAGCTTCCCTAGGCAGGGTGGGGTCTCCCCAAGCTGCAGACAGGCAGGGACCCCAGCACCCCAGGAGTCCAGGGGGTTTAGGCTGGCCTCCCCCATGTGCCTGCAGGGCGCCTGGCCCAGGCCTGGTTATGGTGCAAGTCCCCTTTTCACCCAGATAGGGAGGATGAACCCCATGGCCCAGCAGGGGAATCTGAGGCTTGGGGAGGTGAGGCCACCTGTCCTATTGGCCCTGAAGGCCGCCCTTGATTTCCATGGGGCAGGCATGAGCCAGATGCCTTCTGTGTGCCGGGCTGGGCCCAGGAGCTTAGGCCCACAGAGTATTTTCAGATGTCTCTAGGCTCAGCGCATGCGCGGCGTGCAGGGAGGTGTTTCGGGGGCGGAGGTTCTGGTTCTGCGGAGAGGGATGTTGAAGACAGCGCGCCCAGCGATGGCAGCAGCCCCCCACCCCAGGCGCCAAAGGGCCGCCCTCCAGGCTCGATTCTAAACATTTACCTAAACCACGGTACTGCACAAGCGTGACACAGATGCCGAAGGCGTGGTCGGCATGCAGCAAGGCCGCCCTCCTCCCGACCCCGACACCCACCTCTCCCCCGGCAGCCTTCCAGCGAGTTCCACATCCCCGTGAGCAAAGACGTGCGCCTGGCTCTGGCGGGGCTGTCCTTTTGCACCTGCGCTGCCATATTCCCTTTGTCTGCGGTTGTGCTGGAGCCTCAGAGCACATCTCAGGCACAGAGTCCCTAGGACAGTCGCTGGGTGGAAGGGCACACGCCCCTGTGCTTTGGGGTGGATGTCCCCACACTGGCCCGGCAGAGCCGTGCGAGGCGCTCCCCCGCGACGAGCCAGCGGTGACCTCCCACCCCCATCTTGCATTCCTGGCTGCTTTGGGAGCCGCGTCTGATAGATGAAAATGCATTTCTCTTCCAAGCAGCGTGAATGCCCTTTGAAAGTTGCCAATATTTTCTTGCTGGAGATGTCCTCTGACTTCTTCTCCACCGTTCTTTTTTCTTAATAATTTGTGGGCACGCTTTTTACTTGCTGGAACCAGCCCTGTGTCTGTGATGTGAGTTTTGACTGCGTTTGGCATTTTCTTGTCTTTTGACCTTGTTTGCGACGTGCGTTTCCATACAGATTTTAAAATAACTTTTACGTAGTCCATGTTACTGCTCTGTTCTATGGCTTCTGAATTTTGTATCCTGTTTTAAAAGATCTTTCTTATTCCGAGAGTAGAAATCGCGCCTCCTCCTGGTTCCTCCTGGTGCTTCCAGGCACTTTTCGACACTCCCAGGTGGATTCGTCTAGAGCTGATTTTGTTACAGCACTGGGGAGGACAGAGGACGGGGAGGACTCCAGCTTCACCTCTTCACAGACCACCCCTGGTCGCCTGAGTCGTTCATTACACAGCCAGCCCCTTGCAGAAGTTGAACGCCCTGGCTCTCGTGCTGTCCTAAGGCCCCTGTGTATTTTGGGGTTTTTCAGATGCTCCAGTCTGTCCCGTTGATGTGTCTGTTGTTCACAGGCGAGGGCTGTGCTGCTTAAATTATTACAGATTTCTTTAAGATGTCGATGCCAGATGGGCTGGTCCTCTTTTGTATTATTCCCCCAGAATTTTCCTGGCTCTCTTTGCTTATTTCATTGTTCAGTGAACTGTAGAATCGACTTATCCGTTTAACCTAAACAATTCAGTTGGTGTTTTTACTGGAACTGCATTCGCTTTATAGATATACTCGGGAACAGATGATGTCTTCATGATGTTGAATCTTCTCATCCAAAAACACAGCCATCCTGCCTTTTGTTCACAGGGCCTGTTGTGGCCCTCAGTACAGTGTGGGGATGGTCTCTTCCTACAGACCCACTCGGTTCTCCTTCATCCCTACGTATGTCCTCTTTTTTGTTGCTATTATAAGTGAGGTGTTTTCTCTCATTACAGCTTCTACCTCATTATTTTCACACTGGCCGTGACCGCTGTGTATGACTTTTGTACCCAGTTTTCTTTCTCAATTATCTTCTTTGTGGTTTTCTTGGATTTTCCAGGTTTACAGTGACATCACCTGCAAATAAACACTTCACCTTCGGCTTTTCAGTCTTGATCACTCAAATTTCTTTTTCTTGTCTAATGGTGTTGGTGAATACTCCAGAATTTTGTCAGAGAACAGTGGCAGTAGCAGATACTCTCATTTTCTTCTTGGCCTAACAGAAGTGCTTCTAGCATTTGACCATTAAAGTGCTGTCCTGGAAGCGGAGACAGGTGCATTTTGTCATGTGGTAGTGAAGTCCATCCACTCTTCTTTTATTAGATTTAAAACACCAAGAATGCACATTAAACAAGCTCTAGTGCTGTTATGGACATAATGAAATGACTGTGCGTCTTGGAGCTATTAATTGTAGGATATCCCTAAATACTGGATGATTGCTGCATCCGTCGAAAATCTGACGTGGTCGCTGAATTCTCGTTATCCATGCCATGTGCTGCAGAGCTCAGCTTGCTGACATTTTGTTTTGGATTTCGCACTGATGCTTCTAAATAAGACTGTCACAGTTTTCTTTATAGTGTTTGGAACCTCAGTTTGCTTTGTTCAAAAGATAATTTGAGTTTAAATAGCAGCAGTGTCCTCTGCCCCACAGGGTTGGATTTCCCATAAAACCTGAGTCTAGTGTTTTGGGGTCCAGGGCTTAAGAAAGTTTTCTGTCTCTTCCATACAGTTGGCCTGTGGGGGTCTCTGGCTTTTCCTCGGTGATTTCTGCAGCGCCCCCCACACCCCGTTAGAACCCAGAGTTGGGCAGGGGAGGGCTGGGGGTGAGGTTGGTTGGAACTGGCTTCACAGGCTTGGCTCTGCCATTCTCTAGCTCTGCAGACCCCTGCTGTGGGTTCTCCCAGACCCTCCCAGGGTGAGCAGTGAGGGTGCTGGGGCTGCCCCTGCTCCTTTAATGGAGGTGTCTCCATGTCGGTCTGGGTCTGAATGCCCAGAGGGCTGGGAGGGCTGGCTGTGACCCAGAGGGGCCACAGGAGGGGTGGCAAGCGTGACCTTGTCAGGCCCTACCCGCTGCCTGTGTCTCCAGGGAGAGCCCGGGCCACCCGGACAGATGGGACCAGAAGGTCCTGGAGGCCAGCAGGGCTCGCCGGGGACCCAGGGCCGTGCAGTCCAGGGGCCTGTGGTAGGTGTCACTCCTTCCCTGCCCTCCTGCCCTGTGCCCCCTCCGCTTCGCTGTCCAGCCCATCTTCCCCCTCCAG
Seq C2 exon
GGTCCACCAGGGGTCAAAGGAGAGAAGGGAGACCATGGGCTTCCAGGCTTGCAG
VastDB Features
Vast-tools module Information
Secondary ID
ENSG00000101203:ENST00000422202:26
Average complexity
IR
Mappability confidence:
NA
Protein Impact
ORF disruption upon sequence inclusion
No structure available
Features
Disorder rate (Iupred):
C1=1.000 A=NA C2=1.000
Domain overlap (PFAM):
C1:
PF0139113=Collagen=FE(22.4=100)
A:
NA
C2:
PF0139113=Collagen=FE(28.8=100),PF0139113=Collagen=PU(21.6=61.1)
Main Inclusion Isoform:
NA

Associated events
Other assemblies
Conservation
Rat
(rn6)
No conservation detected
Zebrafish
(danRer10)
No conservation detected
Fruitfly
(dm6)
No conservation detected
Primers PCR
Suggestions for RT-PCR validation
F:
No suggested primer sequences
R:
No suggested primer sequences
Band lengths:
Functional annotations
There are 0 annotated functions for this event
GENOMIC CONTEXT[edit]
INCLUSION PATTERN[edit]
SPECIAL DATASETS
- Autistic and control brains
- Pre-implantation embryo development