Table 1 Statistics for Arabidopsis reannotation Release 5.

From: Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release

  Chr. 1 Chr. 2 Chr. 3 Chr. 4 Chr. 5 Total
DNA molecules       
Length (Mb) 30.269 19.702 23.465 18.582 26.978 118.998
%GC       
overall 35.9 35.9 36.3 36.2 35.9 36.0
coding 44.1 44.2 44.3 44.2 44.1 44.2
intronic 32.4 32.3 32.6 32.4 32.3 32.4
intergenic 30.8 31.4 31.6 31.6 31.1 31.2
Genes       
# genes 6,772 4,104 5,233 3,985 6,113 26,207
gene density (kb/gene) 4.47 4.80 4.48 4.66 4.41 4.5
Avg. gene length (bp)a 2,287 2,156 2,197 2,269 2,227 2,232
Avg. protein length (aa) 425 398 417 421 419 417
# genes in protein families 4,834 2,884 3,803 2,839 4,281 18,641
# genes duplicated via segmental chromosome duplications 1,868 961 1,315 1,147 1,291 6,582
# genes found tandemly duplicated 993 545 750 636 813 3,737
# genes with alt splicing isoforms 600 412 444 357 517 2,330
# genes with annotated UTRs 4,717 2,936 3,575 2,724 4,147 18,099
# transposons and pseudogenes 748 817 837 652 732 3,786
# tRNA genes 240 96 93 79 123 631
Exons       
# exons 37,710 21,428 27,937 21,800 33,255 142,130
total length (Mb) 10.378 5.919 7.812 6.011 9.170 39.290
avg exons/gene 5.57 5.22 5.34 5.47 5.44 5.42
avg exon size (bp) 275 276 280 276 276 276
Introns       
# introns 30,938 17,324 22,704 17,814 27,191 115,921
total length (Mb) 5.060 2.903 3.657 3.016 4.416 19.053
avg size (bp) 164 168 161 169 163 164
Proteome       
# distinct proteins 7,176 4,451 5,540 4,231 6,457 27,855
# proteins with interpro domains 6,142 3,686 4,676 3,573 5,441 23,518
# with TM domain 2,047 1,429 1,599 1,316 1,768 8,159
Signal peptides       
secretory 1,262 797 974 773 1,103 4,909
chloroplast 1,062 681 845 666 1,021 4,275
mitochondria 820 490 612 430 736 3,088
a Length of genomic sequence from annotated transcriptional start to stop.
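
The derived rows in the table can be cross-checked against the primary counts. The following is a minimal sketch, assuming that gene density is chromosome length divided by gene count and that avg exons/gene is exon count divided by gene count; these formulas are not stated in the table itself, and the variable names are illustrative. The input values are copied from the Chr. 1 to Chr. 5 columns.

```python
# Values copied from Table 1, columns Chr. 1 through Chr. 5.
length_mb = [30.269, 19.702, 23.465, 18.582, 26.978]
n_genes = [6772, 4104, 5233, 3985, 6113]
n_exons = [37710, 21428, 27937, 21800, 33255]

# Assumed formula: gene density (kb/gene) = length in kb / number of genes.
density = [round(mb * 1000 / g, 2) for mb, g in zip(length_mb, n_genes)]

# Assumed formula: avg exons/gene = number of exons / number of genes.
exons_per_gene = [round(e / g, 2) for e, g in zip(n_exons, n_genes)]

# The Total column for gene counts should equal the per-chromosome sum.
total_genes = sum(n_genes)

for chrom, (d, x) in enumerate(zip(density, exons_per_gene), start=1):
    print(f"Chr. {chrom}: {d:.2f} kb/gene, {x:.2f} exons/gene")
print(f"Total genes: {total_genes}")
```

Under these assumptions the computed values agree with the gene density and avg exons/gene rows as printed, and the gene counts sum to the reported total.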