Table 4 Assembly statistics for the transcriptome

From: Transcriptome, proteome and draft genome of Euglena gracilis

| Transcripts | | Coding sequence (CDS) | | Proteins | |
|---|---|---|---|---|---|
| Number of sequences | 72,509 | Number of sequences | 36,526 | Number of proteins | 36,526 |
| Median sequence length | 540 | Median sequence length | 765 | Median protein length | 254 |
| Mean sequence length | 869 | Mean sequence length | 1041 | Mean protein length | 346 |
| Max sequence length | 25,763 | Max sequence length | 25,218 | Max protein length | 8406 |
| Min sequence length | 202 | Min sequence length | 297 | Min protein length | 98 |
| No. sequences > 1 kbp | 19,765 | No. sequences > 1 kbp | 13,991 | No. proteins > 1 kaa | 1290 |
| No. sequences > 10 kbp | 25 | No. sequences > 10 kbp | 24 | N50 | 471 |
| No. sequences > 100 kbp | 0 | N50 | 1413 | | |
| No. gaps | 0 | Combined sequence length | 38,030,668 | | |
| Bases in gaps | 0 | | | | |
| N50 | 1242 | | | | |
| Combined sequence length | 63,050,794 | | | | |