
Table 4 Assembly statistics for the transcriptome

From: Transcriptome, proteome and draft genome of Euglena gracilis

| Statistic | Transcripts | Coding sequence (CDS) | Proteins |
| --- | --- | --- | --- |
| Number of sequences (proteins) | 72,509 | 36,526 | 36,526 |
| Median length | 540 | 765 | 254 |
| Mean length | 869 | 1,041 | 346 |
| Max length | 25,763 | 25,218 | 8,406 |
| Min length | 202 | 297 | 98 |
| No. sequences > 1 kbp (proteins > 1 kaa) | 19,765 | 13,991 | 1,290 |
| No. sequences > 10 kbp | 25 | 24 | |
| No. sequences > 100 kbp | 0 | | |
| No. gaps | 0 | | |
| Bases in gaps | 0 | | |
| N50 | 1,242 | 1,413 | 471 |
| Combined sequence length | 63,050,794 | 38,030,668 | |

Lengths are in bp for transcripts and CDS and in amino acids (aa) for proteins. Empty cells were not reported in the original table.
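The table does not say how its N50 values were computed. As a rough illustration, the sketch below assumes the common definition: the length L such that sequences of length L or longer account for at least half of the combined sequence length. The function names `n50` and `assembly_stats` and the toy lengths are hypothetical and are not taken from the study.

```python
from statistics import mean, median


def n50(lengths):
    """Return the N50 of a collection of sequence lengths (common definition, assumed)."""
    if not lengths:
        raise ValueError("lengths must be non-empty")
    total = sum(lengths)
    running = 0
    # Walk from longest to shortest until half the combined length is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def assembly_stats(lengths, threshold=1000):
    """Summarize lengths with the same kinds of rows that the table reports."""
    return {
        "count": len(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "max": max(lengths),
        "min": min(lengths),
        # Strictly greater than the threshold, matching the "> 1 kbp" style rows.
        "over_threshold": sum(1 for n in lengths if n > threshold),
        "n50": n50(lengths),
        "combined_length": sum(lengths),
    }


# Toy lengths for illustration only; these are not values from the study.
toy_lengths = [300, 400, 600, 1100, 1600]
stats = assembly_stats(toy_lengths)
```

For real data, the lengths would come from the assembled FASTA records. As a quick consistency check on the table itself, the number of CDS multiplied by the mean CDS length (36,526 × 1,041 = 38,023,566) is close to the reported combined length of 38,030,668, and the small difference is what rounding of the mean would produce.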
