Table 4 Comparison of different compression methods from the original single-read file (SRR1536586) and paired-end read file (SRR922270) in FASTQ format
File Size (in Bytes)
MethodsSRR1536586SRR922270_1SRR922270_2
FASTQ1,604,183,3482,647,494,3602,647,494,360
GZIP299,347,123441,010,173466,626,719
LFQC101,191,680159,732,224174,810,624
FASTQ+a119,596,093493,950,425493,950,425
GZIP+FASTQ+35,078,888130,605,692130,546,296
LFQC+FASTQ+16,506,88061,696,00062,689,280
  • SRR1536586 and SRR922270 are SRA file IDs in NCBI SRA database.

  • a After converting FASTAQ+ format, the quality score for an entry such as SeqID_200 is the mean for the 200 reads and not for individual sequences.