Toolkit for processing sequences in FASTA/Q formats
Notable changes:
Improvement: faster FASTX parsing (#123)
New feature: added the telo
command to output telomere regions.
New feature: added the size
command to count the number of sequences and
the number of bases. Lighter and thus faster than comp
.
New feature: added the hpc
command to compress homopolymers in input
sequences.
New feature: added the split
command to split a large input file into
multiple smaller files.
New feature: added the gap
command to output non-ACGT regions in the input
file.
New feature: added option -s
to command subseq
to support the strand
field in BED. For the moment, this option does not work with other subseq
options.
(1.4: 19 May 2023, r122)
Updated kseq.h to the latest klib version for enhanced error detection. Made kseq aware of fasta or fastq during parsing. However, most seqtk commands are not taking the advantage of this feature for now. (#109 and #110).
Check file opening errors.
Added option -F
to the seq command to convert FASTA to FASTQ with fixed base quality.
Return non-zero code upon errors (#114). Force trimfq to output FASTQ.
Allow to sample a single read (#111).
Added make install BINDIR=/path/to/bin
(#103).
This release fixed a bug in subseq -L
.
Release seqtk-1.1-r91