ntCard

Streaming algorithm to estimate cardinality in genomics datasets
  https://github.com/bcgsc/ntCard

ntCard takes one or more files in FASTA, FASTQ, SAM, or BAM format as input and estimates the total number of distinct k-mers, F0, together with the k-mer coverage frequency histogram, f_i for i >= 1, where f_i is the number of distinct k-mers that occur exactly i times.
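
To make the two quantities concrete, here is a minimal Python sketch that computes F0 and f_i exactly by counting every k-mer in memory. This is not ntCard's streaming algorithm and not its API: the function name kmer_histogram and the toy sequences are hypothetical, and the sketch ignores reverse complements and file parsing for simplicity. It only illustrates what the estimated values mean.

```python
from collections import Counter


def kmer_histogram(sequences, k):
    """Exact (non-streaming) illustration of F0 and f_i.

    Returns (f0, hist), where f0 is the number of distinct k-mers and
    hist[i] is the number of distinct k-mers seen exactly i times.
    Hypothetical helper for illustration; ntCard estimates these values
    without storing every k-mer.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of length k across the sequence.
        for start in range(len(seq) - k + 1):
            counts[seq[start:start + k]] += 1
    # Histogram of multiplicities: how many distinct k-mers occur i times.
    hist = dict(Counter(counts.values()))
    return len(counts), hist


# Toy input standing in for reads from a FASTA/FASTQ file.
f0, hist = kmer_histogram(["ACGTACGT", "CGTA"], k=3)
print(f0)    # number of distinct 3-mers
print(hist)  # maps multiplicity i to f_i
```

By construction the f_i values sum to F0, which is a quick sanity check on any histogram of this kind. Exact counting like this needs memory proportional to the number of distinct k-mers, which is the cost a streaming estimator is meant to avoid.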