mercier created page: home

Celine Mercier
2015-07-17 13:58:16 +02:00
parent 6befaf99f7
commit 3a24e83a5f

@@ -1,5 +1,5 @@
With the development of next-generation sequencing, efficient tools are needed to handle millions of sequences in reasonable amounts of time.
**Sumaclust** is a program developed by the LECA.
**Sumaclust** is a program developed by the [LECA](http://www-leca.ujf-grenoble.fr/?lang=en).
**Sumaclust** aims to cluster sequences in a way that is fast and exact at the same time. This tool has been developed to be adapted to the type of data generated by DNA metabarcoding, i.e. entirely sequenced, short markers.
**Sumaclust** clusters sequences using the same clustering algorithm as **UCLUST** and **CD-HIT**. This algorithm is mainly useful to detect the 'erroneous' sequences created during amplification and sequencing protocols, deriving from 'true' sequences.
Currently, **Sumaclust** is available as a program that you can download and install on Unix-like machines.