Files
obitools4/pkg
Eric Coissac af7ae3d60c Correct Shannon entropy bias for canonical k-mers
Multiple raw k-mers collapsing into identical circular canonical forms introduce bias into complexity estimates. This change pre-computes `log(class_size)` tables and per-word-size maximum entropy bounds. The `KmerEntropy` function and `KmerEntropyFilter` are updated to apply the corrected formula `(log(N) + Σf·log(s) - Σf·log(f))/N / emax`, ensuring accurate sequence complexity estimation.
2026-05-17 14:54:57 +08:00
..
2025-11-06 15:43:13 +01:00
2024-05-14 08:16:12 +02:00
2025-01-27 17:23:07 +01:00
2026-04-14 14:48:29 +02:00
2025-06-17 09:06:42 +02:00
2025-06-04 09:48:10 +02:00
2023-12-05 15:28:29 +01:00