31 Commits

Author SHA1 Message Date
Eric Coissac
b49aba9c09 Implémentation du filtrage unique basé sur séquence et catégories
Ajout d'une fonctionnalité pour le filtrage unique qui prend en compte à la fois la séquence et les catégories.

- Modification de la fonction ISequenceChunk pour accepter un classifieur unique optionnel
- Implémentation du traitement unique sur disque en utilisant un classifieur composite
- Mise à jour du classifieur utilisé pour le tri sur disque
- Correction de la gestion des clés de unicité en utilisant le code et la valeur du classifieur
- Mise à jour du numéro de commit
2026-01-14 19:18:17 +01:00
Eric Coissac
0678181023 Refactor chunk processing and update version commit
Optimize chunk processing by moving variable declarations inside the loop and update the commit hash in version.go to reflect the latest changes.
2026-01-14 18:46:04 +01:00
Eric Coissac
ac0d3f3fe4 Update obiuniq for very large dataset 2025-12-18 14:11:11 +01:00
Eric Coissac
7f4ebe757e Bug obiuniq - don't clean the chunks 2025-10-28 13:50:22 +01:00
Eric Coissac
04f3af3e60 some renaming of functions 2025-08-06 15:54:50 -04:00
Eric Coissac
f239e8da92 Rename ISequenceChunk 2025-08-05 08:49:45 -04:00
Eric Coissac
7c4042df6b introduce obidefault 2025-01-27 17:12:45 +01:00
Eric Coissac
40fb4e9767 reduce the memory impact of obiuniq. 2024-11-27 13:30:16 +01:00
Eric Coissac
241f2286f2 remove the slice pool management 2024-09-24 16:31:30 +02:00
Eric Coissac
1b1cd41fd3 Add some code refactoring from the blackboard branch 2024-08-02 12:35:46 +02:00
Eric Coissac
67665a6b40 Xprize update
Former-commit-id: d38919a897961e4d40da3b844057c3fb94fdb6d7
2024-07-25 18:09:03 -04:00
Eric Coissac
54a138196c Patch a bug in fasta and fastq reading
Former-commit-id: bcaa264b4c4a7c67617eb909b199176bf09913db
2024-06-21 14:28:57 +02:00
a36e830de3 homogenize worker count
Former-commit-id: 4352ac6694c08023cdf0e090723fac8af655cea4
2023-12-20 08:14:07 +01:00
8d77cc4133 Change path of the obitools pkg
Former-commit-id: 311cbf8df3b990b393c6f4885d62e74564423b65
2023-11-29 12:14:37 +01:00
2e0c1bd801 Correct the number of workers
Former-commit-id: febbccfb853263e0761ecfccb0f09c8c1bf88475
2023-11-22 09:46:30 +01:00
988ae79989 Optimize memory allocation of the apat algorithms
Former-commit-id: 5010c5a666b322715b3b81c1078d325e1f647ede
2023-03-28 19:37:05 +07:00
d88de15cdc Refactoring codes for removing buffer size options. An some other changes...
Former-commit-id: 10b57cc1a27446ade3c444217341e9651e89cdce
2023-03-07 11:12:13 +07:00
072b85e155 change the model for representing paired reads and extend its usage to other commands 2023-02-23 23:35:58 +01:00
f97f92df72 rename the iterator class 2023-01-22 22:04:17 +01:00
29563aa94e Rename the Length methods Len to follow GO standart 2022-11-17 11:09:58 +01:00
eb32089305 Refactoring related to iterators 2022-11-16 17:13:40 +01:00
ad4a861261 Change some comments 2022-08-21 14:47:22 +02:00
cf5b4baa54 Small bug in sequence counting on merge 2022-05-30 16:28:59 +02:00
f14860a486 Patch header parting and formatiing 2022-05-27 11:53:29 +03:00
abcf02e488 Start to use leveled log 2022-02-24 12:14:52 +01:00
eaf65fbcce Some code refactoring, a new version of obiuniq more efficient in memory and a first make file allowing to build obitools 2022-02-24 07:08:40 +01:00
2e7c1834b0 Big change iin the data model, and a first version of obiuniq 2022-02-21 19:00:23 +01:00
9737f97084 before big changes 2022-02-18 22:53:09 +01:00
2636882f9f big bug on subschunk 2022-02-18 09:58:08 +01:00
ce226acac0 Few debug on obidistribute and some progresses on obiunique, but -c and -m options are still not working 2022-02-15 10:49:01 +01:00
3586ecc483 second version of obidistribute and a first buggy version of obiuniq 2022-02-15 00:47:02 +01:00