obikmer

Author	SHA1	Message	Date
Eric Coissac	1cd7916e06	refactor: replace rayon with NUMA-aware PartitionRunner Replaces `rayon` parallel iteration across index, rebuild, reindex, and select modules with a custom `PartitionRunner`. This introduces NUMA-aware task distribution with CPU pinning and round-robin scheduling, eliminating `Arc`, `Mutex`, and atomic synchronization primitives in favor of a flat, pre-spawned worker architecture. Error handling is simplified via `.map_err()` and the `?` operator, while progress bar updates are decoupled into dedicated callbacks.	2026-06-15 18:29:04 +02:00
Eric Coissac	bc92dc4592	refactor: restructure partitioner with shared utilities and pipeline This commit restructures the partitioner crate by extracting shared utilities and the `ColBuilder` enum into a new `common` module. It introduces a multi-phase `graph_pipeline` for constructing and materializing De Bruijn graphs, replacing manual graph construction in `index_layer`, `merge_layer`, and `rebuild_layer`. All layer workflows now use centralized `build_graph` and `materialize_layer` abstractions, with standardized error context strings for improved diagnostics.	2026-06-15 14:08:16 +02:00
coissac	a9567ad023	Merge pull request 'Push rtnzuqxzmkon' (#31 ) from push-rtnzuqxzmkon into main Reviewed-on: #31	2026-06-15 09:40:35 +00:00
Eric Coissac	4a64718fd1	perf: replace partition processing with adaptive NUMA worker pool Replaces the previous partition processing logic with an adaptive, NUMA-aware multi-threaded worker pool that dynamically scales active threads based on real-time CPU efficiency. Introduces pre-spawned, CPU-pinned threads managed via crossbeam channels and Rayon to optimize memory bandwidth and core utilization. Adds a `max_workers()` accessor to aggregate maximum worker capacity across NUMA nodes and updates diagnostics to report active versus maximum worker counts.	2026-06-15 11:40:14 +02:00
Eric Coissac	7a87e911b6	feat: introduce NUMA-aware PartitionRunner for adaptive parallelism Replace NUMA-naive Rayon loops and ad-hoc adaptive pools with a unified `PartitionRunner` that manages a NUMA-aware worker pool. The implementation uses pinned Rayon thread pools per node and activates dormant threads based on real-time CPU efficiency metrics. This standardizes partition-level parallelism, optimizes memory locality, and eliminates cross-socket traffic. Includes architecture documentation and updates mkdocs navigation.	2026-06-15 11:34:41 +02:00
coissac	313d73838a	Merge pull request 'feat: add pipeline concurrency throttling and HPC build docs' (#30 ) from push-owwylwtskwzw into main Reviewed-on: #30	2026-06-15 08:33:41 +00:00
Eric Coissac	175ea5bbd0	feat: add pipeline concurrency throttling and HPC build docs Introduces a counting semaphore-based throttling mechanism to limit concurrent file I/O and pipeline processing. Replaces custom path wrappers with standardized `Throttled` types across `obikmer` and `obikpartitionner`, ensuring RAII-based resource cleanup and explicit backpressure. Additionally, documents how to redirect Cargo build artifacts to local scratch storage on HPC filesystems to prevent compilation slowdowns.	2026-06-15 10:33:23 +02:00
coissac	c6ea0c53e3	Merge pull request 'feat: implement NUMA-aware worker pools for merge command' (#29 ) from push-wusvurukprsr into main Reviewed-on: #29	2026-06-14 21:57:21 +00:00
Eric Coissac	ea767376bd	feat: implement NUMA-aware worker pools for merge command Replaces the global Rayon pool with per-NUMA-node thread pools that pin worker threads to their respective nodes, leveraging Linux first-touch allocation to reduce cross-NUMA memory contention and improve cache locality. Integrates the `hwlocality` crate with a vendored build, includes graceful fallbacks for single-socket or non-Linux systems, and updates dependency constraints. Also adds installation and architecture documentation, and corrects parallelism detection in the partitioner.	2026-06-14 23:56:52 +02:00
coissac	f1d76f3203	Merge pull request 'refactor(merge): extract adaptive worker spawn logic' (#28 ) from push-yzruqtyqvopm into main Reviewed-on: #28	2026-06-13 12:56:34 +00:00
Eric Coissac	c4071eb450	refactor(merge): extract adaptive worker spawn logic Centralize inline spawn checks into a `should_spawn_worker` function with adaptive thresholds. The first worker spawns at <95% CPU efficiency, while subsequent workers only trigger if marginal efficiency gain exceeds 25% of the expected `1/n_workers` (minimum 3%). Also increases the spawn poll interval from 10s to 20s.	2026-06-13 14:56:01 +02:00
coissac	817b02cbc1	Merge pull request 'Push zkspuxlpumpw' (#27 ) from push-zkspuxlpumpw into main Reviewed-on: #27	2026-06-13 11:25:12 +00:00
Eric Coissac	547cb72d76	refactor: Enforce Rayon parallelism and fix merge_layer concurrency Updated memory guidelines and feedback docs to explicitly classify intra-partition phases as parallel, correcting prior assumptions of sequential execution. Refactored merge_layer.rs to wrap column builders in Arc<Mutex<ColBuilder>> and use Arc::try_unwrap for safe concurrent access, eliminating race conditions and preventing double-closes during pass2.	2026-06-13 13:24:55 +02:00
Eric Coissac	6d85387077	feat: add performance instrumentation and dynamic worker scaling This change enhances observability and adaptability in the merge pipeline. Performance timing and debug logging are added to the De Bruijn graph and partition merge layers to track phase durations and pipeline metrics. The merge module replaces blocking receives with timed polls to sample CPU efficiency, dynamically spawning workers when utilization drops below a threshold. A new script is also introduced to parse merge debug logs and generate structured Markdown reports detailing throughput, phase breakdowns, and partition performance.	2026-06-13 13:21:53 +02:00
coissac	fb5b53dca9	Merge pull request 'Push ooxwzorvsqvy' (#26 ) from push-ooxwzorvsqvy into main Reviewed-on: #26	2026-06-13 09:59:07 +00:00
Eric Coissac	fddf630772	style: apply consistent formatting and whitespace normalization Applies consistent formatting, whitespace normalization, and indentation standardization to `debruijn.rs` and `merge.rs`. Reorganizes imports and downgrades a unitig traversal log from `info!` to `debug!`. No functional logic or runtime behavior is altered.	2026-06-13 11:58:20 +02:00
Eric Coissac	bc14346f5f	feat: add CPU-aware parallel worker pool for partition merging Introduce CpuSample to measure process-level CPU efficiency and wall-clock time. Use crossbeam-channel to distribute partition merging tasks to a dynamic worker pool that scales based on CPU utilization, capped at half the available cores. Update diagnostics to track pool usage.	2026-06-13 11:58:20 +02:00
Eric Coissac	fb8c6e427c	refactor: pass Unitig objects directly instead of raw byte slices Refactored `try_for_each_unitig` and related pipelines across `obidebruinj` and `obikpartitionner` to accept `Unitig` instances directly. This eliminates manual `Unitig::from_nucleotides()` conversions, simplifies the data flow, and reduces unnecessary allocation overhead.	2026-06-13 11:52:50 +02:00
Eric Coissac	1f336fe496	refactor: replace mutex with channels for parallel debruijn processing Add `rayon` and `crossbeam-channel` dependencies to support concurrent execution. Replace the synchronous, mutex-protected closure pattern with a channel-based producer-consumer approach using `std::thread::scope`. This decouples unitig iteration from processing, eliminating lock contention and `Mutex` overhead while enabling parallel workloads.	2026-06-13 11:49:27 +02:00
Eric Coissac	5f98d2ef96	refactor: replace explicit collect with Unitig::from_nucleotides Introduce a thread-local buffer to materialize nucleotide iterators into contiguous slices. Update `try_for_each_unitig` across the debruijn, index, merge, and rebuild layers to directly instantiate `Unitig` via `from_nucleotides()` instead of explicitly collecting iterators. This eliminates intermediate allocations and aligns test code with the new approach.	2026-06-13 11:47:06 +02:00
Eric Coissac	8b563d0804	refactor: migrate pipeline stages and improve graph processing Refactored neighbor resolution to explicitly track unvisited indices for degree-1 nodes, updated display formatting, and added timing and debug logging to the degree computation routine. Migrated pipeline stages from eager vector returns to explicit flat implementations, enabling backpressure-aware streaming, configurable batch processing, incremental yielding, and progress tracking via a delta channel.	2026-06-13 11:44:17 +02:00
coissac	7208dcbb4a	Merge pull request 'refactor: defer SrcLayerData lookups in RawBatch' (#25 ) from push-nxrynoorswrw into main Reviewed-on: #25	2026-06-12 20:19:21 +00:00
Eric Coissac	2e69b0b7fe	refactor: defer SrcLayerData lookups in RawBatch Replace eager resolution of `Vec<u32>` values with an `Arc<SrcLayerData>` handle passed alongside `Vec<CanonicalKmer>`. This shifts the lookup logic to the subsequent transform step, reducing memory overhead and enabling shared, thread-safe access to the source layer data.	2026-06-12 22:18:57 +02:00
coissac	9ea1dff5d6	Merge pull request 'Push rwqsmuvystym' (#24 ) from push-rwqsmuvystym into main Reviewed-on: #24	2026-06-12 19:33:20 +00:00
Eric Coissac	b2c8373586	refactor: parallelize merge layer with WorkerPool pipeline Replaces the synchronous sequential loop with a multi-threaded pipeline using `WorkerPool` and custom stage macros. Shared mutable state is wrapped in `Arc<Mutex<>>` for thread-safe updates, while pipeline errors are centralized via `Arc<Mutex<Option<String>>>` to propagate failures before thread join.	2026-06-12 21:32:53 +02:00
Eric Coissac	ba49af6f9e	refactor: parallelize merge and partition logic with obipipeline Introduce the `obipipeline` dependency and refactor merge and partition logic to leverage parallel execution. Update `merge_partitions` to use rayon with dynamic memory budgeting and concurrency control via a pilot run. Refactor Pass 1 to concurrently read unitigs, filter kmers through a shared `LayeredMap`, and populate the graph safely. Simplify diagnostics to report total kmer counts and replace manual flags with graph length validation.	2026-06-12 21:32:04 +02:00
Eric Coissac	2bc189e962	feat: dynamically compute seed expansion based on RSS Introduce a `peak_rss_bytes()` utility for accurate per-phase RAM measurement. Replace the genome-length heuristic with a dynamic seed expansion ratio based on actual RSS delta. Explicitly drop the `GraphDeBruijn` instance before MPHF construction to prevent resource contention and ensure proper memory management.	2026-06-12 16:39:38 +02:00
coissac	db9c604199	Merge pull request 'feat: enhance memory budgeting and add rebuild diagnostics' (#23 ) from push-nptzpkomspkv into main Reviewed-on: #23	2026-06-12 13:21:12 +00:00
Eric Coissac	52fd2cf801	feat: enhance memory budgeting and add rebuild diagnostics This commit improves memory management by respecting Linux cgroup v1/v2 limits and introduces a configurable memory budget for the new `rebuild` subcommand to prevent OOM during index reconstruction. The rebuild process now supports filtering, compaction, and parallelization. Diagnostic capabilities are expanded with debug-level tracing for partition merges, k-mer expansion tracking, and utility flags for label renaming, matrix size breakdowns, per-genome counts, and partition distribution reporting. Accessor methods for active and remaining memory have also been added to the stats struct.	2026-06-12 15:20:38 +02:00
coissac	97e3fb9761	Merge pull request 'Push ylnwstyzqwrt' (#22 ) from push-ylnwstyzqwrt into main Reviewed-on: #22	2026-06-12 10:10:03 +00:00
Eric Coissac	b5e027f23b	feat: add memory-aware parallel merge scheduling and CLI flags Introduces a memory-aware scheduling strategy for parallel partition merging that replaces unbounded concurrency with a First-Fit Decreasing approach gated by a thread-safe `MemoryBudget` semaphore. An adaptive expansion factor, seeded by a sequential pilot run, dynamically caps concurrent workers to prevent hashbrown OOMs. Adds a `--budget-fraction` CLI flag to configure RAM allocation, enhances the CLI to accept multiple indexes, and introduces comprehensive partition diagnostics including memory utilization tracking, concurrency metrics, and statistical summaries with ASCII histograms. Updates documentation and navigation accordingly.	2026-06-12 11:44:10 +02:00
Eric Coissac	f44fe042bc	feat: add parallel k-mer counting and stats CLI Introduces allocation-free `sum()` and `count_nonzero()` methods for compact integer vectors, extending the `ColumnWeights` trait with `partial_kmer_counts`. Adds parallel partition scanning to the k-mer index for computing per-genome distinct k-mer counts, and exposes a new `--stats` CLI flag to output these statistics as CSV.	2026-06-12 11:29:32 +02:00
coissac	94e0a370b3	Merge pull request 'Push tmpsxsztwpxl' (#21 ) from push-tmpsxsztwpxl into main Reviewed-on: #21	2026-06-09 13:31:25 +00:00
Eric Coissac	970460be42	refactor: rename rebuild subcommand to filter Rename the `rebuild` CLI subcommand to `filter` to better reflect its primary purpose of row-level selection and k-mer filtering. Update all associated CLI arguments, logging, error messages, and module registrations accordingly. Introduce a dedicated `Rebuild` subcommand for index compaction, fully decoupling it from the filtering logic. Also refine related documentation to align with the new naming and semantics.	2026-06-09 15:26:37 +02:00
Eric Coissac	e66adef23d	feat: add select command for genome column projection and aggregation Introduces the `select` CLI command to project and aggregate genome-level k-mer data by column. Adds `filter` as an alias for `rebuild`. The implementation uses parallel partition processing, supports metadata-driven grouping with configurable aggregation operators, and performs atomic in-place rewrites or filtered exports. Updates documentation and navigation accordingly.	2026-06-09 15:09:47 +02:00
coissac	b0dab452f6	Merge pull request 'refactor: optimize dump partition iteration and add progress tracking' (#20 ) from push-xqswlxlvmyrq into main Reviewed-on: #20	2026-06-09 09:34:13 +00:00
Eric Coissac	db730e9cf6	refactor: optimize dump partition iteration and add progress tracking Refactor partition iteration to support a generic `on_partition` callback executed after each parallel partition completes. Split the logic into bounded and unbounded paths; the bounded path uses an `AtomicUsize` to enforce row limits, while the unbounded path eliminates atomic contention to improve throughput. Additionally, integrate a progress bar into the dump command by passing an increment callback to `idx.dump()`, ensuring proper initialization and cleanup.	2026-06-09 11:07:48 +02:00
coissac	f65ecd19cc	Merge pull request 'Push lrwmyplxxzkn' (#19 ) from push-lrwmyplxxzkn into main Reviewed-on: #19	2026-06-09 08:28:20 +00:00
Eric Coissac	7dd8db1409	docs: document conservative rounding strategy for filtering thresholds Specifies that minimum bounds use ceiling and maximum bounds use floor to enforce strictness. Clarifies that the implementation avoids explicit rounding by directly comparing integer counts against floating-point fractions, which is mathematically equivalent.	2026-06-09 10:26:21 +02:00
Eric Coissac	ce45e2fbe1	refactor: centralize k-mer filtering logic and add validation Refactor shared `FilterArgs` and `build_group_filter` to return a `Result` with explicit validation for fraction bounds, min/max ordering, and count constraints. Update conditional defaults for `--min-frac` and `--max-outgroup-count` to depend on explicit quorum flags, preventing silent configuration conflicts. Update documentation and MkDocs navigation to reflect the new centralized k-mer filtering system across `rebuild`, `dump`, and `unitig` commands.	2026-06-09 10:22:25 +02:00
Eric Coissac	2465cfbc4b	Parallelize partition iteration using Rayon Introduce thread-local `Vec<u8>` buffers to eliminate concurrent I/O contention. Replace the mutable row counter with an `AtomicUsize` and `fetch_update` to enable lock-free early termination when the limit is reached. Collected chunks are then written sequentially to preserve partition ordering.	2026-06-09 10:04:25 +02:00
Eric Coissac	d626d42ec7	feat: add --head and --presence-threshold to dump and distance Introduces `--head N` to the `dump` command for early iteration termination and `--presence-threshold N` to the `distance` command for Jaccard filtering on count indexes. Updates filter defaults to adapt based on explicit ingroup/outgroup declarations. Fixes a Rust type mismatch in the unitig closure and updates partition iteration callbacks to return `bool` for proper early termination support. Documentation is updated accordingly.	2026-06-09 10:04:25 +02:00
coissac	650eea43b6	Merge pull request 'Push quqlpklvxsqx' (#18 ) from push-quqlpklvxsqx into main Reviewed-on: #18	2026-06-08 18:15:01 +00:00
Eric Coissac	eb7805c747	feat: add configurable presence threshold to kmer distance Introduce a `--presence-threshold` CLI argument (default: 1) and update `KmerIndex::distance` to accept a `presence_threshold` parameter. This replaces hardcoded zero thresholds, enabling configurable filtering of low-abundance kmers during Jaccard distance calculations.	2026-06-08 20:14:33 +02:00
Eric Coissac	1ec65922df	feat: implement parallel pairwise distance matrices Introduces parallelized pairwise distance matrix computation for Jaccard, Hamming, Bray-Curtis, Euclidean, and Hellinger metrics across `Columnar`, `Packed`, and `Implicit` matrix variants. Adds trait methods and convenience wrappers, safely handles normalization and zero-denominator edge cases, and updates test suites to import required traits for validation.	2026-06-08 20:08:09 +02:00
Eric Coissac	09d9e21744	feat: integrate tracing and enhance bit matrix operations Add the `tracing` crate to `obidebruinj`, `obisys`, and resolve it in `Cargo.lock`. Replace `eprintln!` statements with structured `debug!` and `info!` macros. Introduce a `TracedBar` wrapper for progress bars and enhance the `Stage` lifecycle to emit structured events for timing, memory metrics, and swap warnings. Add a progress spinner for unitig degree computation. Extend `PersistentBitMatrix` with columnar bit-vector operations and parallel distance methods, enabling uniform distance computations across all storage layouts while replacing previous panics with dimension-based fallbacks.	2026-06-08 19:55:06 +02:00
coissac	3f47e22083	Merge pull request 'Push pvqkqxlkkwry' (#17 ) from push-pvqkqxlkkwry into main Reviewed-on: #17	2026-06-06 04:44:10 +00:00
Eric Coissac	03c7bb0b99	Relax unitig assertion in debruijn test Replace the strict `unitigs.len() == 1` assertion with a non-empty check to allow multiple unitigs. Update the test comment to describe the general non-repetitive sequence recovery principle instead of a specific example. The core k-mer roundtrip validation logic remains unchanged.	2026-06-06 06:41:45 +02:00
Eric Coissac	b39eee688a	refactor(debruijn): unify graph traversal with WalkState iterator Replaces deeply nested branching with early returns and `then_some`. Introduces a cycle-detecting `find_chain_start` method and updates `UnitigNucIter` to use step-based iteration with atomic node claiming. This eliminates nested iterators and redundant state management, improving code readability and maintainability.	2026-06-06 06:38:28 +02:00
Eric Coissac	95b3461405	refactor: centralize graph traversal logic in walk Refactor `leavable` and `reachable` to eliminate duplicated graph traversal logic by mutually delegating via `WalkState`. `leavable` now returns `self.walk(graph).is_some()`, while `reachable` delegates to the inverted `direct` state's `leavable` check. This centralizes kmer extension and visited-state validation in `walk`, simplifying control flow and reducing code duplication.	2026-06-06 06:36:48 +02:00

1 2 3 4 5

210 Commits