obikmer

Author	SHA1	Message	Date
coissac	c4c71dc892	Merge pull request 'Push mtzqmmrlmzzx' (#34 ) from push-mtzqmmrlmzzx into main CI / build (push) Successful in 4m34s Details Reviewed-on: #34	2026-06-22 08:47:24 +00:00
Eric Coissac	4e625afaba	refactor: update CI toolchain setup and optimize parallel indexing CI / build (push) Successful in 4m56s Details CI / build (pull_request) Successful in 4m11s Details Update CI workflows to explicitly install the Rust toolchain via rustup and configure musl targets for deterministic static builds in Docker containers. Bump obikmer dependency to 0.1.3. Refactor obicompactvec to reduce peak memory usage by computing column sizes from filesystem metadata, add atomic writes, and implement cleanup guards. Replace parallel iteration patterns in obikindex with a structured PartitionRunner pipeline for simplified error handling and progress tracking.	2026-06-22 10:46:24 +02:00
Eric Coissac	a522c0907e	feat: add CI/CD workflows, release automation, and CLI version flag CI / build (push) Failing after 2m41s Details CI / build (pull_request) Failing after 6s Details Adds Gitea Actions for continuous integration and tagged releases, including static musl binary compilation and artifact upload. Introduces a Makefile target to automate semantic version bumping and publishing. Bumps the package version to 0.1.1 and enables automatic `--version` output via Clap.	2026-06-22 10:36:40 +02:00
Eric Coissac	c1d6f277ce	feat(select): add metrics reporting to selection methods Integrates an obisys::Reporter across indexing and command modules to capture execution metrics. Replaces discarded timer stops with explicit rep.push() calls, adds timing instrumentation for the pack stage, and prints collected reports after each selection branch.	2026-06-22 10:25:24 +02:00
Eric Coissac	9356be4ec0	feat: introduce obitaxonomy crate for hierarchical taxonomy parsing Adds the `obitaxonomy` crate to parse and validate hierarchical taxonomy paths using a strict `taxonomy:/name@rank/...` syntax. Replaces generic string-based path matching in predicates with structured `TaxPath` and `TaxPattern` types, enforcing explicit anchor constraints and rank-aware semantics. Updates filtering documentation to clarify optional leading slashes and segment-boundary matching rules.	2026-06-22 10:24:04 +02:00
Eric Coissac	c694e1f2b0	feat: add benchmark pipeline, expose APIs, and enforce strict paths Introduces a Make-based orchestration for simulating, indexing, merging, filtering, and verifying k-mer counts and presence. Exposes internal builder and iterator APIs publicly, enforces mandatory leading slashes for predicate patterns, registers the `obitaxonomy` crate, and updates tooling configurations alongside documentation.	2026-06-22 10:18:33 +02:00
Eric Coissac	280ca1f5a3	feat: add optimized new_ones constructor for all-ones bit vectors Introduces `new_ones` and `add_col_ones` methods to directly initialize all-ones bit vectors and matrix columns. This replaces redundant initialization sequences that created zero-filled structures and applied bitwise NOT, with a single pass that writes contiguous 0xFF bytes to disk. The change eliminates inversion overhead, streamlines test setup, and improves performance for filter mask intersection logic while preserving identical semantics.	2026-06-22 10:00:01 +02:00
Eric Coissac	9abb2db92f	refactor: replace explicit bit-setting loops with optimized bulk operations Refactor bitmatrix, colgroup, and layer modules to replace manual iteration with concise `or_where` predicates and bulk inversion calls. This simplifies the codebase and leverages optimized internal implementations for improved performance.	2026-06-22 09:56:41 +02:00
Eric Coissac	7c1efa9cbb	feat: add vectorized column filters and optimize partitioner iteration Adds `FilterMask` and conditional bitwise methods (`*_where`) to `obicompactvec` for composable column-based slot filtering. Extends `obikpartitionner` with a `MatrixGroupOps` trait and `column_mask_expr` method to express aggregate constraints as vectorized masks. Refactors matrix builder management into a unified `Builders` enum and introduces `try_compute_combined_mask`, enabling O(1) slot checks and skipping unnecessary row reads during partitioning and rebuilding passes.	2026-06-22 09:54:51 +02:00
Eric Coissac	4c4524766c	feat(matrix): add partial group reductions and column persistence Expands MatrixGroupOps with partial_group_min/max helpers for bitwise reductions and introduces add_col_from methods to persist external vectors as matrix columns. Refactors column aggregation in the partitioner to leverage these group operations directly, replacing iterative row processing with simplified builder lifecycle management and explicit metadata serialization.	2026-06-22 09:49:04 +02:00
Eric Coissac	7eea71fdcd	docs(obicompactvec): update API docs and algorithm descriptions Replace trait-based API documentation with concrete, zero-copy view structs and update all associated diagrams. Refine algorithmic descriptions for sentinel handling, overflow stores, and bulk operations. Clarify temporary file lifecycles and group-chunking strategies to support memory-efficient parallel aggregation.	2026-06-22 09:46:19 +02:00
Eric Coissac	f91c5a3f79	refactor(obicompactvec): unify bit and int vector slice views Refactors column and matrix access to use unified `BitSliceView` and `IntSliceView` abstractions, replacing legacy `PackedCol`/`IntColView` types. Introduces `BitSlice`/`IntSlice` traits for zero-copy, trait-based bitwise and arithmetic operations across persistent and temporary vector types. Removes deprecated in-memory `MemoryBitVec` and `MemoryIntVec` implementations and their tests, while updating dependent crates to use the new view-based API and `BitSliceMut` trait.	2026-06-17 23:51:32 +02:00
Eric Coissac	fb4962c4fe	refactor: replace in-memory vectors with temp-file-backed storage Introduces `TempCompactIntVec` and `TempBitVec` as temporary, file-backed intermediates to replace eager in-memory vectors, enabling OS-level paging under memory pressure. Updates the `MatrixGroupOps` trait to return `io::Result` types, allowing proper error propagation and supporting chunked accumulation for large column groups. Includes builder patterns with `.freeze()` finalization, automatic `TempDir` cleanup on drop, and necessary test updates to handle the new fallible signatures. Also fixes `Cargo.toml` section ordering.	2026-06-17 23:36:15 +02:00
Eric Coissac	1d38d87ff9	Add column group operations and mask_with trait Introduce the `ColGroup` struct and `MatrixGroupOps` trait to manage named subsets of column indices and perform additive aggregations (count, sum, any). Implement these operations for `PersistentBitMatrix` and `PersistentCompactIntMatrix`, applying size-optimized branches for presence counts and direct accumulation for small groups. Additionally, add a `mask_with` trait method that efficiently zero-sets elements based on a mask, optimized for sparse masks with O(n_zeros) complexity. Include comprehensive tests covering overflow handling, slot masking, and result additivity across partitioned data.	2026-06-17 23:28:52 +02:00
Eric Coissac	93559c3294	feat: introduce unified column view types for bit and int matrices This commit introduces `BitColView` and `IntColView` to abstract over Columnar and Packed storage formats, implementing `BitSlice` and `IntSlice` for uniform column access. It adds `col_view()` accessors to `PersistentBitMatrix` and `PackedCompactIntMatrix`, explicitly panicking on implicit variants. The new types are publicly re-exported, and unit tests are added to validate per-element retrieval, aggregation methods, and parity with the original columnar representation.	2026-06-17 23:24:11 +02:00
Eric Coissac	1f0d77d5bf	docs: document compact vector implementation with Mermaid diagrams Add Mermaid diagrams to visualize the trait hierarchy, compact int storage layout, and SIMD-vectorizable arithmetic operations for MemoryIntVec and PersistentCompactIntVec. Also document concrete type structures and planned layer/partition composition rules to improve documentation clarity.	2026-06-17 23:20:55 +02:00
Eric Coissac	eeba43ac4f	docs: add technical reference for obicompactvec module Document the two-tier compact integer encoding, BitSlice/IntSlice trait hierarchy, and SIMD-friendly O(n+k) algorithms. Include details on concrete memory and persistent vector types, matrix aggregation traits, and planned group-filtering APIs.	2026-06-17 23:19:31 +02:00
Eric Coissac	7ed7b26039	perf: optimize vec arithmetic and add overflow tests Refactor `cmp_scalar`, `min`, `max`, `add`, and `diff` to operate directly on the primary byte array, deferring overflow slot resolution to a secondary pass. This eliminates HashMap lookups in the hot path and enables SIMD vectorization. Add six unit tests to validate correct promotion and demotion between storage slots when values cross the 255 threshold.	2026-06-17 23:18:18 +02:00
Eric Coissac	26de90f18d	feat: add iteration and aggregation to compact int vec Implemented `sum()`, `count_nonzero()`, and `iter()` to complete the numeric vector interface. The builder now computes aggregate values across memory-mapped regions and overflow entries, while the reader delegates these operations to its inherent methods. The iterator provides zero-copy access to underlying `u32` elements.	2026-06-17 23:15:56 +02:00
Eric Coissac	497d250d8a	refactor: replace byte-level bit iteration with 64-bit words Refactor `BitIter` to process `u64` chunks using word-aligned shifts instead of byte-level operations. Introduce a dedicated `MemoryBitIter` for `MemoryBitVec`, updating its `iter()` and `IntoIterator` implementations accordingly. Hide `MemoryBitIter` from the public API to narrow the crate's interface, while leveraging explicit alignment guarantees for safer and more efficient bit extraction.	2026-06-17 23:11:12 +02:00
Eric Coissac	aa98e82875	refactor: introduce PackedIntCol view and use iterators Centralizes overflow handling and improves modularity by replacing manual mmap indexing and row loops with composable iterator patterns. This change leverages Rust's iterator traits for efficient, idiomatic column traversal while encapsulating data access in a dedicated view struct.	2026-06-17 23:09:18 +02:00
Eric Coissac	5ff5b04d2d	refactor: replace manual bit ops with BitSlice traits Refactors bit manipulation and distance calculations to leverage standardized `BitSlice` traits, replacing manual byte/word logic with safer, reusable methods. Extends `IntSlice` and `IntSliceMut` traits to expose direct memory-mapped access and overflow management, enabling efficient bulk data extraction and serialization. Replaces manual bit-shifting loops with optimized table-based unpacking and adds population count and distance metric methods for improved performance. Updates `PersistentBitVecBuilder` with file tracking and safe flushing, and aligns test imports with new trait bounds.	2026-06-17 15:28:44 +02:00
Eric Coissac	df7b400fda	perf: optimize aggregation with byte-level helpers and direct mmap Introduce `byte_sum` and `byte_count_nonzero` to efficiently aggregate compact-int byte slices, bypassing per-element decoding and overflow map lookups. Refactor `sum()` and `count_nonzero()` across the matrix, reader, and traits modules to use direct memory-mapped slice iteration and idiomatic Rust iterators. Additionally, expose `MemoryIntIter` publicly and implement `IntoIterator` and `IntSlice` for `MemoryIntVec` to enable standard iteration and delegate aggregation to the new helpers.	2026-06-17 15:21:21 +02:00
Eric Coissac	d1717688d2	refactor: extract matrix helpers and improve bit iteration ergonomics Refactor parallel matrix construction by extracting reusable `pairwise_matrix` and `pairwise2_matrix` helpers, and consolidate binary record deserialization into dedicated parsing functions. Add `set` and `iter` methods to `BitSliceMut` and `MemoryBitVec` for ergonomic bit manipulation and iteration. Standardize JSON field extraction via `meta::field`, expose `MemoryBitIter`, and improve test reliability by automatically cleaning up temporary directories.	2026-06-17 15:15:39 +02:00
Eric Coissac	cde6457eea	feat: add memory vectors, slice traits, and column extraction methods Introduce `MemoryBitVec` and `MemoryIntVec` for efficient in-memory storage with hybrid compression and overflow handling. Implement `BitSlice`, `BitSliceMut`, `IntSlice`, and `IntSliceMut` traits across persistent and memory-backed types to enable generic slice operations and bitwise/arithmetic overloads. Add `col_persist` and `col_as_memory` methods to `BitMatrix` and `IntMatrix` for efficient column extraction. Align with the new single-pass rebuild architecture by supporting fast kmer filtering and matrix rebuilding. Includes comprehensive tests and profiling instrumentation for the packing phase.	2026-06-17 15:03:18 +02:00
Eric Coissac	b6fcbc545f	refactor: replace rayon with NUMA-aware PartitionRunner Replaces `rayon` parallel iteration across index, rebuild, reindex, and select modules with a custom `PartitionRunner`. This introduces NUMA-aware task distribution with CPU pinning and round-robin scheduling, eliminating `Arc`, `Mutex`, and atomic synchronization primitives in favor of a flat, pre-spawned worker architecture. Error handling is simplified via `.map_err()` and the `?` operator, while progress bar updates are decoupled into dedicated callbacks.	2026-06-15 18:53:31 +02:00
coissac	9578f991f4	Merge pull request 'Push pslsukyowzrp' (#32 ) from push-pslsukyowzrp into main Reviewed-on: #32	2026-06-15 16:29:24 +00:00
Eric Coissac	1cd7916e06	refactor: replace rayon with NUMA-aware PartitionRunner Replaces `rayon` parallel iteration across index, rebuild, reindex, and select modules with a custom `PartitionRunner`. This introduces NUMA-aware task distribution with CPU pinning and round-robin scheduling, eliminating `Arc`, `Mutex`, and atomic synchronization primitives in favor of a flat, pre-spawned worker architecture. Error handling is simplified via `.map_err()` and the `?` operator, while progress bar updates are decoupled into dedicated callbacks.	2026-06-15 18:29:04 +02:00
Eric Coissac	bc92dc4592	refactor: restructure partitioner with shared utilities and pipeline This commit restructures the partitioner crate by extracting shared utilities and the `ColBuilder` enum into a new `common` module. It introduces a multi-phase `graph_pipeline` for constructing and materializing De Bruijn graphs, replacing manual graph construction in `index_layer`, `merge_layer`, and `rebuild_layer`. All layer workflows now use centralized `build_graph` and `materialize_layer` abstractions, with standardized error context strings for improved diagnostics.	2026-06-15 14:08:16 +02:00
coissac	a9567ad023	Merge pull request 'Push rtnzuqxzmkon' (#31 ) from push-rtnzuqxzmkon into main Reviewed-on: #31	2026-06-15 09:40:35 +00:00
Eric Coissac	4a64718fd1	perf: replace partition processing with adaptive NUMA worker pool Replaces the previous partition processing logic with an adaptive, NUMA-aware multi-threaded worker pool that dynamically scales active threads based on real-time CPU efficiency. Introduces pre-spawned, CPU-pinned threads managed via crossbeam channels and Rayon to optimize memory bandwidth and core utilization. Adds a `max_workers()` accessor to aggregate maximum worker capacity across NUMA nodes and updates diagnostics to report active versus maximum worker counts.	2026-06-15 11:40:14 +02:00
Eric Coissac	7a87e911b6	feat: introduce NUMA-aware PartitionRunner for adaptive parallelism Replace NUMA-naive Rayon loops and ad-hoc adaptive pools with a unified `PartitionRunner` that manages a NUMA-aware worker pool. The implementation uses pinned Rayon thread pools per node and activates dormant threads based on real-time CPU efficiency metrics. This standardizes partition-level parallelism, optimizes memory locality, and eliminates cross-socket traffic. Includes architecture documentation and updates mkdocs navigation.	2026-06-15 11:34:41 +02:00
coissac	313d73838a	Merge pull request 'feat: add pipeline concurrency throttling and HPC build docs' (#30 ) from push-owwylwtskwzw into main Reviewed-on: #30	2026-06-15 08:33:41 +00:00
Eric Coissac	175ea5bbd0	feat: add pipeline concurrency throttling and HPC build docs Introduces a counting semaphore-based throttling mechanism to limit concurrent file I/O and pipeline processing. Replaces custom path wrappers with standardized `Throttled` types across `obikmer` and `obikpartitionner`, ensuring RAII-based resource cleanup and explicit backpressure. Additionally, documents how to redirect Cargo build artifacts to local scratch storage on HPC filesystems to prevent compilation slowdowns.	2026-06-15 10:33:23 +02:00
coissac	c6ea0c53e3	Merge pull request 'feat: implement NUMA-aware worker pools for merge command' (#29 ) from push-wusvurukprsr into main Reviewed-on: #29	2026-06-14 21:57:21 +00:00
Eric Coissac	ea767376bd	feat: implement NUMA-aware worker pools for merge command Replaces the global Rayon pool with per-NUMA-node thread pools that pin worker threads to their respective nodes, leveraging Linux first-touch allocation to reduce cross-NUMA memory contention and improve cache locality. Integrates the `hwlocality` crate with a vendored build, includes graceful fallbacks for single-socket or non-Linux systems, and updates dependency constraints. Also adds installation and architecture documentation, and corrects parallelism detection in the partitioner.	2026-06-14 23:56:52 +02:00
coissac	f1d76f3203	Merge pull request 'refactor(merge): extract adaptive worker spawn logic' (#28 ) from push-yzruqtyqvopm into main Reviewed-on: #28	2026-06-13 12:56:34 +00:00
Eric Coissac	c4071eb450	refactor(merge): extract adaptive worker spawn logic Centralize inline spawn checks into a `should_spawn_worker` function with adaptive thresholds. The first worker spawns at <95% CPU efficiency, while subsequent workers only trigger if marginal efficiency gain exceeds 25% of the expected `1/n_workers` (minimum 3%). Also increases the spawn poll interval from 10s to 20s.	2026-06-13 14:56:01 +02:00
coissac	817b02cbc1	Merge pull request 'Push zkspuxlpumpw' (#27 ) from push-zkspuxlpumpw into main Reviewed-on: #27	2026-06-13 11:25:12 +00:00
Eric Coissac	547cb72d76	refactor: Enforce Rayon parallelism and fix merge_layer concurrency Updated memory guidelines and feedback docs to explicitly classify intra-partition phases as parallel, correcting prior assumptions of sequential execution. Refactored merge_layer.rs to wrap column builders in Arc<Mutex<ColBuilder>> and use Arc::try_unwrap for safe concurrent access, eliminating race conditions and preventing double-closes during pass2.	2026-06-13 13:24:55 +02:00
Eric Coissac	6d85387077	feat: add performance instrumentation and dynamic worker scaling This change enhances observability and adaptability in the merge pipeline. Performance timing and debug logging are added to the De Bruijn graph and partition merge layers to track phase durations and pipeline metrics. The merge module replaces blocking receives with timed polls to sample CPU efficiency, dynamically spawning workers when utilization drops below a threshold. A new script is also introduced to parse merge debug logs and generate structured Markdown reports detailing throughput, phase breakdowns, and partition performance.	2026-06-13 13:21:53 +02:00
coissac	fb5b53dca9	Merge pull request 'Push ooxwzorvsqvy' (#26 ) from push-ooxwzorvsqvy into main Reviewed-on: #26	2026-06-13 09:59:07 +00:00
Eric Coissac	fddf630772	style: apply consistent formatting and whitespace normalization Applies consistent formatting, whitespace normalization, and indentation standardization to `debruijn.rs` and `merge.rs`. Reorganizes imports and downgrades a unitig traversal log from `info!` to `debug!`. No functional logic or runtime behavior is altered.	2026-06-13 11:58:20 +02:00
Eric Coissac	bc14346f5f	feat: add CPU-aware parallel worker pool for partition merging Introduce CpuSample to measure process-level CPU efficiency and wall-clock time. Use crossbeam-channel to distribute partition merging tasks to a dynamic worker pool that scales based on CPU utilization, capped at half the available cores. Update diagnostics to track pool usage.	2026-06-13 11:58:20 +02:00
Eric Coissac	fb8c6e427c	refactor: pass Unitig objects directly instead of raw byte slices Refactored `try_for_each_unitig` and related pipelines across `obidebruinj` and `obikpartitionner` to accept `Unitig` instances directly. This eliminates manual `Unitig::from_nucleotides()` conversions, simplifies the data flow, and reduces unnecessary allocation overhead.	2026-06-13 11:52:50 +02:00
Eric Coissac	1f336fe496	refactor: replace mutex with channels for parallel debruijn processing Add `rayon` and `crossbeam-channel` dependencies to support concurrent execution. Replace the synchronous, mutex-protected closure pattern with a channel-based producer-consumer approach using `std::thread::scope`. This decouples unitig iteration from processing, eliminating lock contention and `Mutex` overhead while enabling parallel workloads.	2026-06-13 11:49:27 +02:00
Eric Coissac	5f98d2ef96	refactor: replace explicit collect with Unitig::from_nucleotides Introduce a thread-local buffer to materialize nucleotide iterators into contiguous slices. Update `try_for_each_unitig` across the debruijn, index, merge, and rebuild layers to directly instantiate `Unitig` via `from_nucleotides()` instead of explicitly collecting iterators. This eliminates intermediate allocations and aligns test code with the new approach.	2026-06-13 11:47:06 +02:00
Eric Coissac	8b563d0804	refactor: migrate pipeline stages and improve graph processing Refactored neighbor resolution to explicitly track unvisited indices for degree-1 nodes, updated display formatting, and added timing and debug logging to the degree computation routine. Migrated pipeline stages from eager vector returns to explicit flat implementations, enabling backpressure-aware streaming, configurable batch processing, incremental yielding, and progress tracking via a delta channel.	2026-06-13 11:44:17 +02:00
coissac	7208dcbb4a	Merge pull request 'refactor: defer SrcLayerData lookups in RawBatch' (#25 ) from push-nxrynoorswrw into main Reviewed-on: #25	2026-06-12 20:19:21 +00:00
Eric Coissac	2e69b0b7fe	refactor: defer SrcLayerData lookups in RawBatch Replace eager resolution of `Vec<u32>` values with an `Arc<SrcLayerData>` handle passed alongside `Vec<CanonicalKmer>`. This shifts the lookup logic to the subsequent transform step, reducing memory overhead and enabling shared, thread-safe access to the source layer data.	2026-06-12 22:18:57 +02:00

1 2 3 4 5

237 Commits