mirror of
https://github.com/metabarcoding/obitools4.git
synced 2026-04-30 12:00:39 +00:00
8c7017a99d
- Update obioptions.Version from "Release 4.4.29" to "/v/ Release v5" - Update version.txt from 4.29 → .30 (automated by Makefile)
1.2 KiB
1.2 KiB
Taxonomic Analysis Functions in obiseq Package
This module provides tools for assigning taxonomic labels to biological sequences using a reference taxonomy.
-
TaxonomicDistribution(taxonomy):
Returns a map from taxonomic nodes to read counts, based ontaxidannotations in the sequence metadata. It validates taxids against the taxonomy and enforces strict handling of aliases. -
LCA(taxonomy, threshold):
Computes the Lowest Common Ancestor (LCA) of all taxonomic assignments for a sequence, weighted by their abundances.- Iteratively traverses upward from each taxon’s path in the taxonomy tree.
- At each level, computes the relative weight (
rmax) of the most frequent taxon. - Stops when
rmax < threshold, returning:
• the LCA taxon,
• its confidence score (rans), and
• total read count used.
-
AddLCAWorker(...):
Creates aSeqWorkerfunction to annotate sequences with LCA results:- Sets attributes like
<slot>_taxid,<slot>_name, and<slot>_error(rounded to 3 decimals). - Automatically appends
_taxidif missing inslot_name.
- Sets attributes like
All functions integrate with the OBITools4 ecosystem, supporting robust taxonomic inference for metabarcoding workflows.