diff --git a/Wolf-tutorial-with-the-OBITools3.md b/Wolf-tutorial-with-the-OBITools3.md index 596f603..48ef398 100644 --- a/Wolf-tutorial-with-the-OBITools3.md +++ b/Wolf-tutorial-with-the-OBITools3.md @@ -131,11 +131,11 @@ For EMBL files, you can give the path to a directory with several EMBL files. obi grep --require-rank=species --require-rank=genus --require-rank=family --taxonomy wolf/taxonomy/my_tax wolf/v05_refs wolf/v05_refs_clean -2. Dereplicate identical sequences: +2. Dereplicate identical sequences (note: not a necessary step, avoid for big databases as long as #79 is not fixed): obi uniq --taxonomy wolf/taxonomy/my_tax wolf/v05_refs_clean wolf/v05_refs_uniq -3. Ensure that the dereplicated sequences have a taxid at the family level: +3. Ensure that the dereplicated sequences have a taxid at the family level (if you ran the previous step): obi grep --require-rank=family --taxonomy wolf/taxonomy/my_tax wolf/v05_refs_uniq wolf/v05_refs_uniq_clean