Commit Graph

126 Commits

Author SHA1 Message Date
812fdfa06f Switch to version 2.4.0 of exonerate
Former-commit-id: c3d1cd54f603eb46ee89a6534ab1f3569182c762
Former-commit-id: e501561bb6453ef97c7e3a00bf6759a779120438
2018-05-11 16:19:30 +02:00
5a1c8283db Freshly regerenrated CAU tRNA reference library with the new tool script
Former-commit-id: b2808b9d4462c32d0d81fd4f8d134fb94ad9b51c
Former-commit-id: a1db9d37bba3adf1100c4418b9211aa49aa00da7
2018-04-05 18:31:49 +02:00
405c89ea47 Add eclipse project files
Former-commit-id: 96afe525043a5a45b7d0775a7fe777c022426ba4
Former-commit-id: 52972f01288fd73821ea16d51ed645d7a792d19f
2018-04-05 18:16:07 +02:00
c691818059 Changes in .gitignore
Former-commit-id: 7e9fcd4ed6487e52562b274d87345e0e46f1458d
Former-commit-id: bf7a58dcf96db9f8c132977aa7d3f6af97cce0f5
2018-04-05 18:15:40 +02:00
9070b54732 Switch to clustal-omega version 1.2.4
Former-commit-id: f2eba5fb6556643d8926f3be4a67efcf5c55c9dc
Former-commit-id: 5303009434cf90955cd0d58246fe549fa3b487b4
2018-04-05 18:02:01 +02:00
9bcfa914fe Switch to version 1.2.38 of aragorn
Former-commit-id: 8fdde2e7586aa22e401bc760f8c67c24eded9d78
Former-commit-id: 7e79dc7df97373f5527b6a2d9526cecfcb2b7376
2018-04-05 18:01:01 +02:00
0a5a65ab26 Change the notation algorithm to take advantage of the new CAU tRNA
reference library

Former-commit-id: 32650f41c4a7f95ce5da78c1f520438b35c1d4d1
Former-commit-id: 7ee31aaed2aca437b689fc7930095279fce0051b
2018-04-05 17:59:12 +02:00
81657a288a Modify script to accept compressed genome files
Former-commit-id: f816e3ce8b10e2ca3f1aa9ae969c24e699368e25
Former-commit-id: 16fb412552debdfd2172926e8a8b63be05257bdf
2018-04-05 17:58:19 +02:00
962ff827dc New version of the CAU tRNA reference database
Former-commit-id: 15ec72ca1243c88066c55e8f3655c1da929c25bd
Former-commit-id: b9344fb5e130a79dbe7247ecdbf7b00c603d9597
2018-04-05 17:56:45 +02:00
ee634cc779 Simplify CAU tRNA reference database building to keep onlyCAU tRNA
from plastomes where the three categories of CAU tRNA (Met/Ile/fMet)
are annotated

Former-commit-id: 67dc445698e22fe8a503c6700977c79e4817d302
Former-commit-id: 6e84303543b0752a7946bdde6e5114cfe6eef8da
2018-04-05 17:55:31 +02:00
c37c175fd8 Switch to clustalo version 1.2.4
Former-commit-id: 0e15ea203e7af3c7bf0504991cd99fef713bc133
Former-commit-id: 46d3bdd981e33e9f9d18e1c8946cda1f778dbd97
2018-03-30 13:47:59 +02:00
1d0600bd31 Switch Aragorn to version 1.2.38
Former-commit-id: 46bb36694fb02a83926b78d76aa367ffffd55706
Former-commit-id: 916ab5d14b599d8bc4fd3d47958165a5512a9849
2018-03-30 13:27:13 +02:00
fc821d6be8 Final small changes to patch the bug related to complex filenames
Former-commit-id: c59d7b5e7f8c8f37e955e44b354521c312cfc2c4
Former-commit-id: e9d8bc55d4542b276e91104672ba7dddb53c0c6a
2018-01-25 08:53:27 +01:00
640294b47e Always a new attempt to solve the bug...
Former-commit-id: 0a5ece1e927034a7001e2e1bcd2743d9b9e3ec6d
Former-commit-id: 0aafb797b73c8beb4d8662784c8537e6f0c13c5d
2018-01-24 16:41:35 +01:00
44a75f6fd7 Comment out phase 2 CDS searching
Former-commit-id: ca048f8c762475a2ca02735a20b90576b0222462
Former-commit-id: 455ffc2945c49f701f7406930fbe2e4e166d172d
2018-01-24 16:12:49 +01:00
238b500e1a Add missing file...
Former-commit-id: f71b0396212bb8cd2df1ca1a4e5847f30c613a48
Former-commit-id: 17cc9616d8835548e996712545d4cc0e1833f90f
2018-01-24 15:13:31 +01:00
1687b3acbf Merge branch 'master' of
ssh://git@git.metabarcoding.org/org-asm/org-annotate.git

Conflicts:
	detectors/cds/bin/do_exonerate.csh

Former-commit-id: aeae3d48b477eb7a8d348d6d3da938a83289dc05
Former-commit-id: 89a5d80d048cb0a96bb986c84a7346823e382fe8
2018-01-24 15:09:13 +01:00
8d2ec19fe8 Patch a bug to launch exonerate on complexe filename
Former-commit-id: e8357a639a22cb123985a0ed487dfd4018c9bb0a
Former-commit-id: a2e1c2ce75c0eac9574b7a68506f6f209e54ea89
2018-01-24 15:07:04 +01:00
2e5bdf2246 Patch a bug to launch exonerate on complexe filename
Former-commit-id: fd381611fc7ec543f374e8c6132d29c66612f1a4
Former-commit-id: 653603e547c675c80ace6a3a9d0f13be89048017
2018-01-24 14:30:00 +01:00
4c7ba137d9 Patch a bug on numering the last nucleotide in the embl file
Former-commit-id: 626566f2a64cb6628f1cd5ef7bfae63f4bd6a6ad
Former-commit-id: 40dd1f5246103aa6500ea6181bc4c010a873bbe5
2018-01-24 13:28:27 +01:00
f74bb0d973 Patch a bug blocking the exonerate execution when the genome filename is
too long or complex

Former-commit-id: a9da8eab920f422609b41be2e16d65e0569f953c
Former-commit-id: 6829ae3081bea4a1d16ec8d3bad10e51f01f51d7
2018-01-23 07:32:12 +01:00
a25ab81b38 Add logs to print the sequence length and if the sequence is reverse
complemented

Former-commit-id: ba55f354ea7a51119fe44bcb36aa5927194293e2
Former-commit-id: dd7715be54ac92c9625f0a2c30e572b7aee76dc7
2018-01-18 22:00:07 +01:00
08d7c940a4 Patch a bug in the final sequence formating occuring when the input
sequence has not 60 char per line

Former-commit-id: 213735f5b9f3cd817053e284d7844cfdd69726c6
Former-commit-id: 074b4aaac0eac00de9b3b48e75804417ce780a2d
2018-01-18 21:58:50 +01:00
04ea0f110d Allows for reporting
Former-commit-id: af7999b0f3c69be9c796799813950adbdb0fb0e8
Former-commit-id: f8a6f2a26c58a02aa6d076bd3005a02f906de82a
2016-10-20 09:31:54 -03:00
96b5993693 Patch a serious bug in the embl formating of the sequence leading to
frameshifts in the embl formated sequences

Former-commit-id: 92d73a91a2486e59809fef0cff0060b14b47be70
Former-commit-id: a034fb4a42dafca46f491d5e7cfe08de8b2eea92
2016-10-12 12:25:31 -03:00
1ac0af03c2 Patch the new ycf1 specific parameters
Former-commit-id: 66f848b351a6b8186ff03a7059aa167f39ed29a1
Former-commit-id: fd4260434739725ff967138089eaeeb013812784
2016-10-09 07:19:35 -03:00
b3b9955140 Force source to be the first feature
Former-commit-id: b78cd77b042bf3c7ec4251e5f2e77762b84d5995
Former-commit-id: 1f1ca1afd0359bd8fde039b8be0e80e9472124c7
2016-10-09 07:19:00 -03:00
8156d5dd2f Add specific exonerate parameters for ycf1
Former-commit-id: c956dde7ad2183b72fe5221333876747db97b361
Former-commit-id: 5ddf35ea93eadadecb063277afd513e8ae73e559
2016-10-09 07:11:20 -03:00
001c1dcac1 For a given protein consider only cluster with at list a score of 95% of
the best score

Former-commit-id: cfdc6fcd37a4036d8bcca27bc7e120e60a94998d
Former-commit-id: f45bb7922f28165fd3baa1bc67bf815a759d1590
2016-10-09 04:24:08 -03:00
54413e7420 Change awk to $AwkCmd
Signed-off-by: Eric Coissac <eric.coissac@metabarcoding.org>
Former-commit-id: 79d7c6cc4333c8f72cef71f9c5323c151bb0e6b7
Former-commit-id: 869cf28bb894c95297fc0f80e424a55d347f2a65
2016-10-09 01:25:57 -03:00
a4147f27e2 Add ignore file
Former-commit-id: 3d1c76d9c7fad829431fbfd7ea72b868fe1cf597
Former-commit-id: 03854a5d4f28a31145540cc72d097e5f9edbee64
2016-10-09 01:11:19 -03:00
970addd9df Change the awk call by $AwkCmd
Former-commit-id: c5642bbbe1c9c16139d36a4446e96fde120adce4
Former-commit-id: 68463a5c29dd90a48339d34d75efe830a32d1fe5
2016-10-09 01:01:57 -03:00
87453701b7 Change some parameters in program calls
Former-commit-id: 3ed8760844007def1d8c5a9cf4eaee01d571fe0b
Former-commit-id: b15127c8f8a601b33e09daccc645cbb8a1f23a2e
2016-10-06 12:37:57 -03:00
4992483b80 Change some blastx parametter to get better matches by taking into
account intron size and the good genetic code

Former-commit-id: 6600123fbdce2070058074e82c791c7fc260c39b
Former-commit-id: ac413cc4a49844d4fa4087107aa84680d36f3df1
2016-10-06 12:36:43 -03:00
e4f3081fa8 Switch to the speedup mode because of the slow down imposed by the new
exonarate parametters

Former-commit-id: 30f2caea735460bcc4dfa61adde72d7da2fb6f2e
Former-commit-id: 0537c77f5bc16d766b3cbd668dcd1e1711140937
2016-10-06 12:35:32 -03:00
3d91c88058 Merge branch 'master' of ssh://git@git.metabarcoding.org/org-asm/org-annotate.git
Former-commit-id: 04e163d5613d7f6319b2509f5d279a08a976b315
Former-commit-id: 6bcab2ff323e8cbd027cb958de6dfdf4970cd40d
2016-10-06 10:08:06 -03:00
16b5e2927d Make changes to better detect pseudo genes frameshited and annotate them
correctly

Former-commit-id: d827908d63149941538e686b48f60a132173cb80
Former-commit-id: 2841c75b415c6c8fa35a6a90e23cf82c3c84408b
2016-10-06 10:06:37 -03:00
860cd217d4 Add the management of pseudogenes
Former-commit-id: 26d91366e483cf17c440b251ab1e8ac5390699fe
Former-commit-id: 0d3d69ba351bd174fe08387a474fd1137559e38a
2016-10-06 08:56:45 -03:00
fd9a0ef686 Merge branch 'master' of git.metabarcoding.org:org-asm/org-annotate
Conflicts:
	org-annotate.sh

Former-commit-id: 284096bfdd04b0d0fb03606a3f9497771522bd1b
Former-commit-id: 7160ad77a2192781681e4e0a424f1f5d8904c269
2016-10-05 15:18:25 +02:00
cf5a5d1ce5 Add management of partial sequences
Former-commit-id: abd6112ac558592616c197f2fe761880a847b031
Former-commit-id: d2ea2e7e306512bf8ef92d3497a0302919b81010
2016-10-05 15:11:26 +02:00
d4da1d01fd A new set of protein cleaned for the CDS detector prepared using the
clusterizecore.sh script from the detectors/cds/lib folder.

The CDS detector is now modified to use the clean.fst files.


Former-commit-id: e30a53b5b6b658388af4b2640b30e6765c729894
Former-commit-id: 3015ad50d25248fb117ab00e816b00fde1f9ba1d
2016-10-05 09:31:24 -03:00
3a8860aaf7 Add the possibility to annotate partially sequenced genome.
Add the print of a source feature.

Former-commit-id: b4415ad285b00099a24094ab5e6d244a37d709ac
Former-commit-id: 3e7e958573b476c0e62e3e9dcaafec9ed1275b50
2016-08-08 14:44:08 +02:00
466308267e Add a patch for chloroplast annotation when no inverted repeats are
detected

Former-commit-id: 7e3ddd41cf0d0788223382fedbf45b183974233e
Former-commit-id: e5a8ceb825f78d243e37d22cd6b2e91f403c0ee8
2016-05-02 15:32:28 +02:00
8a1a1d57ba remove the printing of an extra empty line at the beginning of the embl
file


Former-commit-id: bba6581b86f21474d6e9b2acb3a9b4937984bbb8
Former-commit-id: 7570b9c21814614c45e57e253bdfae58c9ba3c2b
2016-05-02 12:25:05 +02:00
8113b80d47 Add annotation of nuclear rDNA cistron
Former-commit-id: ee54019ddddbea4d17956622968f6ce673b609e1
Former-commit-id: 5e5381cf59409ca3dc01098b0e3f330efe0a6a32
2016-05-02 10:56:40 +02:00
7d04371387 Add ITSx to th src
Former-commit-id: 7f75da850aa538ea79e633d2a600e3c42558cdba
Former-commit-id: 96200ce9daf7a9fd51d532cc2403611eaf8026f1
2016-04-28 10:08:10 +02:00
890605039b Merge branch 'master' of ssh://git@git.metabarcoding.org/org-asm/org-annotate.git
Former-commit-id: 481c726b17189e0728abcbe467ce0f7e994f968f
Former-commit-id: 25634afe2d8ee729ddd03b66cf31c993a6a815ac
2016-04-25 23:44:33 +02:00
20d0bcfbf8 First trial to automatcally cleanup the core CDS database
Former-commit-id: dc61a61816084f385f1aa89324b08f81602b4353
Former-commit-id: ee8bf1a08e4af4f4d8d12a1e2a83c5f688e5f7e8
2016-04-25 23:41:18 +02:00
644f154050 Add a fasta1line function reformating the sequence with a line for the
header and a single line for the sequence

Former-commit-id: 619dc4f5515b0080e5696806f9325f90b983d22e
Former-commit-id: c244fdbb5c84bebf9ae17d6e15c0fd4b00914d32
2016-04-25 11:15:14 +02:00
6f00381000 Add license
Former-commit-id: 08aa22fcdc5156b0e7cbabb18b1bd5c02288f8cf
Former-commit-id: 750c6de83d833f8eba190888dc96bfa9ba8413ea
2016-04-23 18:05:04 +02:00