200 Commits

Author SHA1 Message Date
b3b9955140 Force source to be the first feature
Former-commit-id: b78cd77b042bf3c7ec4251e5f2e77762b84d5995
Former-commit-id: 1f1ca1afd0359bd8fde039b8be0e80e9472124c7
2016-10-09 07:19:00 -03:00
8156d5dd2f Add specific exonerate parameters for ycf1
Former-commit-id: c956dde7ad2183b72fe5221333876747db97b361
Former-commit-id: 5ddf35ea93eadadecb063277afd513e8ae73e559
2016-10-09 07:11:20 -03:00
001c1dcac1 For a given protein consider only cluster with at list a score of 95% of
the best score

Former-commit-id: cfdc6fcd37a4036d8bcca27bc7e120e60a94998d
Former-commit-id: f45bb7922f28165fd3baa1bc67bf815a759d1590
2016-10-09 04:24:08 -03:00
54413e7420 Change awk to $AwkCmd
Signed-off-by: Eric Coissac <eric.coissac@metabarcoding.org>
Former-commit-id: 79d7c6cc4333c8f72cef71f9c5323c151bb0e6b7
Former-commit-id: 869cf28bb894c95297fc0f80e424a55d347f2a65
2016-10-09 01:25:57 -03:00
a4147f27e2 Add ignore file
Former-commit-id: 3d1c76d9c7fad829431fbfd7ea72b868fe1cf597
Former-commit-id: 03854a5d4f28a31145540cc72d097e5f9edbee64
2016-10-09 01:11:19 -03:00
970addd9df Change the awk call by $AwkCmd
Former-commit-id: c5642bbbe1c9c16139d36a4446e96fde120adce4
Former-commit-id: 68463a5c29dd90a48339d34d75efe830a32d1fe5
2016-10-09 01:01:57 -03:00
87453701b7 Change some parameters in program calls
Former-commit-id: 3ed8760844007def1d8c5a9cf4eaee01d571fe0b
Former-commit-id: b15127c8f8a601b33e09daccc645cbb8a1f23a2e
2016-10-06 12:37:57 -03:00
4992483b80 Change some blastx parametter to get better matches by taking into
account intron size and the good genetic code

Former-commit-id: 6600123fbdce2070058074e82c791c7fc260c39b
Former-commit-id: ac413cc4a49844d4fa4087107aa84680d36f3df1
2016-10-06 12:36:43 -03:00
e4f3081fa8 Switch to the speedup mode because of the slow down imposed by the new
exonarate parametters

Former-commit-id: 30f2caea735460bcc4dfa61adde72d7da2fb6f2e
Former-commit-id: 0537c77f5bc16d766b3cbd668dcd1e1711140937
2016-10-06 12:35:32 -03:00
3d91c88058 Merge branch 'master' of ssh://git@git.metabarcoding.org/org-asm/org-annotate.git
Former-commit-id: 04e163d5613d7f6319b2509f5d279a08a976b315
Former-commit-id: 6bcab2ff323e8cbd027cb958de6dfdf4970cd40d
2016-10-06 10:08:06 -03:00
16b5e2927d Make changes to better detect pseudo genes frameshited and annotate them
correctly

Former-commit-id: d827908d63149941538e686b48f60a132173cb80
Former-commit-id: 2841c75b415c6c8fa35a6a90e23cf82c3c84408b
2016-10-06 10:06:37 -03:00
860cd217d4 Add the management of pseudogenes
Former-commit-id: 26d91366e483cf17c440b251ab1e8ac5390699fe
Former-commit-id: 0d3d69ba351bd174fe08387a474fd1137559e38a
2016-10-06 08:56:45 -03:00
fd9a0ef686 Merge branch 'master' of git.metabarcoding.org:org-asm/org-annotate
Conflicts:
	org-annotate.sh

Former-commit-id: 284096bfdd04b0d0fb03606a3f9497771522bd1b
Former-commit-id: 7160ad77a2192781681e4e0a424f1f5d8904c269
2016-10-05 15:18:25 +02:00
cf5a5d1ce5 Add management of partial sequences
Former-commit-id: abd6112ac558592616c197f2fe761880a847b031
Former-commit-id: d2ea2e7e306512bf8ef92d3497a0302919b81010
2016-10-05 15:11:26 +02:00
d4da1d01fd A new set of protein cleaned for the CDS detector prepared using the
clusterizecore.sh script from the detectors/cds/lib folder.

The CDS detector is now modified to use the clean.fst files.


Former-commit-id: e30a53b5b6b658388af4b2640b30e6765c729894
Former-commit-id: 3015ad50d25248fb117ab00e816b00fde1f9ba1d
2016-10-05 09:31:24 -03:00
3a8860aaf7 Add the possibility to annotate partially sequenced genome.
Add the print of a source feature.

Former-commit-id: b4415ad285b00099a24094ab5e6d244a37d709ac
Former-commit-id: 3e7e958573b476c0e62e3e9dcaafec9ed1275b50
2016-08-08 14:44:08 +02:00
466308267e Add a patch for chloroplast annotation when no inverted repeats are
detected

Former-commit-id: 7e3ddd41cf0d0788223382fedbf45b183974233e
Former-commit-id: e5a8ceb825f78d243e37d22cd6b2e91f403c0ee8
2016-05-02 15:32:28 +02:00
8a1a1d57ba remove the printing of an extra empty line at the beginning of the embl
file


Former-commit-id: bba6581b86f21474d6e9b2acb3a9b4937984bbb8
Former-commit-id: 7570b9c21814614c45e57e253bdfae58c9ba3c2b
2016-05-02 12:25:05 +02:00
8113b80d47 Add annotation of nuclear rDNA cistron
Former-commit-id: ee54019ddddbea4d17956622968f6ce673b609e1
Former-commit-id: 5e5381cf59409ca3dc01098b0e3f330efe0a6a32
2016-05-02 10:56:40 +02:00
7d04371387 Add ITSx to th src
Former-commit-id: 7f75da850aa538ea79e633d2a600e3c42558cdba
Former-commit-id: 96200ce9daf7a9fd51d532cc2403611eaf8026f1
2016-04-28 10:08:10 +02:00
890605039b Merge branch 'master' of ssh://git@git.metabarcoding.org/org-asm/org-annotate.git
Former-commit-id: 481c726b17189e0728abcbe467ce0f7e994f968f
Former-commit-id: 25634afe2d8ee729ddd03b66cf31c993a6a815ac
2016-04-25 23:44:33 +02:00
20d0bcfbf8 First trial to automatcally cleanup the core CDS database
Former-commit-id: dc61a61816084f385f1aa89324b08f81602b4353
Former-commit-id: ee8bf1a08e4af4f4d8d12a1e2a83c5f688e5f7e8
2016-04-25 23:41:18 +02:00
644f154050 Add a fasta1line function reformating the sequence with a line for the
header and a single line for the sequence

Former-commit-id: 619dc4f5515b0080e5696806f9325f90b983d22e
Former-commit-id: c244fdbb5c84bebf9ae17d6e15c0fd4b00914d32
2016-04-25 11:15:14 +02:00
6f00381000 Add license
Former-commit-id: 08aa22fcdc5156b0e7cbabb18b1bd5c02288f8cf
Former-commit-id: 750c6de83d833f8eba190888dc96bfa9ba8413ea
2016-04-23 18:05:04 +02:00
536a451510 call explicitely tcsh to workaround a path bug
Former-commit-id: e6c05a695a6872dd5fb8acd96ee031844dd21fa0
Former-commit-id: 7740135e0861b796e85fce0c9c62a4793f836c2b
2016-04-13 17:32:10 +02:00
f466f5505a Change tha dash bang of the csh shell scripts
Former-commit-id: 115a1955c5883ffd0909cb05e887f70fa561b6e6
Former-commit-id: 5e6be182d5a3ec910f5deed27014227f34bd4745
2016-04-13 16:51:58 +02:00
69434c5b86 Add the latest tcsh able to deal with large PATH (at least 4096)
Former-commit-id: 32011d9b239e2c5ed93646a8173b285f377693a3
Former-commit-id: 6e804387bfacfc4e9242ef3f7014642044f3aa2c
2016-04-13 16:21:50 +02:00
756d541a82 Try to solve to "Warning: ridiculously long PATH truncated" message
generated by the csh shell.

Perhaps will we have to avoid this shell ;-)

Former-commit-id: 87663c6f9de10be9862b5fb2d47aa0d720d657f4
Former-commit-id: d776dc96b5440b731a4e2ab508f7c4fdee411024
2016-04-13 13:04:28 +02:00
721816ae87 Solve #18
Former-commit-id: 0ac744e900340ff27c2d65c4707e673baf9d1671
Former-commit-id: de3721c7af20985c63c0f96e22bb190b9a597e45
2016-04-13 12:47:41 +02:00
b55cf31e4b Patch a bug leading to add extra quote to command line parametters
Former-commit-id: f84fd562054be66141b10621da80c9bd8817a93e
Former-commit-id: be585a9ed7e20e2ecd2898b575b5efd2c7ceea22
2016-02-04 16:00:28 +01:00
b2cc474077 Add the cd-hit program to the binaries
Former-commit-id: 0ecdb8d90e9e78940ec9b109c3a0b484bcacf14d
Former-commit-id: 4c3551f7bca6c203fb7ffbba6c2e37d15d763487
2016-01-27 13:09:43 +01:00
647695d2a2 Patch a compilation bug for getopt
Former-commit-id: 0ea287f1ca4ed7808577b3e0a12c568da10f6ee8
Former-commit-id: 361c4e3e00ed587a7d7ca684fa92fafefd3cda83
2016-01-05 17:19:37 +01:00
f39e66d4c1 integrate options in the main script
Former-commit-id: e046ef88288559ed68463d3e9fd334ff0a8d1ec6
Former-commit-id: a2f99a532f7ce1b3114385c49affc197419a04d9
2015-12-18 22:05:51 +01:00
99865ad568 Add the standard getopt external command allowing to deal with short and
long options in scripts

Former-commit-id: 128f06564f712724c66304c46b32057a2f0943c3
Former-commit-id: 98246ddfe8807c4269408452b1bd71accd993a27
2015-12-18 11:47:56 +01:00
ab37af3b03 Add the name of the org.annot pipeline in the CDS inference
Former-commit-id: 497194fafc15da0d80ee7dcb4cf11551d21061bd
Former-commit-id: ea502a0d75d7ff638258a5a15b8ff759cd6e28fa
2015-12-18 08:56:55 +01:00
a4e053989b Specify the genetic code during the aragorn call.
Former-commit-id: 6f18008c34dcb33059accc02edef681a26848416
Former-commit-id: a7313f06a23a307a0384b88e3bc8a1d7b9292e07
2015-12-18 08:39:48 +01:00
cf54e7dcb1 Close #15. Actually the bug in intron location was related to a
misinterpretation of the aragorn output format. Now tRNA, and intron
location are coherent with most of the locations extracted from genbank
file with one or 2 base pairs of difference.

Former-commit-id: dac4fb731e0edaeaebde9edc5350fce38ad99601
Former-commit-id: f8a0590342aec2db1fe5deb4475b8a9380891a48
2015-12-18 08:39:04 +01:00
d09a1fc40a Ignore the build directory
Former-commit-id: 476e40707e6c884c2abfaecabbefe2691150eaa8
Former-commit-id: 2850ad1d963c6167c583c355936214ea2af6f963
2015-11-29 19:03:54 +01:00
89c4f17fc4 Patch a bug on the generation of the location of tRNA for gene on
reverse complement strand with an intron.

Former-commit-id: 729905450d60c9b2e76ac73567b3efb09cb1bb86
Former-commit-id: 722dc77682ef3da8a746879c52072c46adb9de71
2015-11-28 16:11:14 +01:00
b7282fb30d minor addition in cds/compare
Former-commit-id: e865ea931fb2fc76f49b72d823eda712138647e3
Former-commit-id: 3d8d2bd249907fa4fbb7fae2ee06cf6090f62d5e
2015-11-15 13:13:36 +01:00
2d404b5b24 removed need of R igraph from chlorodb/subdb
Former-commit-id: 574aace9be5804d728a877110f5f475d61644f75
Former-commit-id: 2e7ea63447643830a62f18a364327d7b396ec140
2015-11-14 22:13:55 +01:00
d83201fd2f minor bug in chlorodb
Former-commit-id: 7017655ac86e7b7837c7b581bf8a1abb86c08b30
Former-commit-id: dcedd4e32e3c7ce302eed94abd2b975a4506df97
2015-11-14 15:16:16 +01:00
a6ee2f0781 corrected bug in ncbiblast src/build-system/Makefile.in.top
Former-commit-id: edd07fd725f2ff1872d64bde8026ff41a68733c9
Former-commit-id: a8129cd2857df2414db1754da72ec8ad17465bd5
2015-11-14 03:21:51 +01:00
7cffc9b412 corrected compilation bug muscle on linux
Former-commit-id: 87269138a1b31f83bebc4a3f53f32d6b4c5b8289
Former-commit-id: 943c2fba5e4d2c1bfa4c785b930b97835c6669e1
2015-11-14 01:32:11 +01:00
3acb2b4ea0 squared src - added missing portclean targets
Former-commit-id: d233d03829d67d33161a1c7ea8f50e09d0a758f7
Former-commit-id: 24a9b61d46451c7dcf6ed5bd222cef3cc3cfc218
2015-11-14 00:39:04 +01:00
ac6dfb38e0 squared several distro in src - clean target added
Former-commit-id: ebb0e3167e76f9e9dbacde38c695e1f4533c24d4
Former-commit-id: c83d3ed57dc860d1e5a6c41481f850b41f04f0fc
2015-11-14 00:25:39 +01:00
085a5ff2a7 squared muscle
Former-commit-id: 54638fc7623ec0485109b0b5879144f8f2514213
Former-commit-id: 23f43cd31169d95df12ae1c207d9fd1c924d26b1
2015-11-14 00:11:27 +01:00
d52ebf0359 squared hmmer distro
Former-commit-id: 2d4971feca9edf04700af6f7765764123997f5c4
Former-commit-id: 97755cfa4930ae468817c1d11393652264df4c2a
2015-11-13 23:30:03 +01:00
80d7027d70 clean gawk distro
Former-commit-id: 5b6234f3446036d831aa131ec7617798a39b86b1
Former-commit-id: f8271890bc649168280f5d634c721e4883976da1
2015-11-13 23:00:27 +01:00
6f43ede11e cds test on core and shell
Former-commit-id: 9be1f2c23d00a2678489090c4f6d04ffc0124061
Former-commit-id: 823ca0890900bf6f81b158cafc46c78049fcf080
2015-11-13 22:41:34 +01:00