Commit Graph

74 Commits

Author SHA1 Message Date
8f96517f3c small changes
Former-commit-id: 1fee30445f03ff627dab1c335e75c3f278621f6e
2023-11-07 21:20:45 +02:00
f991886155 Force json encoding to ascii
Former-commit-id: b35110f3fed5fedfc03380863ece5c95a2b9d6a7
2023-11-07 12:10:14 +02:00
185b974d13 Adds a JSON output format
Former-commit-id: 26f07460772c0f735bf705d473f892878d3e57f0
2023-11-07 11:56:49 +02:00
61c30f9b6a Patch rev complement and first implementation of --auto in obicsv
Former-commit-id: f3020e81283b1073c4d1c2d2ff0887e3998e6764
2023-11-07 09:37:07 +02:00
6a6a6f6f2c Correctly handle empty files
Former-commit-id: d166aa352ce4bf32739ddc2f7d1c9967918822fd
2023-10-16 15:34:06 +02:00
e8c55a2b6b optimize sequence readers and patch a bug in the format guesser
Former-commit-id: 9dce1e96c57ae9a88c26fac5c8e1bdcdc2c0c7a5
2023-10-13 21:52:57 +02:00
157c26cdc7 Patch a bug in the fasta and fastq readers
Former-commit-id: 4998f157a90a6b077124d87d4a5cde0dd075d1ce
2023-10-13 14:21:27 +02:00
0a8f7afb2a Patch sequence writers to remove definition from header when json format is used.
Former-commit-id: a8a74cccb52d577d4ea1c0ad88b121a8e511149d
2023-10-05 07:31:31 +02:00
d23a911080 Change the way sequence definition are managed. They are now when present stored as an attribute
Former-commit-id: 6e618377c05b42937d2eace3c9668390980ab68c
2023-10-05 07:21:12 +02:00
5c30ec354f Go implementation of fasta and fastq parser
Former-commit-id: 3f4fd355c169afbef2d5fef1f5e407aabb71d031
2023-09-03 19:16:37 +02:00
6d1ac60c48 Convert first nucleotide of sequence to lower case in fasta reader
Former-commit-id: 856bb3a39a4f1143a34b1f8b4d8d12b0151c0c3e
2023-09-01 09:40:02 +02:00
62b57f4ede A go implementation of the fasta reader
Former-commit-id: 603592c4761fb0722e9e0501d78de1bd3ba238fa
2023-09-01 09:30:12 +02:00
3f8c0d6a2f Replace MakeBioSequence call by NewBioSequence call,
Implements a new file format guesser
Adds some more API doc


Former-commit-id: 9837bf1c28beca6ddb599b367f93548950ba83c1
2023-08-30 19:59:46 +02:00
2a11adb346 Add some doc and switch to the parallel gzip library
Former-commit-id: 2c1187001f989ba3de5895f516d4c8b54d52a4c4
2023-08-25 14:36:38 +02:00
8a98210103 A small bug in logging and API documentation
Former-commit-id: 1ab4534581987586741db13d3453db52973524fe
2023-08-14 15:21:30 +02:00
70a77c9ec6 Adds doc
Former-commit-id: e720718686da480e1abd4302ebff145aeee5eb4a
2023-08-14 10:04:16 +02:00
fd4c7c2572 Adds some docs
Former-commit-id: 7e96983ed707b2a574425799c32c32243f4876cf
2023-08-14 00:12:17 +02:00
be47ec909c add the --skip-empty option
Former-commit-id: ec9cb0ecaf90a61bf9289cf4c089b5cc2fcb65a5
2023-07-17 14:24:02 +02:00
3f69fa41d6 Patch a bug for multiple amplicon per sequence.
Former-commit-id: b252d2de8e1a85d65c2951aa1958ee038e35741d
2023-03-31 15:10:25 +02:00
e7b9ba3f30 Limit allocation during genbank parsing
Former-commit-id: eee3c1fa7ffb79943109ee32dbf21e78bf11b14f
2023-03-28 22:42:58 +07:00
988ae79989 Optimize memory allocation of the apat algorithms
Former-commit-id: 5010c5a666b322715b3b81c1078d325e1f647ede
2023-03-28 19:37:05 +07:00
bc82422bc5 Reduce redundante call to bytes.ToLower and substitute the last call by an home made version doing the conversion in place
Former-commit-id: d9ea22f649d97be352f8dbb37acc1495df830118
2023-03-28 11:43:04 +07:00
a33e471b39 First attempt for obiconsensus... The graph traversing algorithm is too simple
Former-commit-id: 0456e6c7fd55d6d0fcf9856c40386b976b912cba
2023-03-27 19:51:10 +07:00
d5e84ec676 rename goutils to obiutils
Former-commit-id: 2147f53db972bba571dfdae30c51b62d3e69cec5
2023-03-24 10:25:12 +07:00
5fbe52368c Patch the empty batch bug
Former-commit-id: fcee04b58f2c4a0bf2c27792f991391c0b6ce78e
2023-03-07 20:16:06 +07:00
b3922c3896 Produce less weird crash on non existing files
Former-commit-id: 74bb27bd53c685be530632994bd2ba24c1f362e1
2023-03-07 17:34:25 +07:00
d88de15cdc Refactoring codes for removing buffer size options. An some other changes...
Former-commit-id: 10b57cc1a27446ade3c444217341e9651e89cdce
2023-03-07 11:12:13 +07:00
072b85e155 change the model for representing paired reads and extend its usage to other commands 2023-02-23 23:35:58 +01:00
56722895e4 Adds to every obitools the ability to save in gzip compressed files 2023-02-18 12:06:52 +01:00
d04eb915b3 adds a directory option to obidistribute 2023-02-17 22:52:53 +01:00
9554a32490 Add capacity to obidistribute to save gzipped files 2023-02-17 12:52:19 +01:00
8458c0cd8b Patch a bug in the genbank reader for the sequence longer than 10kb. 2023-02-17 10:54:03 +01:00
2975042982 Adds the possiblility to append to files to obidistribute 2023-02-16 16:13:13 +01:00
f56363a100 Patch an embl/genbank parser error 2023-02-16 13:30:42 +01:00
8b70b1a5d8 Last bug on sequence writing 2023-02-08 13:56:50 +01:00
526bf79c7f Patch for some lost of data during sequence writing 2023-02-08 13:14:26 +01:00
e5835310cc Simplify code of the JSON header formater 2023-01-31 17:16:55 +01:00
57c7187e6d Patch the JSON header formater 2023-01-31 17:15:10 +01:00
2d375df94f move the worker class to the obiseq package 2023-01-22 22:39:13 +01:00
f97f92df72 rename the iterator class 2023-01-22 22:04:17 +01:00
20b16c0ba1 Force sequence reading to produce lowercase sequences.
Adds two columns to the obiclean ratio csv file
2022-11-22 15:06:09 +01:00
29563aa94e Rename the Length methods Len to follow GO standart 2022-11-17 11:09:58 +01:00
09fc426b67 Refactoring related to iterators 2022-11-16 17:13:03 +01:00
6f853da9df Remove single sequence ierators. Only batch iterators persist 2022-11-16 10:58:59 +01:00
7f863fb383 Prevens errors on FASTA formating an empty sequence 2022-11-16 09:22:55 +01:00
65210755cc Patch reading of annotation from stdin 2022-09-29 14:48:05 +02:00
fe5d8e3340 Patch a bug in ngsfilter reading and sample matching 2022-09-28 14:29:19 +02:00
ebefa28cc0 Patch memory error related to []byte pool 2022-09-28 14:28:37 +02:00
49b4d97803 Remove HTML escaping to json marshaling 2022-09-20 22:01:52 +02:00
989e678f6b refactor code and change algorithm used to read from many files 2022-08-23 15:07:06 +02:00