
Commit

Merge branch 'dev' into nf-test-conversion
jfy133 authored Sep 20, 2024
2 parents 9ce930f + a32a29e commit 5598eec
Showing 67 changed files with 3,739 additions and 303 deletions.
1 change: 0 additions & 1 deletion .github/workflows/ci.yml
@@ -47,7 +47,6 @@ jobs:
- "test"
profile:
- "docker"

steps:
- name: Check out pipeline code
uses: actions/checkout@0ad4b8fadaa221de15dcec353f45205ec38ea70b # v4
50 changes: 25 additions & 25 deletions CITATION.cff
@@ -25,28 +25,28 @@ doi: 10.7717/peerj.10947
date-released: 2022-08-02
url: https://github.com/nf-core/eager
prefered-citation:
type: article
authors:
- family-names: Fellows Yates
given-names: James A.
- family-names: Lamnidis
given-names: Thiseas C.
- family-names: Borry
given-names: Maxime
- family-names: Andrades Valtueña
given-names: Aida
- family-names: Fagernãs
given-names: Zandra
- family-names: Clayton
given-names: Stephen
- family-names: Garcia
given-names: Maxime U.
- family-names: Neukamm
given-names: Judith
- family-names: Peltzer
given-names: Alexander
doi: 10.7717/peerj.10947
start: e10947
title: "Reproducible, portable, and efficient ancient genome reconstruction with nf-core/eager"
year: 2021
url: https://dx.doi.org/10.1038/10.7717/peerj.10947
type: article
authors:
- family-names: Fellows Yates
given-names: James A.
- family-names: Lamnidis
given-names: Thiseas C.
- family-names: Borry
given-names: Maxime
- family-names: Andrades Valtueña
given-names: Aida
- family-names: Fagernãs
given-names: Zandra
- family-names: Clayton
given-names: Stephen
- family-names: Garcia
given-names: Maxime U.
- family-names: Neukamm
given-names: Judith
- family-names: Peltzer
given-names: Alexander
doi: 10.7717/peerj.10947
start: e10947
title: "Reproducible, portable, and efficient ancient genome reconstruction with nf-core/eager"
year: 2021
url: https://dx.doi.org/10.1038/10.7717/peerj.10947
50 changes: 41 additions & 9 deletions CITATIONS.md
@@ -2,11 +2,11 @@

## [nf-core](https://pubmed.ncbi.nlm.nih.gov/32055031/)

> Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x. PubMed PMID: 32055031.
> Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020 Mar;38(3):276-278. doi: [10.1038/s41587-020-0439-x](https://doi.org/10.1038/s41587-020-0439-x). PubMed PMID: 32055031.
## [Nextflow](https://pubmed.ncbi.nlm.nih.gov/28398311/)

> Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820. PubMed PMID: 28398311.
> Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow enables reproducible computational workflows. Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: [10.1038/nbt.3820](https://doi.org/10.1038/nbt.3820). PubMed PMID: 28398311.
## Pipeline tools

@@ -16,11 +16,11 @@
- [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/)

> Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924.
> Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: [10.1093/bioinformatics/btw354](10.1093/bioinformatics/btw354). Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924.
- [Falco](https://doi.org/10.12688%2Ff1000research.21142.2)

> de Sena Brandine, G., Smith, A.D. (2019) Falco: high-speed FastQC emulation for quality control of sequencing data. F1000Res., 8, 1874. doi: [10.12688%2Ff1000research.21142.2](https://doi.org/10.12688%2Ff1000research.21142.2)
> de Sena Brandine, G., Smith, A.D. (2019). Falco: high-speed FastQC emulation for quality control of sequencing data. F1000Res., 8, 1874. doi: [10.12688%2Ff1000research.21142.2](https://doi.org/10.12688%2Ff1000research.21142.2)
- [fastp](https://doi.org/10.1093/bioinformatics/bty560)

@@ -32,7 +32,7 @@
- [Picard Tools](https://broadinstitute.github.io/picard/)

> Broad Institute (2019). Picard Toolkit. GitHub Repository: https://broadinstitute.github.io/picard/
> Broad Institute (2019). Picard Toolkit. GitHub Repository: [https://broadinstitute.github.io/picard/](https://broadinstitute.github.io/picard/)
- [SeqKit](https://bioinf.shenwei.me/seqkit/)

@@ -126,6 +126,38 @@

> Sex.DetERRmine.py Lamnidis, T.C. et al., 2018. Ancient Fennoscandian genomes reveal origin and spread of Siberian ancestry in Europe. Nature communications, 9(1), p.5018. Available at: http://dx.doi.org/10.1038/s41467-018-07483-5. Download: https://github.com/TCLamnidis/Sex.DetERRmine
- [MALT](https://www.nature.com/articles/s41559-017-0446-6)

> Vågene, Å.J., Herbig, A., Campana, M.G., Nelly, M., García, R., Warinner, C., Sabin, S., Spyrou, M.A., Valtueña, A.A., Huson, D., Tuross, N., Bos, K.I. & Krause, J. (2018). Salmonella enterica genomes from victims of a major sixteenth-century epidemic in Mexico. Nat Ecol Evol 2, 520–528. doi: [10.1038/s41559-017-0446-6](https://doi.org/10.1038/s41559-017-0446-6)
- [HOPS](https://doi.org/10.1186/s13059-019-1903-0)

> Hübler, R., Key, F.M., Warinner, C. et al. (2019). HOPS: automated detection and authentication of pathogen DNA in archaeological remains. Genome Biol 20, 280. doi: [10.1186/s13059-019-1903-0](https://doi.org/10.1186/s13059-019-1903-0)
- [MEGAN](https://doi.org/10.1371/journal.pcbi.1004957)

> Huson DH, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, et al. (2016) MEGAN Community Edition - Interactive Exploration and Analysis of Large-Scale Microbiome Sequencing Data. PLoS Comput Biol 12(6): e1004957. doi: [10.1371/journal.pcbi.1004957] https://doi.org/10.1371/journal.pcbi.1004957
- [Kraken2](https://doi.org/10.1186/s13059-019-1891-0)

> Wood, Derrick E., Jennifer Lu, and Ben Langmead. 2019. Improved Metagenomic Analysis with Kraken 2. Genome Biology 20 (1): 257. doi: [10.1186/s13059-019-1891-0](https://doi.org/10.1186/s13059-019-1891-0).
- [KrakenUniq](https://doi.org/10.1186/s13059-018-1568-0)

> Breitwieser, Florian P., Daniel N. Baker, and Steven L. Salzberg. 2018. KrakenUniq: confident and fast metagenomics classification using unique k-mer counts. Genome Biology 19 (1): 198. doi: [10.1186/s13059-018-1568-0](https://doi.org/10.1186/s13059-018-1568-0)
- [MetaPhlAn](https://doi.org/10.1038/s41587-023-01688-w)

> Blanco-Míguez, A., Beghini, F., Cumbo, F. et al. Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4. Nat Biotechnol (2023). doi: [10.1038/s41587-023-01688-w](https://doi.org/10.1038/s41587-023-01688-w)
- [TAXPASTA](https://doi.org/10.21105/joss.05627)

> Beber et al., (2023). TAXPASTA: TAXonomic Profile Aggregation and STAndardisation. Journal of Open Source Software, 8(87), 5627, doi: [10.21105/joss.05627](https://doi.org/10.21105/joss.05627)
- [CircularMapper](https://doi.org/10.1186/s13059-016-0918-z)

> Peltzer, A., Jäger, G., Herbig, A., Seitz, A., Kniep, C., Krause, J., & Nieselt, K. (2016). EAGER: efficient ancient genome reconstruction. Genome Biology, 17(1), 1–14. doi: [10.1186/s13059-016-0918-z](https://doi.org/10.1186/s13059-016-0918-z)
## Software packaging/containerisation tools

- [Anaconda](https://anaconda.com)
@@ -134,16 +166,16 @@
- [Bioconda](https://pubmed.ncbi.nlm.nih.gov/29967506/)

> Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: 10.1038/s41592-018-0046-7. PubMed PMID: 29967506.
> Grüning B, Dale R, Sjödin A, Chapman BA, Rowe J, Tomkins-Tinch CH, Valieris R, Köster J; Bioconda Team. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat Methods. 2018 Jul;15(7):475-476. doi: [10.1038/s41592-018-0046-7](https://doi.org/10.1038/s41592-018-0046-7). PubMed PMID: 29967506.
- [BioContainers](https://pubmed.ncbi.nlm.nih.gov/28379341/)

> da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: 10.1093/bioinformatics/btx192. PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671.
> da Veiga Leprevost F, Grüning B, Aflitos SA, Röst HL, Uszkoreit J, Barsnes H, Vaudel M, Moreno P, Gatto L, Weber J, Bai M, Jimenez RC, Sachsenberg T, Pfeuffer J, Alvarez RV, Griss J, Nesvizhskii AI, Perez-Riverol Y. BioContainers: an open-source and community-driven framework for software standardization. Bioinformatics. 2017 Aug 15;33(16):2580-2582. doi: [10.1093/bioinformatics/btx192](https://doi.org/10.1093/bioinformatics/btx192). PubMed PMID: 28379341; PubMed Central PMCID: PMC5870671.
- [Docker](https://dl.acm.org/doi/10.5555/2600239.2600241)

> Merkel, D. (2014). Docker: lightweight linux containers for consistent development and deployment. Linux Journal, 2014(239), 2. doi: 10.5555/2600239.2600241.
> Merkel, D. (2014). Docker: lightweight linux containers for consistent development and deployment. Linux Journal, 2014(239), 2. doi: [10.5555/2600239.2600241](https://doi.org/10.5555/2600239.2600241).
- [Singularity](https://pubmed.ncbi.nlm.nih.gov/28494014/)

> Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: 10.1371/journal.pone.0177459. eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.
> Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS One. 2017 May 11;12(5):e0177459. doi: [10.1371/journal.pone.0177459](https://doi.org/10.1371/journal.pone.0177459). eCollection 2017. PubMed PMID: 28494014; PubMed Central PMCID: PMC5426675.
4 changes: 2 additions & 2 deletions README.md
@@ -75,8 +75,8 @@ Additional functionality contained by the pipeline currently includes:
#### Metagenomic Screening

- Low-sequenced complexity filtering (`BBduk` or `PRINSEQ++`)
- Taxonomic binner with alignment (`MALT`)
- Taxonomic binner without alignment (`Kraken2`)
- Taxonomic binner with alignment (`MALT` or `MetaPhlAn 4`)
- Taxonomic binner without alignment (`Kraken2`,`KrakenUniq`)
- aDNA characteristic screening of taxonomically binned data from MALT (`MaltExtract`)

#### Functionality Overview
16 changes: 15 additions & 1 deletion assets/schema_fasta.json
@@ -46,7 +46,21 @@
"circular_target": {
"type": "string",
"pattern": "^\\S+$",
"errorMessage": "The headers of the chromosome to be extended by circularmapper must not contain any spaces and no leading '>'."
"errorMessage": "The headers of the chromosome extended by circulargenerator must not contain any spaces and no leading '>'."
},
"circularmapper_elongatedfasta": {
"type": "string",
"format": "file-path",
"pattern": "^\\S+\\.f(na|asta|a|as)(\\.gz)?$",
"exists": true,
"errorMessage": "The elongated Fasta files for the mapping reference must be provided with file extensions '.fasta', '.fa', '.fas', '.fna', '.fasta.gz','.fa.gz','.fas.gz', '.fna.gz' and cannot contain any spaces."
},
"circularmapper_elongatedindex": {
"type": "string",
"format": "directory-path",
"pattern": "^\\S+$",
"exists": true,
"errorMessage": "The directories of the index files for the elongated mapping reference for circularmapper must not contain any spaces and have file extensions ''."
},
"mitochondrion_header": {
"type": "string",
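
The `pattern` added for `circularmapper_elongatedfasta` in the hunk above can be checked by hand with a short script. This is only a sketch: the helper name `is_valid_elongated_fasta` and the example file names are hypothetical and do not come from the pipeline. It also covers only the regular expression, not the schema's `exists` or `format` checks.

```python
import re

# Pattern copied from the `circularmapper_elongatedfasta` entry in the diff,
# with the JSON string escaping (\\) reduced to plain regex escaping (\).
ELONGATED_FASTA_PATTERN = re.compile(r"^\S+\.f(na|asta|a|as)(\.gz)?$")


def is_valid_elongated_fasta(path: str) -> bool:
    """Hypothetical helper: True if `path` satisfies the schema's pattern."""
    # JSON Schema patterns use search semantics; the ^ and $ anchors in the
    # pattern itself are what force a full-string match.
    return ELONGATED_FASTA_PATTERN.search(path) is not None


if __name__ == "__main__":
    # Made-up file names, chosen to exercise each branch of the pattern.
    examples = [
        "ref_elongated.fasta",
        "ref.fa.gz",
        "ref.fna",
        "my ref.fasta",   # contains a space
        "ref.fastq",      # extension not in the allowed set
        "ref.fasta.bz2",  # only .gz compression is allowed
    ]
    for name in examples:
        print(f"{name!r}: {is_valid_elongated_fasta(name)}")
```

Under these assumptions, the first three names are accepted and the last three rejected, in line with the extensions listed in the entry's `errorMessage`.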