-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy pathSnakefile
50 lines (40 loc) · 1.94 KB
/
Snakefile
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
"""
The EvoNAPS workflow
--------------------
This Snakemake workflow was designed to be applied to a target alignment the user whishes to import to
the EvoNAPS database. Note, that this workflow assumes that the EvoNAPS database already exists on your
system.
Other than the MySQL database, the workflow includes all neccessary scripts, software, and *.yaml
files to run.
- Third-party software can be found in workflow/bin.
- Python scipts can be found in workflow/scipts.
- The instructions to create the neccessary conda environments (yaml files) can be found in workflow/envs.
- The Snakemake rules for this workflow are stored in workflow/rules.
Setup
-----
Please refer to the README.md file for instructions on how to set up the EvoNAPS database.
Usage
-----
The Snakemake workflow is applied onto an alignment as such:
>> snakemake PATH/TO/alignment.fasta_summary.txt --config seq_type=DNA --use-conda
Please specify the target file "PATH/TO/alignment.fasta_summary.txt" if you wish
to import the alignment into the EvoNAPS database directly.
Use "PATH/TO/alignment.fasta_ali_parameters.tsv" if you only wish to run IQTree2
and parse out all alignment and tree information.
Use "PATH/TO/alignment.fasta.pythia" if you only wish to calculate the Pythia
difficulty score (source: https://github.com/tschuelia/PyPythia).
Note, that the sequence type needs to be set manually. Declare the
sequence type with --config seq_type=[ DNA | AA ] from the command line.
Also note, that the flag --use-conda is neccessary because different environments
are used for different rules, which requires access to conda.
"""
include: "workflow/rules/generate_data.smk"
include: "workflow/rules/import_to_db.smk"
rule all:
input:
"{ali_id}_ali_parameters.tsv",
"{ali_id}_seq_parameters.tsv",
"{ali_id}_model_parameters.tsv",
"{ali_id}_branch_parameters.tsv",
"{ali_id}_tree_parameters.tsv",
"{ali_id}_summary.txt"