nf-core/differentialabundance
Differential abundance analysis for feature/ observation matrices from platforms such as RNA-seq
1.0.0
). The latest
stable release is
1.5.0
.
Define where the pipeline should find input data and save output data.
Path to comma-separated file containing information about the samples in the experiment.
string
^\S+\.(csv|tsv|txt)$
A CSV file describing sample contrasts
string
^\S+\.csv$
TSV-format abundance matrix
string
^\S+\.(tsv|csv|txt)$
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
A string to identify results in the output directory
string
study
A string identifying the technology used to produce the data
string
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
A text file listing technical features (e.g. spikes)
string
When set, use the control features in scaling/ normalisation
boolean
Rmd report template from which to create the pipeline report
string
^\S+\.Rmd$
A logo to display in the report instead of the generic pipeline logo
string
docs/images/nf-core-differentialabundance_logo_light.png
CSS to use to style the output, in lieu of the default nf-core styling
string
assets/nf-core_style.css
A markdown file containing citations to include in the fiinal report
string
CITATIONS.md
Column in the samples sheet to be used as the primary sample identifier
string
sample
Type of observation
string
sample
Options related to features
Feature ID attribute in the GTF file (e.g. the gene_id field)
string
gene_id
Feature name attribute in the GTF file (e.g. the gene symbol field)
string
gene_name
Type of feature we have, often ‘gene’
string
gene
Options related to filtering upstream of differential analysis
Minimum abundance value
integer
1
Minimum observations that must pass the threshold to retain the row/ feature (e.g. gene).
number
1
A minimum proportion of observations, given as a number between 0 and 1, that must pass the threshold. Overrides minimum_samples
number
An optional grouping variable to be used to calculate a min_samples value
string
Options related to data exploration
Clustering method used in dendrogram creation
string
ward.D2
Correlation method used in dendrogram creation
string
spearman
Number of features selected before certain exploratory analyses
integer
500
Length of the whiskers in boxplots as multiple of IQR. Defaults to 1.5.
number
1.5
Threshold on MAD score for outlier identification
integer
-5
How should the main grouping variable be selected? ‘auto_pca’, ‘contrasts’, or a valid column name from the observations table.
string
auto_pca
Options related to differential operations
The suffix associated tabular differential results tables
string
.deseq2.results.tsv
The feature identifier column in differential results tables
string
gene_id
The fold change column in differential results tables
string
log2FoldChange
The p value column in differential results tables
string
pvalue
The q value column in differential results tables.
string
padj
Minimum fold change used to calculate differential feature numbers
integer
2
Maximum q value used to calculate differential feature numbrers
number
0.05
Where a features file (GTF) has been provided, what attributed to use to name features
string
gene_name
Indicate whether or not fold changes are on the log scale (default is to assume they are)
boolean
true
test
parameter passed to DESeq()
string
fitType
parameter passed to DESeq()
string
sfType
parameter passed to DESeq()
string
‘minReplicatesForReplace’ parameter passed to DESeq()
integer
7
useT
parameter passed to DESeq2
boolean
independentFiltering
parameter passed to results()
boolean
true
lfcThreshold
parameter passed to results()
integer
altHypothesis
parameter passed to results()
string
greaterAbs
pAdjustMethod
parameter passed to results()
string
BH
alpha
parameter passed to results()
number
0.1
minmu
parameter passed to results()
number
0.5
variance stabilisation method to use when making a variance stabilised matrix
string
Shink fold changes in results?
boolean
true
Number of cores
integer
1
blind
parameter for rlog() and/ or vst()
boolean
true
nsub
parameter passed to vst()
integer
1000
Set to run GSEA to infer differential gene sets in contrasts
boolean
Permutation type
string
Number of permutations
integer
1000
Enrichment statistic
string
Metric for ranking genes
string
Gene list sorting mode
string
Gene list ordering mode
string
Max size: exclude larger sets
integer
500
Min size: exclude smaller sets
integer
15
Normalisation mode
string
Randomization mode
string
Make detailed geneset report?
boolean
true
Use median for class metrics
boolean
Number of markers
integer
100
Plot graphs for the top sets of each phenotype
integer
20
Seed for permutation
string
timestamp
Save random ranked lists
boolean
Make a zipped file with all reports
boolean
Gene sets in GMT or GMX-format
string
None
Reference genome related files and options required for the workflow.
Name of iGenomes reference.
string
Genome annotation file in GTF format
string
^\S+\.gtf(\.gz)?
Directory / URL base for iGenomes references.
string
s3://ngi-igenomes/igenomes
Do not load the iGenomes reference config.
boolean
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Set the top limit for requested resources for any single job.
Maximum number of CPUs that can be requested for any single job.
integer
16
Maximum amount of memory that can be requested for any single job.
string
128.GB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Maximum amount of time that can be requested for any single job.
string
240.h
^(\d+\.?\s*(s|m|h|day)\s*)+$
Less common options for the pipeline, typically set in a config file.
Display help text.
boolean
Display version and exit.
boolean
Method used to save pipeline results to output directory.
string
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Send plain-text email instead of HTML.
boolean
Do not use coloured log outputs.
boolean
Incoming hook URL for messaging service
string
Directory to keep pipeline Nextflow logs and reports.
string
${params.outdir}/pipeline_info
Boolean whether to validate parameters against the schema at runtime
boolean
true
Show all params when using --help
boolean