Skip to content

An R script to run the pathfindR enrichment workflow

License

Notifications You must be signed in to change notification settings

sing-group/run-pathfindR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Run pathfindR

The run-pathfindR.R script works in pathfindR v2.4.2 and implements the most common pathfindR workflow using these four input files:

  1. DEA results: path to the input file with the DEA results. It must be a CSV file with at least these four columns: gtf_gene_id, gene_name, logFC, and padj.
  2. Counts file: path to the CSV file with the counts for the samples (rows are genes and columns are samples; genes are specified as Ensembl IDs).
  3. Metadata file: path to the TSV file with the samples metadata.
  4. Reference file: path to the file with the reference condition (only one line with the condition name).

The script also takes three additional parameters:

  1. output directory: path to the directory where results should be stored.
  2. gene sets: the gene sets to be used for enrichment analysis, one of: KEGG, Reactome, BioCarta, GO-All, GO-BP, GO-CC and GO-MF (all for Homo sapiens)
  3. pin: the protein interaction network to be used for enrichment analysis, one of: Biogrid, STRING, GeneMania, IntAct, KEGG, and mmu_STRING

Motivation

This script was created in the context of a Compi pipeline for RNA-Seq data analysis. In this pipeline, DElite was used for performing DEA and so this is the input CSV file to the script. This file contains a column with name "gtf_gene_id" and was postprocessed to add gene names. These two fields are used to convert the counts file (that contains the Gene IDs from the GTF file used previously) into gene names before using it in pathfindR. Hopefully this script can be reused for similar purposes!

Test

The example files are provided in the test folder. They can be used to run the script using the pegi3s/r_pathfindr:2.4.2 Docker image from the Bioinformatics Docker Images Project:

docker run --rm -v $(pwd):$(pwd) -w $(pwd) \
    pegi3s/r_pathfindr:2.4.2 \
        Rscript run-pathfindR.R \
            test/test.csv \
            test/counts.tsv \
            test/metadata.tsv \
            test/reference.txt \
            test/results \
            KEGG \
            STRING

About

An R script to run the pathfindR enrichment workflow

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages