BICF SRA Downloading Pipeline
Main Authors: | Jonathan R. Gesell, Jeremy A. Mathews, Venkat S. Malladi |
---|---|
Format: | info software |
Published: | 2020 |
Subjects: | |
Online Access: |
https://zenodo.org/record/3739789 |
Table of Contents:
- This pipeline is a simple method for downloading bulk SRA data, renaming it, and preparing it for use in our other Astrocyte pipelines at the BICF, UT Southwestern Dept. of Bioinformatics. It uses Nextflow, a bioinformatics workflow tool, to download multiple files simultaneously with NCBI's SRA Toolkit, which speeds up the process. After each file has been downloaded, the pipeline runs FastQC to check that file's quality metrics; once all downloads have finished, it runs MultiQC on the entire data set. The pipeline was designed for a SLURM setup, as used by the BioHPC Cluster, but should be adaptable to nearly any system supported by Nextflow. The pipeline is maintained in a Git repository.
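
The flow described above (parallel download, rename, per-file FastQC, then one MultiQC run) can be sketched in Python as follows. This is only an illustration under stated assumptions, not the pipeline's actual Nextflow code: the function names, the accession-to-sample-name mapping, and the use of `fasterq-dump` as the SRA Toolkit download step are all assumptions, and SLURM submission is not modeled.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def download_cmd(accession, outdir):
    # Assumption: fasterq-dump (SRA Toolkit) is the download step.
    return ["fasterq-dump", accession, "--outdir", str(outdir)]


def fastqc_cmd(fastq, outdir):
    # Per-file quality metrics.
    return ["fastqc", str(fastq), "--outdir", str(outdir)]


def multiqc_cmd(qc_dir):
    # Aggregate report over every FastQC result in qc_dir.
    return ["multiqc", str(qc_dir), "--outdir", str(qc_dir)]


def fetch_one(accession, sample_name, outdir, run):
    """Download one accession, rename its FASTQ files, and run FastQC on each."""
    run(download_cmd(accession, outdir))
    renamed = []
    # Match ACC.fastq (single-end) and ACC_1.fastq / ACC_2.fastq (paired-end)
    # without also matching longer accessions that share the same prefix.
    for pattern in (f"{accession}.fastq", f"{accession}_*.fastq"):
        for path in sorted(Path(outdir).glob(pattern)):
            target = path.with_name(path.name.replace(accession, sample_name, 1))
            path.rename(target)
            run(fastqc_cmd(target, outdir))
            renamed.append(target)
    return renamed


def run_pipeline(samples, outdir, run=None, max_workers=4):
    """samples maps accession -> desired sample name (a hypothetical input format)."""
    if run is None:
        run = lambda cmd: subprocess.run(cmd, check=True)
    Path(outdir).mkdir(parents=True, exist_ok=True)
    # Downloads run concurrently; FastQC follows each download in the same worker.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [
            pool.submit(fetch_one, acc, name, outdir, run)
            for acc, name in samples.items()
        ]
        files = [f for fut in futures for f in fut.result()]
    # MultiQC runs once, only after every download and FastQC job has finished.
    run(multiqc_cmd(outdir))
    return files
```

Calling `run_pipeline({"SRR000001": "control_rep1"}, "fastq_out")` would require the SRA Toolkit, FastQC, and MultiQC on the `PATH`; the accession and sample name here are placeholders. The `run` parameter lets the command runner be swapped out, for example with a function that only logs commands for a dry run.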