CWL run of Alignment Workflow (CWLProv 0.6.0 Research Object)

Main Author: Soiland-Reyes, Stian
Other Authors: Khan, Farah Zaib
Format: Dataset
Terbitan: Mendeley , 2018
Subjects:
Online Access: https:/data.mendeley.com/datasets/6wtpgr3kbj
ctrlnum 0.17632-6wtpgr3kbj.1
fullrecord <?xml version="1.0"?> <dc><creator>Soiland-Reyes, Stian</creator><title>CWL run of Alignment Workflow (CWLProv 0.6.0 Research Object)</title><publisher>Mendeley</publisher><description>The CWL alignment workflow included in this case study is designed by Data Biosphere. It adapts the alignment pipeline originally developed at Abecasis Lab, The University of Michigan. This workflow is part of NIH Data Commons initiative and comprises of four stages. First step, "Pre-align'' accepts a Compressed Alignment Map (CRAM) file (a compressed format for BAM files developed by European Bioinformatics Institute (EBI)) and human genome reference sequence as input and using underlying software utilities of SAMtools such as view, sort and fixmate returns a list of fastq files which can be used as input for the next step. The next step "Align'' also accepts the human reference genome as input along with the output files from "Pre-align'' and uses BWA-mem to generate aligned reads as BAM files. SAMBLASTER is used to mark duplicate reads and SAMtools view to convert read files from SAM to BAM format. The BAM files generated after "Align'' are sorted with "SAMtool sort''. Finally, these sorted alignment files are merged to produce single sorted BAM file using SAMtools merge in "Post-align'' step. This dataset folder is a CWLProv Research Object that captures the Common Workflow Language execution provenance, see https://w3id.org/cwl/prov/0.6.0 or use https://pypi.org/project/cwlprov/ to explore</description><subject>Bioinformatics</subject><subject>Workflow Management</subject><subject>Provenance</subject><contributor>Khan, Farah Zaib</contributor><type>Other:Dataset</type><identifier>10.17632/6wtpgr3kbj.1</identifier><rights>Creative Commons Attribution 4.0 International</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><relation>https:/data.mendeley.com/datasets/6wtpgr3kbj</relation><date>2018-12-04T14:21:05Z</date><recordID>0.17632-6wtpgr3kbj.1</recordID></dc>
format Other:Dataset
Other
author Soiland-Reyes, Stian
author2 Khan, Farah Zaib
title CWL run of Alignment Workflow (CWLProv 0.6.0 Research Object)
publisher Mendeley
publishDate 2018
topic Bioinformatics
Workflow Management
Provenance
url https:/data.mendeley.com/datasets/6wtpgr3kbj
contents The CWL alignment workflow included in this case study is designed by Data Biosphere. It adapts the alignment pipeline originally developed at Abecasis Lab, The University of Michigan. This workflow is part of NIH Data Commons initiative and comprises of four stages. First step, "Pre-align'' accepts a Compressed Alignment Map (CRAM) file (a compressed format for BAM files developed by European Bioinformatics Institute (EBI)) and human genome reference sequence as input and using underlying software utilities of SAMtools such as view, sort and fixmate returns a list of fastq files which can be used as input for the next step. The next step "Align'' also accepts the human reference genome as input along with the output files from "Pre-align'' and uses BWA-mem to generate aligned reads as BAM files. SAMBLASTER is used to mark duplicate reads and SAMtools view to convert read files from SAM to BAM format. The BAM files generated after "Align'' are sorted with "SAMtool sort''. Finally, these sorted alignment files are merged to produce single sorted BAM file using SAMtools merge in "Post-align'' step. This dataset folder is a CWLProv Research Object that captures the Common Workflow Language execution provenance, see https://w3id.org/cwl/prov/0.6.0 or use https://pypi.org/project/cwlprov/ to explore
id IOS7969.0.17632-6wtpgr3kbj.1
institution Universitas Islam Indragiri
affiliation onesearch.perpusnas.go.id
institution_id 804
institution_type library:university
library
library Teknologi Pangan UNISI
library_id 2816
collection Artikel mulono
repository_id 7969
city INDRAGIRI HILIR
province RIAU
shared_to_ipusnas_str 1
repoId IOS7969
first_indexed 2020-04-08T08:29:35Z
last_indexed 2020-04-08T08:29:35Z
recordtype dc
_version_ 1686587752712241152
score 17.538404