SeqPig statistics

info info

Citations per year

Number of citations per year for the bioinformatics software tool SeqPig

Tool usage distribution map

info info

Associated diseases


Popular tool citations

chevron_left PaaS chevron_right
Want to access the full stats & trends on this tool?

SeqPig specifications


Unique identifier OMICS_01226
Name SeqPig
Software type Framework/Library
Interface Command line interface
Restrictions to use None
Operating system Unix/Linux
Parallelization MapReduce
License MIT License
Computer skills Advanced
Stability Stable
High performance computing Yes
Maintained Yes


No version available

Publication for SeqPig

SeqPig citations


Big Data Application in Biomedical Research and Health Care: A Literature Review

PMCID: 4720168
PMID: 26843812
DOI: 10.4137/BII.S31559

[…] rs embrace big data technology, novel methods are needed to integrate existing big data technologies with user-friendly operations. The following systems have been developed to help achieve this goal.SeqPig reduces the need for bioinformaticians to obtain the technological skills needed to use MapReduce. The SeqPig project extends the Apache Pig scripts to provide feature-rich sequence processing […]


Experiences with workflows for automating data intensive bioinformatics

Biol Direct
PMCID: 4539931
PMID: 26282399
DOI: 10.1186/s13062-015-0071-8

[…] ssing workload with frameworks such as Hadoop and Spark. To this end, we have participated in development of tools that allow bioinformatics data to be efficiently processed in Hadoop: Hadoop-BAM and SeqPig [, ]. This work is continued by integrating Hadoop and Spark into our IaaS environment and providing easy to use interfaces for data intensive computing. […]


A quantitative assessment of the Hadoop framework for analyzing massively parallel DNA sequencing data

PMCID: 4455317
PMID: 26045962
DOI: 10.1186/s13742-015-0058-5

[…] rough a generic adaptor like Hadoop Streaming library) regular Linux applications cannot generally be used directly on it. Two interesting frameworks to simplify scripting in Hadoop are BioPig [] and SeqPig [], which are both built on top of the Apache Pig framework and are capable of automatically distributing data and parallelizing tasks.At the moment, however, Hadoop is incompatible with conven […]

Want to access the full list of citations?
SeqPig institution(s)
Aalto University School of Science and Helsinki Institute for Information Technology HIIT, Finland; International Computer Science Institute, Berkeley, CA, USA; CRS4-Center for Advanced Studies, Research and Development in Sardinia, Italy; CSC-IT Center for Science, Finland

SeqPig reviews

star_border star_border star_border star_border star_border
star star star star star

Be the first to review SeqPig