Apache Hadoop statistics


Apache Hadoop specifications


Unique identifier: OMICS_01210
Name: Apache Hadoop
Software type: Framework/Library
Interface: Command line interface
Restrictions to use: None
Operating system: Unix/Linux
Parallelization: MapReduce
Computer skills: Advanced
Stability: Stable
High performance computing: Yes
Maintained: Yes
Wikipedia: https://en.wikipedia.org/wiki/Apache_Hadoop


No version available

Apache Hadoop citations


Hadoop Oriented Smart Cities Architecture

PMCID: 5948833
PMID: 29649172
DOI: 10.3390/s18041181

[…] ator, when scalability is salient or when controlling not merely Hadoop jobs but a whole data center is necessary. There are incubating projects, e.g., Apache Myriad that “enables the co-existence of Apache Hadoop and Apache Mesos on the same physical infrastructure. By running Hadoop YARN as a Mesos framework, YARN applications and Mesos frameworks can run side-by-side, dynamically sharing cluste […]


Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

PMCID: 5741828
PMID: 29222076
DOI: 10.2196/medinform.9170

[…] enced repository of scientific articles, and the social media blog posts existing in MedHelp, Patient, and WebMD. The model was developed using the word2vec neural network architectural on top of the Apache Hadoop cluster, Apache Spark, and Elasticsearch No-SQL distributed database to tackle efficient big data ADE identification. We accomplished extensive experimental validations to ensure that th […]


A parallel approximate string matching under Levenshtein distance on graphics processing units using warp shuffle operations

PLoS One
PMCID: 5634649
PMID: 29016700
DOI: 10.1371/journal.pone.0186251

[…] he works of paper presented several approaches for checking the similarity between the large-scale data of RNA/DNA sequences. The proposed approaches were developed based on MapReduce model with Apache Hadoop platform. These approaches have achieved significant performance and scalability. However, there was a problem of combining GPUs and Hadoop platform. In this case, GPUs should be driv […]


MMTF—An efficient file format for the transmission, visualization, and analysis of macromolecular structures

PLoS Comput Biol
PMCID: 5473584
PMID: 28574982
DOI: 10.1371/journal.pcbi.1005575

[…] dual files is inefficient, a single Hadoop Sequence file (https://wiki.apache.org/hadoop/SequenceFile) is provided. These files can be efficiently processed in parallel by Big Data frameworks such as Apache Hadoop (http://hadoop.apache.org/) or Apache Spark (http://spark.apache.org/). The preferred access to MMTF data is via the provided decoder APIs, which are available through open source GitHub […]


GENE IS: Time Efficient and Accurate Analysis of Viral Integration Events in Large Scale Gene Therapy Data

Mol Ther Nucleic Acids
PMCID: 5363413
PMID: 28325279
DOI: 10.1016/j.omtn.2016.12.001

[…] ference or vector genome, which is a crucial factor for flexible investigation. We also consider here that VISPA was not tested in the present study because it requires the installation of Galaxy and Apache Hadoop as the data processing framework. In our opinion, the hardware and software requirements needed to install and use VISPA are a substantial limitation and far beyond the possibilities of […]


A Scalable Data Access Layer to Manage Structured Heterogeneous Biomedical Data

PLoS One
PMCID: 5148592
PMID: 27936191
DOI: 10.1371/journal.pone.0168004

[…] lies the use of a big data search and query platform, therefore designed to be scalable. For our baseline solution, we chose the latter approach, which is the safer and simpler of the two. We adopted Apache Hadoop MapReduce, one of the most well-known big data frameworks for writing scalable applications. This framework has already been applied to processing big data in the medical sector. The […]

