We present a framework that facilitates parallel execution of protein structure analysis tools to be carried out on the entire (or subsets of) the Protein Databank (PDB) using the Apache Hadoop platform. Our design enables structural Biologists to use the Hadoop platform without having to explicitly write Map-Reduce code. It is easily scalable and uses a mapper architecture that functions on a stand- alone basis or can be extended to include further Map-Reduce operations.
|Publication status||Published - Jul 2015|
|Event||3DSig Structural Bioinformatics and Computational Biophysics 2015 - Dublin, Ireland|
Duration: 10 Jul 2015 → 11 Jul 2015
|Conference||3DSig Structural Bioinformatics and Computational Biophysics 2015|
|Period||10/07/15 → 11/07/15|