Abstract
We present a framework that facilitates parallel execution of protein structure analysis tools to be carried out on the entire (or subsets of) the Protein Databank (PDB) using the Apache Hadoop platform. Our design enables structural Biologists to use the Hadoop platform without having to explicitly write Map-Reduce code. It is easily scalable and uses a mapper architecture that functions on a stand- alone basis or can be extended to include further Map-Reduce operations.
Original language | English |
---|---|
Publication status | Published - Jul 2015 |
Event | 3DSig Structural Bioinformatics and Computational Biophysics 2015 - Dublin, Ireland Duration: 10 Jul 2015 → 11 Jul 2015 |
Conference
Conference | 3DSig Structural Bioinformatics and Computational Biophysics 2015 |
---|---|
Country/Territory | Ireland |
City | Dublin |
Period | 10/07/15 → 11/07/15 |