Applying Apache Hadoop, Hive and Map Reduce to Legacy Systems and Applications

Jamal AlNasir

Research output: Contribution to conferenceOtherpeer-review

Abstract

The challenges posed to Computer Science from Bigdata generated from the Sciences can be tackled through the application of tools used in the emerging discipline of Datascience. My 1hr talk introduces Apache Hadoop and MapReduce and presents use cases of Apache Hive and Apache Hadoop for parallelising so-called "legacy" applications (i.e. those not especially developed for Hadoop using the MapReduce paradigm) on the Protein Databank. I also introduce Royal Holloways MSc in Datascience. The Innovation week programme has been organised through the collaboration of the British Council, MSTI (Mediterranean Space of Technology and Innovation) and ENSIAS (École nationale supérieure d'informatique et d'analyse des systèmes), Rabat, Morocco.
Original languageEnglish
Publication statusPublished - 28 May 2015
EventBig Data Analytics Training Workshop, MSTI ENSIAS - Rabat, Morocco
Duration: 25 May 201530 May 2015

Conference

ConferenceBig Data Analytics Training Workshop, MSTI ENSIAS
Country/TerritoryMorocco
CityRabat
Period25/05/1530/05/15

Keywords

  • Bigdata
  • Datascience
  • MapReduce
  • Structural Biology
  • Apache
  • Hadoop
  • Hive
  • PDB
  • Protein Databank

Cite this