Scientists and mathematicians have long loved Python as a vehicle for working with data and automation. Python has not lacked for libraries such as Hadoopy or Pydoop to work with Hadoop, but those ...
This repo contains all my work for this course. This includes the Python, Scala or SQL code for the programming tasks, scripts for setup and running various jobs both locally and on a Hadoop cluster ...