Here is a list of documents I found really useful when I first started working with Hadoop : Below are the two papers from Google on MapReduce paradigm and GFS
- Map Reduce: http://static.googleusercontent.com/external_content/untrusted_dlcp/labs.google.com/en/us/papers/mapreduce-osdi04.pdf
- Google File System: http://static.googleusercontent.com/external_content/untrusted_dlcp/labs.google.com/en/us/papers/gfs-sosp2003.pdf
Tips for improving Map-reduce:
General tutorials: