CSci 4409. Programming for Parallel Architecture: Resources
[Home]
[Syllabus]
[Assignments]
[Resources]
[Wiki]
In-class examples
Articles and other reading resources
Required reading is marked in light green.
Programming resources and software downloads
Clojure resources
Hadoop resources
- Description
of Hadoop
- Oct
14: MapReduce
and HDFS lecture by Aaron Kimball, the first engineer
at Cloudera
- Oct
21: Programming
with Hadoop by Aaron Kimball (key Hadoop terminology and
approaches) Terminology you need to know: job, task,
JobTracker, TaskTracker, NameNode, mapper, reducer, InputSplit,
RecordReader, RecordWriter, Partitioner. You also need
to know main Java interfaces used by Hadoop. Also, you need to
know what Hadoop streaming refers to.
- A wikipedia article
on Hadoop: quite detailed and helpful.
- clojure-hadoop
package with downloadable examples (the classical Hadoop
wordcount example done in Clojure).
- Hadoop + Clojure lecture
by Stuart Sierra (work done with Tim Dysinger)
- November 9:
Hadoop
Training: MapReduce Algorithms by
Aaron Kimball.
- Apache Hadoop tutorial (Java), including two versions of the wordcount example.
Erlang resources