Archive for the 'Course MapReduce' Category

MapReduce grades and evaluation

Friday, February 17th, 2012, posted by Djoerd Hiemstra

The MapReduce, Pig Latin and Cloud Computing assignments have been graded. The final grades can be found in Blackboard’s Grade Center. Please join the course evaluation session on 21 February in hal B 2C from 12.30 to 13.30 h. (a free lunch is included).

Keynote lecture by Jimmy Lin at Big Data tutorial

Tuesday, October 18th, 2011, posted by Djoerd Hiemstra

Jimmy Lin will give a keynote lecture at the SIKS/BigGrid Big Data tutorial that precedes the DBDBD on 30 November and 1 December 2011. Dr. Lin, who holds a PhD from MIT, is an associate professor in the iSchool at the University of Maryland. He also has appointments in the Institute for Advanced Computer Studies (UMIACS) and the Department of Computer Science at Maryland. Lin works at the intersection of natural language processing (NLP) and information retrieval (IR), with a recent emphasis on scalable algorithm design and large-data issues. He directs the recently formed Cloud Computing Center, an interdisciplinary group which explores the many aspects of cloud computing as it impacts technology, people, and society. He is also a member of both the Computational Linguistics and Information Processing Lab (CLIP) and the Human-Computer Interaction Lab (HCIL). Lin worked at Cloudera, which aims to bring Hadoop MapReduce to the enterprise, and is currently spending a sabbatical at Twitter.

See: DBDBD Big Data Tutorial

Solutions on Blackboard

Tuesday, January 4th, 2011, posted by Djoerd Hiemstra

Solutions for Assignment 4 (Sawzall) and for Assignment 5 (HBase Schema) are now on Blackboard.

Solutions to Assignment 3

Monday, December 6th, 2010, posted by Djoerd Hiemstra

The solutions to Assignment 3 are now on-line in the Course Material Section on Blackboard. You need the solutions for Assignment 4, deadline next Friday, 10 December.

Small Haskell wrap-up meeting

Friday, December 3rd, 2010, posted by Djoerd Hiemstra

Next Monday, 6 December, from 14.30 to 15.15 h. in ZI-3126, there is a short meeting to discuss the solutions for Assignments 2 and 3. The solutions, which are helpful for Assignment 4, will also be put on Blackboard.
Next Tuesday, 7 December: the Hadoop Hackathon!

Solution for Assignment 2

Monday, November 29th, 2010, posted by Djoerd Hiemstra

The grades for Assignment 2 are now on Blackboard’s Grade Center. A correct solution for Assignment 2, which is needed for Assignment 3, can be found under “Course Materials” on Blackboard.

Grades for Assignment 1 on Blackboard Grade Center

Tuesday, November 23rd, 2010, posted by Djoerd Hiemstra

The grades for Assignment 1 are now on Blackboard’s Grade Center. Please send me an email as soon as possible if you cannot find your grades, if you cannot find an explanation of your grade (including a per-question result), or if you did not submit solutions at all for Assignment 1 but still want to participate in the course. The deadline for Assignment 2 is next Friday, 26 November.

Guest lecture by Peter Dickman from Google

Tuesday, November 16th, 2010, posted by Djoerd Hiemstra

On Friday 26 November, Peter Dickman from Google will talk about Google’s infrastructure. The lecture will start at 10:30 h. (so 15 minutes earlier than usual) in RA-1501.

This is a rapid overview of the approach Google uses to develop and offer global products. I will briefly (and somewhat superficially) cover the whole of our infrastructure from physical systems, such as the data centers, through the software stack to our software development methodology and the corporate engineering culture that both builds and utilizes the infrastructure.

Peter Dickman is an engineering manager in Google’s main European engineering centre in Zurich. He is involved with both the internals of the Google search engine and projects to protect user data in Google’s systems. Prior to working at Google, Peter was an academic in the UK, researching large-scale distributed systems (though on arrival at Google he discovered what large really meant).

Crash course Functional Programming

Friday, November 12th, 2010, posted by Maarten Fokkinga

The crash course Functional Programming, intended to enable you to describe the word count program in a functional language, will be given by Maarten Fokkinga in room Zilverling, West 1, on Friday Nov 19, 13:45-15:30. We’ll use the programming language Amanda (one executable running under Windows), but any other functional programming language, such as Haskell, may be used for the homework as well. A download for Amanda is available with the material for Assignment 2.

SARA organizes Hadoop hackathon

Thursday, November 4th, 2010, posted by Djoerd Hiemstra

On December 7, SARA (the Dutch National High Performance Computing and e-Science Support Center) organizes a day-long hackathon to kick off a Proof-of-Concept Hadoop service and give participants the opportunity to experiment with Hadoop with the support of experienced users. People who are interested can work with Hadoop on a case of their choice, or simply play with datasets like Wikipedia, the ENRON dataset, White House visitor records, Genome data or others.

See: SARA starts Apache Hadoop Proof-of-Concept.