195.075 Introduction to Data Mining: Clustering and Outlier Detection
This course is in all assigned curricula part of the STEOP.
This course is in at least 1 assigned curriculum part of the STEOP.

2014S, VU, 2.0h, 3.0EC

Properties

  • Semester hours: 2.0
  • Credits: 3.0
  • Type: VU Lecture and Exercise

Aim of course

To provide an overview on different categories of clustering and outlier detection and introduce fundamental methods as well as specialized methods for high-dimensional data.

Subject of course

Part 1: Lecture (public) - 8 units.
The lectures will give an overview on different categories of clustering and outlier detection and introduce fundamental methods as well as specialized methods for high-dimensional data.
An introductory chapter will survey the KDD pipeline. We will shortly discuss properties of feature spaces and distance measures.
The chapter on cluster analysis will cover partitional clustering and density-based clustering, and will give some ideas about hierarchical clustering. Finally, we will discuss challenges and some strategies for clustering high-dimensional data.
The chapter on outlier detection will survey standard methods from the data mining literature as well as the particular challenges and solutions for high-dimensional data.

During the lecture, we might occasionally examine the typical behavior of some representative algorithms on toy data sets. Participants interested in following these experiments on their own laptop computer are encouraged to download the latest release of ELKI (http://elki.dbs.ifi.lmu.de/ ) (requires java, e.g., OpenJDK 7). 

Part 2: Seminar, 16 participants (PhD school) - 6 units.
In the seminar, we will discuss recent literature on clustering and outlier detection, with a particular focus on high-dimensional data. The participants are to prepare talks on papers and to discuss the presented papers, based on the insights learned in the lecture.

In addition, we will have a short preparatory meeting on May 16, after the last lecture, to discuss the assignment of topics to the participants.

Additional information

This is a visiting professor course of the Vienna PhD School of informatics in the area of Media Informatics and Visual Computing. 

Lecturer: Arthur Zimek, LMU Munich, Germany.

Lecturers

Institute

Course dates

DayTimeDateLocationDescription
Wed10:00 - 12:0009.04.2014Seminarraum FAV 01 B (Seminarraum 187/2) Lecture
Wed14:00 - 16:0009.04.2014Seminarraum FAV EG C (Seminarraum Gödel) Lecture
Thu14:00 - 16:0010.04.2014Seminarraum FAV 01 A (Seminarraum 183/2) Lecture
Fri10:00 - 12:0011.04.2014Seminarraum FAV 01 A (Seminarraum 183/2) Lecture
Fri14:00 - 16:0011.04.2014Seminarraum FAV 01 A (Seminarraum 183/2) Lecture
Thu14:00 - 16:0015.05.2014Seminarraum FAV 01 A (Seminarraum 183/2) Lecture
Fri10:00 - 12:0016.05.2014FAV Hörsaal 3 Zemanek (Seminarraum Zemanek) Lecture
Fri14:00 - 16:0016.05.2014FAV Hörsaal 3 Zemanek (Seminarraum Zemanek) Lecture
Fri16:00 - 17:0016.05.2014FAV Hörsaal 3 Zemanek (Seminarraum Zemanek) Preparatory meeting
Thu10:00 - 12:0026.06.2014FAV Hörsaal 3 Zemanek (Seminarraum Zemanek) Seminar
Thu14:00 - 18:0026.06.2014Seminarraum FAV EG B (Seminarraum von Neumann) Seminar
Fri10:00 - 12:0027.06.2014Seminarraum FAV 01 A (Seminarraum 183/2) Seminar
Fri14:00 - 18:0027.06.2014Seminarraum FAV 01 A (Seminarraum 183/2) Seminar

Examination modalities

First (minor) part: for the homework given in the lectures, the participants in the seminar should prepare a short report summarizing your answers and send it to zimek@dbs.ifi.lmu.de until May 1, 2014. Please include, if possible, a picture of yourself and a short description of your research topics, to help me learn your names and prepare the seminar assignments.

The second part of the grade will be based on your presentation in the seminar. We will discuss paper assignments to the participants in the seminar on May 16 after the last lecture.

Course registration

Begin End Deregistration end
17.02.2014 00:00 08.05.2014 00:00 31.05.2014 00:00

Registration modalities

Registration takes place in TISS

Curricula

Study CodeObligationSemesterPrecon.Info
PhD Vienna PhD School of Informatics Not specified

Literature

No lecture notes are available.

Language

English