181.189 Applied Web Data Extraction and Integration

2013S, VU, 2.0h, 3.0EC
TUWEL

Properties

  • Semester hours: 2.0
  • Credits: 3.0
  • Type: VU Lecture and Exercise

Aim of course

Overview of tools and methods for web data extraction and integration, covering Web Process Automation, Web Data for BI, Web Data Cleansing, and Web Testing.

Subject of course

- Web Data Extraction Frameworks and Scenarios: Commercial, Academic and Open Source
- Data Integration and Mapping
- Creation of more complex sample scenarios in some of the extraction/integration frameworks
- Functional Web 2.0 Application Testing
- Web Process Automation and SOA
- Web ETL Connectors: Web Data for Business Intelligence
- Sample Scenarios in vertical domains
- Web Data Cleansing and Free Text Extraction
- PDF Data Extraction
- Elog Extraction Language

The course comprises a lecture part and an exercise part. The lecture part primarily teaches methodologies and illustrates concepts from practice, including live system demonstrations. The exercises aim to strengthen the participants' knowledge, in particular through practical use of tools in the area of web data extraction. At the end of the course, student group talks cover further aspects in more detail. One meeting is devoted to an overview of current (applied) research projects at DBAI, giving a short glimpse of novel research in this area.

Additional information

ECTS-Breakdown:
lectures: 14 hours
discussion of the exercises: 8 hours
exercises: 26 hours
final project/exam: 27 hours
total: 75 hours (3 ECTS)

Fridays 16:00 to 18:00 (nine sessions). Please refer to the lecture web page. Registration for exercise groups is via TISS. Lecture 16-17, exercise evaluation 17-18. The first meeting is on 8 March; the further schedule is available on the lecture web page.

Lecturers

  • Baumgartner, Robert

Institute

Course dates

Day  Time           Date        Location            Description
Fri  16:00 - 19:00  08.03.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  15.03.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  22.03.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration Bk
Fri  16:00 - 19:00  12.04.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  19.04.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  03.05.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  17.05.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  24.05.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  07.06.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  14.06.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration Bk
Fri  16:00 - 19:00  21.06.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration
Fri  16:00 - 19:00  28.06.2013  EI 4 Reithoffer HS  Applied Web Data Extraction and Integration Bk

Examination modalities

Course assessment has two parts: individual exercises, group exercises, and their presentation during the semester; and a final project at the end of the semester, in which a particular topic is elaborated as a paper, presented, and discussed.

Course registration

Registration modalities

Lecture subscription is via group registration for the exercises in TISS. If you are an ECML student who cannot yet register officially via the system, please subscribe to the exercises via email instead.

Group Registration

Group     Registration From  To
Gruppe 1  10.02.2013 10:00   15.03.2013 16:00
Gruppe 2  10.02.2013 10:00   15.03.2013 16:00
Gruppe 3  10.02.2013 10:00   15.03.2013 16:00
Gruppe 4  10.02.2013 10:00   15.03.2013 16:00
Gruppe 5  10.02.2013 10:00   15.03.2013 16:00
Gruppe 6  10.02.2013 10:00   15.03.2013 16:00
Gruppe 7  10.02.2013 10:00   15.03.2013 16:00
Gruppe 8  06.02.2013 00:00   08.03.2013 16:00
Gruppe 9  01.03.2013 00:00   08.03.2013 16:00

Curricula

Literature

Please refer to the lecture slides.

Previous knowledge

Helpful: basic knowledge of HTML, XML/XSLT, and JavaScript

Preceding courses

Miscellaneous

Language

English