194.093 Natural Language Processing and Information Extraction

2023W, VU, 2.0h, 3.0EC
TUWEL

Properties

  • Semester hours: 2.0
  • Credits: 3.0
  • Type: VU Lecture and Exercise
  • Format: Hybrid

Learning outcomes

After successful completion of the course, students are able to extract structure from natural language data by applying standard methods for text segmentation, word and sequence tagging, or syntactic parsing. They will have a high-level overview of the most important rule-based and learning-based approaches to each task and the standard methods for evaluating them. Students will gain a fundamental understanding of artificial neural networks and methods for training them, with a special emphasis on architectures for processing sequential data, allowing them to solve a variety of NLP tasks with deep learning. An overview of information extraction tasks will be given, allowing students to approach various problems involving the extraction of structured information from unstructured text data. A survey of common specialized IE tasks is also provided, acquainting the students with some of the most common NLP applications.

Subject of course

- Basics of text processing: segmentation, tokenization, decompounding, stemming, lemmatization; regular expressions

- N-gram language modeling, simple classification tasks in NLP

- Part-of-speech tagging, named entity recognition, and shallow parsing with Hidden Markov Models

- Syntactic representations and syntactic parsing

- Basics of natural language semantics

- Neural network basics: feed-forward networks and recurrent neural networks

- Sequence modeling and sequence-to-sequence models

- Neural language modeling: word vectors and contextualized language models

- Information extraction tasks: entity recognition, relation extraction, knowledge base population

- Information extraction applications: summarization, question answering, chatbots

Teaching methods

Lectures on the fundamentals

One term project (done in groups) with milestones

Mode of examination

Immanent (continuous assessment during the course)

Additional information

Course material:

https://github.com/tuw-nlp-ie/tuw-nlp-ie-2023WS

Workload for Students (in hours):

  • Lectures: 24
  • Milestone 1: 8
  • Milestone 2: 8
  • Final Project: 35

Total: 75


All lectures will be held in person; online participation is not possible. The official format is "hybrid" to allow for a switch to online teaching during the semester, if necessary.

Course dates

Day  Time           Date                     Location                 Description
Fri  13:00 - 15:00  06.10.2023 - 19.01.2024  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture

Natural Language Processing and Information Extraction - Single appointments

Day  Date        Time           Location                 Description
Fri  06.10.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  13.10.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  20.10.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  27.10.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  03.11.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  10.11.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  17.11.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  24.11.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  01.12.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  15.12.2023  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  12.01.2024  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture
Fri  19.01.2024  13:00 - 15:00  EI 11 Geodäsie HS - GEO  Natural Language Processing and Information Extraction Lecture

Examination modalities

  • Milestone 1: 15%
  • Milestone 2: 15%
  • Final solution: 50%
  • Presentation: 10%
  • Management summary: 10%
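
As an illustration of how these weights add up, the following is a minimal sketch that combines per-component scores into one weighted total. Only the weights come from the course description. The function name, the 0-100 score scale, and the linear combination are assumptions made for this example, and the course does not state how the total maps to a final grade.

```python
# Weights in percent, taken from the examination modalities listed above.
WEIGHTS = {
    "milestone_1": 15,
    "milestone_2": 15,
    "final_solution": 50,
    "presentation": 10,
    "management_summary": 10,
}


def overall_score(component_scores):
    """Combine per-component scores (assumed 0-100 each) into a weighted total.

    Hypothetical helper: the linear weighting is an assumption, not a rule
    stated in the course description.
    """
    # Sanity check: the published weights must cover the whole grade.
    if sum(WEIGHTS.values()) != 100:
        raise ValueError("weights do not sum to 100%")

    missing = set(WEIGHTS) - set(component_scores)
    if missing:
        raise ValueError(f"missing scores for: {sorted(missing)}")

    weighted_sum = sum(WEIGHTS[name] * component_scores[name] for name in WEIGHTS)
    return weighted_sum / 100


if __name__ == "__main__":
    # Made-up scores, used only to demonstrate the calculation.
    example = {
        "milestone_1": 80,
        "milestone_2": 90,
        "final_solution": 70,
        "presentation": 100,
        "management_summary": 60,
    }
    print(overall_score(example))  # 76.5
```

With these made-up scores, the final solution alone contributes 35 of the 76.5 points, which shows how strongly the 50% weight dominates the result.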

Course registration

Begin             End               Deregistration end
08.09.2023 08:00  08.11.2023 23:55  08.11.2023 23:55

Curricula

Study Code            Obligation     Semester  Precon.  Info
066 645 Data Science  Not specified

Literature

No lecture notes are available.

Language

English