Natural language processing is ubiquitous in modern intelligent technologies, serving as a foundation for language translators, virtual assistants, search engines, and many more. In this course, we cover the foundations of modern methods for natural language processing, such as word embeddings, recurrent neural networks, transformers, and pretraining, and how they can be applied to important tasks in the field, such as machine translation and text classification. We also cover issues with these state-of-the-art approaches (such as robustness, interpretability, sensitivity), identify their failure modes in different NLP applications, and discuss analysis and mitigation techniques for these issues.
Platform | Where & when |
---|---|
Lectures | Wednesdays: 11:15-13:00 [STCC - Cloud C] & Thursdays: 13:15-14:00 [CE16] |
Exercises Session | Thursdays: 14:15-16:00 [CE11] |
Project Assistance (not every week) |
Wednesdays: 13:15-14:00 [STCC - Cloud C] |
QA Forum & Annoucements | Ed Forum [link] |
Grades | Moodle [link] |
Lecture Recordings | Mediaspace [link] |
All lectures will be given in person and live streamed on Zoom. The link to the Zoom is available on the Ed Forum (pinned post). Beware that, in the event of a technical failure during the lecture, continuing to accompany the lecture live via zoom might not be possible.
Recording of the lectures will be made available on Mediaspace. We will reuse some of last year's recordings and we may record a few new lectures in case of different lecture contents.
Week | Date | Topic | Instructor |
---|---|---|---|
Week 1 | 20 Feb | Intro + Setup [code] | Beatriz Borges |
Week 2 | 27 Feb | LMs + Neural LMs: fixed-context models Language and Sequence-to-sequence models |
Simin Fan |
Week 3 | 6 Mar | Attention + Transformers, GPT | Badr AlKhamissi |
Week 4 | 13 Mar | Pretraining and Transfer Learning | Badr AlKhamissi |
Week 5 | 20 Mar | Transfer Learning | Simin Fan |
Week 6 | 27 Mar | Generation | Madhur Panwar |
Week 7/8 | MIDTERM / In-context Learning - GPT-3 | Mete Ismayilzad |
- TAs will provide a small discussion over the last week's exercises, answering any questions and explaining the solutions. (10-15mins)
- TAs will present this week's exercise. (5mins)
- Students will be solving this week's exercises and TAs will provide answers and clarification if needed.
Note: Please make sure you have already done the setup prerequisites to run the coding parts of the exercises. You can find the instructions here.
Your grade in the course will be computed according to the following guidelines.
We will be using the Huggingface Hub as a centralized platform for submitting project artifacts, including model checkpoints and datasets. Please take some time to familiarize yourself with the functionalities of Huggingface Hub in Python to ensure a smooth workflow.
All milestones are due by 23:59 on their respective due dates. However, we understand that meeting deadlines can sometimes be challenging. To accommodate this, you will be given 2 late days for the semester to use at your discretion for group project milestones. Additionally, you will have 3 individual late days that can be applied to the individual project milestone. For group projects, if all members still have late days remaining, those days can be pooled and converted to group late days at a rate of one group late day per four individual late days. No extensions will be granted beyond the due dates, except for the final report, code, and data, which have a strict final deadline of June 8th. Late days will be automatically tracked based on your latest commit, so there is no need to notify us. Once all your late days are used, any further late submissions will incur a 25% grade deduction per day.
More details will be announced in the next weeks.
The project will be divided into 2 milestones and a final submission. Each milestone will be worth 15% of the final grade with the remaining 30% being allocated to the final report. Each team will be supervised by one of the course TAs or AEs.
More details on the content of the project and the deliverables of each milestone will be released at a later date.
- Due: 4 May 2025
- Due: 18 May 2025
- The final report, code, and date will be due on June 8th. Students are welcome to turn in their materials ahead of time, as soon as the semester ends.
- Due: 8 June 2025
Please email us at nlp-cs552-spring2025-ta-team [at] groupes [dot] epfl [dot] ch for any administrative questions, rather than emailing TAs individually. All course content questions need to be asked via Ed.
Lecturer: Antoine Bosselut
Teaching assistants: Angelika Romanou, Badr AlKhamissi, Beatriz Borges, Zeming (Eric) Chen, Simin Fan, Silin Gao, Mete Ismayilzada, Sepideh Mamooler, Madhur Panwar, Auguste Poiroux, Ayush Tarun