GithubHelp home page GithubHelp logo

mva_2019_sl's Introduction

Algorithms for speech and natural language processing (MVA 2019)

Contact information

For any question/request related to this course, please send an email to this address: [email protected]

Course materials

Course Objectives

Speech and natural language processing is a subfield of artificial intelligence used in an increasing number of applications; yet, while some aspects are on par with human performances, others are lagging behind. This course will present the full stack of speech and language technology, from automatic speech recognition to parsing and semantic processing. The course will present, at each level, the key principles, algorithms and mathematical principles behind the state of the art, and confront them with what is know about human speech and language processing. Students will acquire detailed knowledge of the scientific issues and computational techniques in automatic speech and language processing and will have hands on experience in implementing and evaluating the important algorithms.

Topics:

  • speech features & signal processing
  • hidden markov & finite state modeling
  • probabilistic parsing
  • continuous embeddings
  • deep learning for language-related tasks (DNNs, RNNs)
  • linguistics and psycholinguistics
  • comparing human and machine performance

Prerequisites

Basic linear algebra, calculus, probability theory

Organization

Eight courses

The courses take place on monday, usually from 9am to 11/12am (but there are exceptions, see below). Be on time! Attendance is mandatory.

  • #1 Jan 28, 9:00am-11:00am (Dussane). Introduction (Sagot & Dupoux)
  • #2 Feb 04, 9:00am-11:00am (Dussane). ASR1: Features and Acoustic Models (Dupoux & Zeghidour)
  • #3 Feb 11, 9:00am-12:00am (Dussane). ASR2: Language Models (Dupoux, Zeghidour, Riad) + presentation TD#1
  • #4 Feb 18, 9:00am-11:00am (Dussane). NLP1: Language processing in the wild (Sagot)

Attention, deadline for returning TD#1 on Monday Feb 25 at 12am: penalty points for late submission start here

  • #5 Feb 25, 4:00pm-6:00pm (Dussane). NLP2: Formal languages (Sagot) (Attention, 4:00pm instead of 9:00am)
  • #6 Mar 4, 9:00am-12:00am (Dussane). NLP3: Parsing (Sagot) + presentation TD#2
  • #7 Mar 11, 9:30am-11:30am (Dussane). Translation (Guest: Schwenk) (Attention, 9:30am instead of 9:00am)

Attention, deadline for returning TD#2 on Monday Mar 18 at 12am: penalty points for late submission start here

  • #8 Mar 18, 9:00am-12:00am (Dussane). Perspectives (Sagot & Dupoux)

(Dussane): 45 rue d'Ulm, Paris 75005, Amphi Dussane, Ground Floor, left.

The course materials (PDFs, etc.) are listed in the subdirectories numbered #1 .. #8.

Validation

The validation is continuous: there is no final exam, but a combination of quizzes during the lessons (20%) and two practical assignments (TDs), (40% each). ATTENTION: since there is no exam, there is no possibility of "rattrapage" (ie, of compensating a bad mark by taking another exam). So, if the overall grade obtained in this course is less than 10/20, this course will not be considered validated by the MVA Master.

Practical assignments (TD)

The practical assignments are given on the courses #3 and #6. There will be one assignment for the speech part and one for the NLP part. For each assignment, students are provided with the necessary data and Python code, either as a list of requirements to install or in the form of a disk image (.ova) to be mounted and booted from a virtual machine. They will hand in their source code and a max two page report, detailing their work, the difficulties encountered and the results. Students will have a max of 2 weeks to complete the assignment; assignment will be graded from 0 to 20, with a -1 point removed from the score for each day of being late. Each assignment will count for 40% of the final grade. We may organise special Q&A sessions regarding these assignments from 11am to 12am upon request.

Quizzes

During the courses, we will use on-line quizzes (on the smartphone/computer) to probe comprehension and trigger discussion. The quizzes will be used (1) to check that you attend the course, (2) that you have read the supporting documents and are following what is being presented. Each quiz will be graded as follows: 0 (no response), 1 (wrong response), 2 (good response). The scores will be averaged and converted from on a 0 to 20 scale. If there are N quizzes, we will use the N-1 best scores for averaging. The overall score will count for 20% of the final grade.

Q&A

What happens if i am ill and cannot attend one course?

Presence to the course is mandatory. Each unattended course will give rise to points substracted to the final grade unless they are motivated by an official document, like a medical doctor's statement. This document should be sent to [email protected] together with the name and date of missed lesson.

mva_2019_sl's People

Contributors

bsagot avatar edupoux avatar dpx-fair avatar rachine avatar lienz avatar jukaradayi avatar louismartin avatar caofrance avatar

Watchers

James Cloos avatar Remy Sun avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.