Computational Linguistics Syllabus |
Course Outline |
Linguistics 581 |
Week |
Day |
Reading |
Assignment |
Lecture |
Background |
Code |
Thu Jan 21 | Chapter 1 of Jurafsky and Martin (J&M) History of Computational Linguistics. Chapter 2 J&M [Canvas], Finite-State Automata, Regular Expressions Read BOTH Chapter 2 and the extra file labeled "Finite-State Automata" | Assignment 1, Regular expression Python demo script. | Textbook Ch. 2 slides Non-deterministic automata, epsilon transitions, [Also see Canvas slides] Transitions demo notebook (Python FSA implementation]. Not runnable as Colab NB w/o first running install directions at Tal Yarkoni's github transitions page. | |||
Week 1 | Tue Jan 26 | Chapter 4: J&M. Language modeling (Ngram models). Word counting, frequency dictionaries, simple ngram models, the training corpus. Introduction to NLTK. | Python Distributions, Demo Script for computing ngram counts, Github Google colab extension for Chrome, Pollard assignment, Smoothing assignment. | Ngrams, Brief probability intro. | ||
Thu Jan 28 | Peter Norvig on ngrams. | |||||
Week 2 | Tue Feb 02 | Chapter 4: Section 4.1-4.5 of chapter 4. Practicalities. Sections 4.5.3 Chapter 4. Sections 4.4 and 4.5.1, 4.5.2 Chapter 4. Smoothing, Add-1 smoothing, Kneser-Ney smoothing. Unknown words. | Assignment One due! | Entropy and Cross-Entropy. (Entropy as expected information value; cross-entropy as an evaluation tool) Smoothing Lecture. Kneser-Ney Lecture. | ||
Thu Feb 04 | ||||||
Week 3 | Tue Feb 09 | Chapter 4: Section 4.5 Kneser-Ney smoothing. | Pollard assignment and smoothing assignment due. Smoothing, Pollard. | |||
Thu Feb 11 | ||||||
Week 4 | Tue Feb 16 | Chapter 5: J&M. [Canvas] Spelling Correction. Noisy Channel. Generative models. | Smoothing assignment solution. , Pollard assignment solution. | |||
Thu Feb 18 | ||||||
Week 5 | Tue Feb 23 | Chapter 6: J&M. [Canvas] Naive Bayes Classification. More generative models. | Noisy channel assignment Google Colab notebook. Due Feb. 23. | |||
Thu Feb 25 | Word senses. Computing word senses. Naive Bayes assignment: Disambiguating a word. | |||||
Week 6 | Tue Mar 02 | Chapter 7. [Canvas] Logistic Regression | ||||
Thu Mar 04 | ||||||
Week 7 | Tue Mar 09 | Logistic Regression Notebook (Not an assignment; does not need turning in). | ||||
Thu Mar 11 | Assignments: Naive Bayes assignment (Due Mar. 11). | |||||
Week 8 | Tue Mar 16 | Chapter 9: J&M. [Canvas] Noisy channel models, Sequence models. HMMs. | Assignments: Naive Bayes solution, Noisy Channel assignment solution. | Intro to HMMs, Forward/Backward Algorithm on Jason Eisner's ice cream HMM. | ||
Thu Mar 18 | Assignments: Midterm 2020! Midterm 2020 solution, Midterm 2021. (This is it!) | |||||
Week 9 | Tue Mar 23 | Chapter 11. J&M Context Free Grammars of English, Treebanks, [Jurafsky and Martin 3rd Ed, Canvas] | Jurafsky/ Martin Grammar chapter lecture (see Canvas), Tree structure: Main ideas. | |||
Thu Mar 25 | Assignments: Midterm due Mar. 25 (as a Google Colab notebook). Turn in on Canvas. | |||||
Week 10 | Tue Mar 30 | H'day | H'day | H'day | H'day | H'day |
Thu Apr 01 | Chapter 12 Parsing. [Jurafsky and Martin 3rd Ed, Canvas] CKY algorithm (bottum up parsing with a chart), | Prob parsing lecture, also see Jurafsky and Martin prob parsing slides (see Canvas), | ||||
Week 11 | Tue Apr 06 | Viterbi assignment (due on Canvas Apr. 6 ), Practice Viterbi Problem, Practice Problem solution, Practice problem template. Grammar assignment (Due Apr. 13). | td_parser-0.1:Python implementation of a recursive descent top down recognizer. | |||
Thu Apr 08 | Chapter 13 Probabilistic Parsing. [Jurafsky and Martin 3rd Ed, Canvas] probabilistic extension of CKY. Shift-reduce Dependency parsing, Deep learning parsers (RNNs) | Prob parsing lecture. | ||||
Week 12 | Tue Apr 13 | |||||
Thu Apr 15 | H'day | H'day | H'day | H'day | H'day | |
Week 13 | Tue Apr 20 | Assignment: Parsing assignment (Due Apr. 20) Solution to Midterm. Midterm interpretation. Grammar assignment solution. | ||||
Thu Apr 22 | Notes on probability parsing assignment, CKY assignment solution, Viterbi assignment solution. | |||||
Week 14 | Tue Apr 27 | Chapter 15. Vector Semantics [Jurafsky and Martin 3rd Ed, Canvas] Quiz on Vector Semantics & Neural Nets. | ||||
Thu Apr 29 | Word vector calculation example. Word Vector practice NB, | |||||
Week 15 | Tue May 04 | Chapter 16. Word Embeddings [Jurafsky and Martin 3rd Ed, Canvas] | Final review materials, 2020 Final. Practice Viterbi Problem, Practice Problem solution. | |||
Thu May 06 | Chapter 8, 12-13, 15, 16. Review | Probability Parsing assignment, Due Thurs., May 6), Notes on probability parsing assignment, Word Vector Assignment, due Thurs. May 6. | Last class day | |||
Week 16 | Tue May 11 | |||||
Thu May 13 | ||||||
Week 17 | Tue May 18 | |||||
Thu May 20 | ||||||
Week 18 | Tue May 25 |