Computational Linguistics Syllabus

Course Outline

Linguistics 581

Week

Day

Reading

Assignment

Lecture

Background

Code

Week 0 Wed Jan 22 Chapter 1 of Jurafsky and Martin (J&M) History of Computational Linguistics. Chapter 2 J&M [Blackboard], Finite-State Automata, Regular Expressions Read BOTH Chapter 2 and the extra file labeled "Finite-State Automata" Assignment 1, Regular expression Python demo script. Textbook Ch. 2 slides Non-deterministic automata, epsilon transitions. [Also see blackboard slides]    
Week 1 Wed Jan 29 Chapter 4: J&M. Language modeling (Ngram models). Word counting, frequency dictionaries, simple ngram models, the training corpus. Introduction to NLTK. Python releases, Demo Script for computing ngram counts, The Pollard assignment and smoothing assignment are due Feb. 11. Ngrams, Brief probability intro, Python & unicode. Peter Norvig on ngrams.  
Week 2 Wed Feb 05 Chapter 4: Section 4.1-4.5 of chapter 4. Practicalities. Sections 4.5.3 Chapter 4. Sections 4.4 and 4.5.1, 4.5.2 Chapter 4. Smoothing, Add-1 smoothing, Kneser-Ney smoothing. Unknown words.   Entropy and Cross-Entropy. (Entropy as expected information value; cross-entropy as an evaluation tool) Smoothing Lecture. Kneser-Ney Lecture.    
Week 3 Wed Feb 12 Chapter 4: Section 4.5 Kneser-Ney smoothing.        
Week 4 Wed Feb 19 Chapter 5: J&M. [Blackboard] Spelling Correction. Noisy Channel. Generative models. Noisy channel assignment zip file (contains Python notebook). Due Feb. 26. Regular expressions assignment solution script (Python).      
Week 5 Wed Feb 26 Chapter 6: J&M. [Blackboard] Naive Bayes Classification. More generative models. Chapter 7. [Blackboard] Logistic Regression Naive Bayes assignment (Due Mar. 4). Naive Bayes lecture.    
Week 6 Wed Mar 04 Chapter 7. [Blackboard] Logistic Regression Logistic Regression assignment (Due Mar. 11).      
Week 7 Wed Mar 11 Chapter 9: J&M. [Blackboard] Noisy channel model redux, Sequence models. HMMs.   Intro to HMMs, Forward/Backward Algorithm on Jason Eisner's ice cream HMM.    
Week 8 Wed Mar 18 Chapter 10: J&M [Blackboard] Word-class and part of speech tagging. Rule-based taggers, decoding with HMMs. Viterbi assignment (Hardcopy due Mar. 18 or Mar. 25). Practice Viterbi Problem, Practice Problem solution, Practice problem template. Lecture. Tagging slides HMM Taggers/HMM models.    
Week 9 Wed Mar 25   Brown tagging NB, Naive Bayes solution, Noisy channel assignment solution, Viterbi assignment part one solution, Viterbi assignment solution, Viterbi assignment calculations, Midterm 2020! Lecture: Review    
Week 10 Wed Apr 01 H'day H'day H'day H'day H'day
Week 11 Wed Apr 08 Chapter 11. J&M Context Free Grammars of English, Treebanks, [Jurafsky and Martin 3rd Ed, Blackboard] Grammar assignment (Due Apr. 15).      
Week 12 Wed Apr 15 Chapter 12 Parsing. [Jurafsky and Martin 3rd Ed, Blackboard] CKY algorithm (bottum up parsing with a chart), Parsing assignment (CKY) (Due Tues., April 22, Note: Part C, using probabilities is Due Tu, Apr 29. Solution for Grammar assignment. Jurafsky/ Martin Grammar chapter lecture (see blackboard), Tree structure: Main ideas. Jurafsky Martin parsing lecture (see blackboard).   td_parser-0.1:Python implementation of a recursive descent top down recognizer.
Week 13 Wed Apr 22 Chapter 12 Probabilistic Parsing. [Jurafsky and Martin 3rd Ed, Blackboard] probabilistic extension of CKY. Shift-reduce Dependency parsing, Deep learning parsers (RNNs) Parsing assignment (CKY) (Part C: Probabilities, Due Wed., May 6), Solution for Grammar assignment, CKY assignment solution      
Week 14 Wed Apr 29 Chapter 15. Vector Semantics [Jurafsky and Martin 3rd Ed, Blackboard] Solution for Parts A and B, Prob parsing assignment notes, Parsing assignment (CKY) (Part C: Probabilities, Due Wed., May 6), Word Vector Assignment, due Wed. May 6,      
Week 15 Wed May 06 Chapter 16. Word Embeddings [Jurafsky and Martin 3rd Ed, Blackboard] 2020 Compling final, Solution for Word Vector problem, Word vector calculation example. Word Vector practice NB, Solution to parsing problem, Part C. Solution for Parts A and B, Prob parsing assignment notes,   Last class day    
Week 16 Wed May 13 Chapter 8. Neural Nets [Jurafsky and Martin 3rd Ed, Blackboard] Quiz on Vector Semantics & Neural Nets.      
Week 17 Wed May 20          
Week 18 Wed May 27 Chapter 8, 12-13, 15, 16. Review        
Week 19 Wed Jun 03   2019 Final,      
Week 20 Wed Jun 10          
Week 21 Wed Jun 17          
Week 22 Wed Jun 24