Computational Linguistics Syllabus

Course Outline

Linguistics 581

Day

Reading

Assignment

Lecture

Background

Code

Thu Jan 19 Chapter 1 of Jurafsky and Martin (J&M) History of Computational Linguistics. Assignment 1. Some history. Textbook intro. Polish fried fish.    
Tue Jan 24 Chapter 2 J&M [Blackboard], Finite-State Automata, Regular Expressions   Textbook Ch. 2 slides Non-deterministic automata, epsilon transitions. [Also see blackboard slides]    
Thu Jan 26 Chapter 2 [Blackboard], Text normalization, edit distance, lemmatization, segmentation.        
Tue Jan 31 Reading: Sections 3.1-3.3 of Chapter 3. Introducing words and word parts. Sections 3.4-3.9 of Chapter 3. Assignment 1 due, Assignment 2 Installing Python. Textbook Ch. 3 slides Introduction to transducers    
Thu Feb 02          
Tue Feb 07 Chapter 3. XFST intro. XFST assignment (Due Feb. 14) [Compling lab (if needed): SHW 243]      
Thu Feb 09          
Tue Feb 14 Chapter 4: J&M. Language modeling (Ngram models). Word counting, frequency dictionaries, simple ngram models, the training corpus. The Pollard assignment and smoothing assignment are due Feb 28, Malay/Indonesian solution, Lakota solution. Ngrams Brief probability intro Introduction to NLTK. Peter Norvig on ngrams.    
Thu Feb 16          
Tue Feb 21 Chapter 6: J&M. [Blackboard] Naive Bayes Classification. More generative models.        
Thu Feb 23          
Tue Feb 28 Chapter 7: J&M. [Blackboard] Logistic regression Max entropy assignment, due Mar. 14 Maximum entropy models    
Thu Mar 02          
Tue Mar 07   Pollard and smoothing assignment solutions. Chapter 9: J&M. [Blackboard] Sequence models. HMMs. Intro to HMMs, Forward/Backward Algorithm on Jason Eisner's ice cream HMM.    
Thu Mar 09          
Tue Mar 14 Chapter 10: J&M [Blackboard] Word-class and part of speech tagging. Rule-based taggers, decoding with HMMs. Computing Viterbi by hand. Lecture. Tagging slides HMM Taggers/HMM models.    
Thu Mar 16          
Tue Mar 21   Assignment: midterm.pdf, Viterbi assignment: answer, Max ent problem model answer.      
Thu Mar 23          
Tue Mar 28 H'day H'day H'day H'day H'day
Thu Mar 30 H'day H'day H'day H'day H'day
Tue Apr 04 Chapter 12. J&M Context Free Grammars of English, Treebanks Grammar assignment, Due Thurs., April 13.      
Thu Apr 06   Grammar assignment.. Jurafsky/ Martin Grammar chapter lecture (see blackboard), Tree structure: Main ideas.    
Tue Apr 11 Chapter 13, 14 J&M Parsing. CKY algorithm (bottum up parsing with a chart), Earley algorithm (top down parsing with chart), probabilistic extension of CKY. Parsing assignment (CKY) (Parts A and B Due Thurs., April 20). Jurafsky Martin parsing lecture (see blackboard).   td_parser-0.1:an implementation of a recursive descent top down recognizer.
Thu Apr 13   Assignment: Midterm model answer, email return of XFST and Max Ent assignments.      
Tue Apr 18 Chapter 14. Top down parsing with chart, probabilistic extension of CKY. Parsing assignment (CKY) (Part C: Probabilities, Due Thurs., April 27), Solution for Grammar assignment, Solution for Parts A and B. Prob parsing lecture, also see Jurafsky and Martin prob parsing slides (see BlackBoard), Formal properties of PCFGs (for the mathematically inclined),    
Thu Apr 20          
Tue Apr 25 Chapter 15. Vector Semantics [Jurafsky and Martin Ch. 15, 3rd Ed, Blackboard] (Optional) Word2 Vec Assignment, CKY probability assignment calculations.      
Thu Apr 27 Chapter 12-13, 15. Review        
Tue May 02          
Thu May 04   2017 Final (Due May 11, 2017, not yet available).2016 Final. Where next? Tom Mitchell's Online Machine Learning Lectures.   Last class day