Toggle navigation
Digital Humanities Approaches to Textual Objects, Fall 2018
Policies
Weekly Calendar
Assignments
Readings
Login
Text Analysis Methods Workshop 10: The vector space model and cosine similarity
Text Analysis Methods Workshop 1:
Initial setup; a bit of Git
Text Analysis Methods Workshop 2:
Using a Jupyter Notebook; basic Python an Example Script
Text Analysis Methods Workshop 3:
Python Fundamentals continued; another Example Script
Text Analysis Methods Workshop 4:
Basics of Text Processing; tokenization, word counts, lemmatization
Text Analysis Methods Workshop 5:
POS tagging; Named Entity Recognition
Text Analysis Methods Workshop 6:
Working with textual corpora and datasets
Text Analysis Methods Workshop 7:
Dictionaries and lexicons
Text Analysis Methods Workshop 8:
Levenshtein distance and fuzzy matching
Text Analysis Methods Workshop 9:
Collocations and N-grams
Text Analysis Methods Workshop 10:
The vector space model and cosine similarity
Text Analysis Methods Workshop 11:
TF-IDF and clustering
Text Analysis Methods Workshop 12:
Topic models and Word2vec
Text Analysis Methods Workshop 14:
Machine learning classification (linear and logistic regression)
Text Analysis Methods Workshop 15:
Network analysis and visualization
Link to Content