Data Analysis: Text ProcessingCovers text processing techniques for data analysis, including text cleaning, tokenization, stemming, and lemmatization.
Probabilistic RetrievalCovers Probabilistic Information Retrieval, modeling relevance as a probability, query expansion, and automatic thesaurus generation.
Information retrieval: vector spaceCovers the basics of information retrieval using vector space models and practical exercises on relevance feedback and posting list scanning.