This course aims to give an introduction to text mining concepts and applications, and increase student awareness of the power of large amounts of text data and computational methods to find patterns in corpora.

The course is broken into three phases. Phase I gives a grounding in the basics of programming with R. Phase II introduces essential concepts and methods in text mining. Phase III demonstrates some interesting case studies and discusses the state-of-the-art techniques in text mining.

The course is taught as a series of workshops. Methods will be introduced and discussed, applications will be shown, and students will be expected to perform some tasks during the lectures.