Machine learning for language toolkit

Tutorial slides / video
Quick Start
Importing Data
Data Transformations
Sequence Tagging
Topic Modeling

View the Project on GitHub mimno/Mallet

About Mallet

Mallet has been maintained for the past 10 years by David Mimno, who contributed the topic modeling package. Development is currently focused on stability, with small improvements and bug fixes. For specific questions and user support, use the #mallet tag on Stack Overflow. For bug reports use Github.

It was written by Andrew McCallum, with contributions from several graduate students and staff, including Kedar Bellare, Gaurav Chandalia, Aron Culotta, Gregory Druck, Al Hough, Wei Li, David Mimno, David Pinto, Sameer Singh, Charles Sutton, Jerod Weinman, and Limin Yao, at University of Massachusetts Amherst, as well as contributions from Fernando Pereira, Ryan McDonald, and others at University of Pennsylvania.

The MALLET website was designed and written by David Mimno, with contributions from Charles Sutton, Gaurav Chandalia, and Al Hough.

The toolkit is Open Source Software, and is released under the Apache 2.0 License. You are welcome to use the code under the terms of the licence for research or commercial purposes, however please acknowledge its use with a citation:

McCallum, Andrew Kachites.  "MALLET: A Machine Learning for Language Toolkit." 2002.

Here is a BiBTeX entry:

  author = "Andrew Kachites McCallum",
  title = "MALLET: A Machine Learning for Language Toolkit",
  note = "",
  year = 2002}