Getting Started with Topic Modeling and MALLET

dc.contributor.author: Graham, Shawn
dc.contributor.author: Weingart, Scott
dc.contributor.author: Milligan, Ian
dc.date.accessioned: 2017-04-26T18:00:03Z
dc.date.available: 2017-04-26T18:00:03Z
dc.date.issued: 2012-09-02
dc.description: This article, published by the Editorial Board of the Programming Historian, is made available under a Creative Commons Attribution 2.0 Generic License. Available at: http://programminghistorian.org/lessons/topic-modeling-and-mallet [en]
dc.description.abstract: In this lesson you will first learn what topic modeling is and why you might want to employ it in your research. You will then learn how to install and work with the MALLET natural language processing toolkit to do so. Installing MALLET involves modifying an environment variable (essentially, setting up a shortcut so that your computer always knows where to find the MALLET program) and working with the command line (i.e., typing in commands manually rather than clicking on icons or menus). We will run the topic modeller on some example files and look at the kinds of output that MALLET produces. This will give us a good idea of how it can be used on a corpus of texts to identify topics found in the documents without reading them individually. [en]
dc.identifier.uri: http://programminghistorian.org/lessons/topic-modeling-and-mallet
dc.identifier.uri: http://hdl.handle.net/10012/11751
dc.language.iso: en [en]
dc.publisher: The Editorial Board of the Programming Historian [en]
dc.rights: Attribution 2.0 Generic [*]
dc.rights.uri: https://creativecommons.org/licenses/by/2.0/ [*]
dc.subject: Topic modeling [en]
dc.subject: Natural language processing [en]
dc.subject: MALLET [en]
dc.subject: Distant reading [en]
dc.title: Getting Started with Topic Modeling and MALLET [en]
dc.type: Technical Report [en]
dcterms.bibliographicCitation: Shawn Graham, Scott Weingart, and Ian Milligan. “Getting Started with Topic Modeling and MALLET.” The Programming Historian, September 2012. [en]
uws.contributor.affiliation1: Faculty of Arts [en]
uws.contributor.affiliation2: History [en]
uws.peerReviewStatus: Reviewed [en]
uws.scholarLevel: Faculty [en]
uws.typeOfResource: Text [en]

Files

Original bundle

Name: Getting Started with Topic Modeling and MALLET _ Programming Historian.pdf
Size: 2.4 MB
Format: Adobe Portable Document Format
Description: Publisher's version

License bundle

Name: license.txt
Size: 4.46 KB
Format: Item-specific license agreed upon to submission