Quick Links NLTK Gutenberg Term Frequency - Inverse Document Frequency: Practice Repo Setup If necessary, register for a Cloud9 IDE account. (http://c9.io/) Sign into Cloud9 IDE. Create a new workspace. Name the workspace in the Workspace name field. For example: tf_idf. Add a description if you like. Set the repo in the Clone from Git or Mercurial URL field to https://github.com/WomenWhoCode/tf_idf.git. Click Create Workspace. If necessary, open the workspace. Prepare the Cloud9 virtual machine. sudo pip3 install -r requirements.txt Download the NLTK sample corpora. python3 import nltk nltk.download() Download the gutenberg corpus. Exit the Python interpreter (press Ctrl-D). Development Run python3 -m tf_idf.main to run the webserver to see results.