Assignment 3

Attention

The way to obtain the assignment skeleton changed. Please read Part 1 carefully.
A detailed walkthrough is available for Parts 2 and 3. It is located on a separate webpage, to keep this page shorter.

Summary

In this assignment, you will continue developing a movie review classifier. Mostly, you will be practicing evaluating a system. In particular, you will compute the accuracy, precision, and recall of the simplistic “system” you built in Assignment 2.

You will start with the code you submitted for Assignment 2 and will develop it further so that it now processes a bunch of files. You will represent the files as labeled data points. With that in hand, you will be able to run the simplistic prediction system on all of the files, compare the predicted labels with the actual labels, and compute the performance metrics.

After we are sure we know how to process multiple files, store the data as vectors, and compute performance metrics, we will leave our simplisticPrediction() behind and proceed to deploying more sophisticated (off-the-shelf) techniques on our data.

Spec: After you are done, your program should do the following:

Accept two filepaths as input: the first one for positive (training) files and the second one for negative (training) files.
Output 5 numbers in the exact order specified below in Part 3 Step 2.

Do not output any text; we need just the numbers, for faster grading.

We should be able to run your program from the command line as follows: python netID_assignment3.py path_to_the_pos_dir path_to_the_neg_dir. It’s a good idea to test it from the command line before you submit.
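As an illustration of the command-line interface described above, here is a minimal sketch of reading the two directory paths. The function name parse_dirs is hypothetical (it is not part of the skeleton), and exiting with a non-zero status on a wrong argument count is just one possible choice.

```python
import sys


def parse_dirs(argv):
    """Return (pos_dir, neg_dir) taken from a command-line argument list.

    argv[0] is the script name, so exactly three entries are expected.
    parse_dirs is a hypothetical helper, not something the skeleton provides.
    """
    if len(argv) != 3:
        # Exit quietly with an error status; nothing is printed to stdout.
        sys.exit(2)
    return argv[1], argv[2]


# In the real program this would be called as:
#     pos_dir, neg_dir = parse_dirs(sys.argv)
```

Taking the argument list as a parameter, rather than reading sys.argv inside the function, makes it easy to try the function out with a hand-written list.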

Below you will find the assignment description in several parts. If you are not sure how to do something, please make sure to also consult the detailed walkthrough for Parts 2 and 3. That walkthrough is not the only way of doing things; it is intended as helpful though. Ask questions!

Just one hint for everyone: Make it so your program only pays attention to files with the .txt extension and ignores everything else which may have ended up in the directory!
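One way to follow this hint is sketched below. The helper name list_txt_files is an assumption made for illustration; only os.listdir, os.path.join, and os.path.isfile from the standard library are used.

```python
import os


def list_txt_files(directory):
    """Return sorted full paths of the regular files in `directory` ending in .txt.

    list_txt_files is a hypothetical helper name used for illustration.
    """
    paths = []
    for name in sorted(os.listdir(directory)):
        full_path = os.path.join(directory, name)
        # Skip hidden files, subdirectories, and anything without the .txt extension.
        if name.endswith(".txt") and os.path.isfile(full_path):
            paths.append(full_path)
    return paths
```

Sorting the names is optional; it only makes the order of processing the same on every run, which helps when debugging.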

As before, you will submit your final version to the repository you created for the class in Assignment 1, and you will submit the commit number to Canvas, for the timestamp.

Part 1: Assignment 2 clean-up/fixes and Setup

Address any comments you received for Assignment 2 (if you already got it back), or otherwise make sure your Assignment 2 program is working correctly.
Set up a private repository to work on this assignment. Some options are listed here. You can download the skeleton repository and copy it to your submission repository (but then the instructors will see all your commits; that is not a problem for us so long as you do not add other collaborators to the repository). Alternatively, import or otherwise privately copy the repository we created for Assignment 3 setup. Please do not fork (that has privacy issues because it turns out forks are public). To import, go to https://github.com/new/import and enter the Assignment 3 URL: https://github.com/olzama/Ling471-SP2021-HW3.git. Make sure to create a private repo at this step. In any case, please make sure not to publish any solutions to this assignment anywhere, including but not limited to GitHub.
You will find two files in the skeleton directory: evaluation.py and review_vector.py, as well as a folder for a tiny test dataset. Now add a copy of your Assignment 2 Python program to the private repo in which you will work on Assignment 3. Rename it appropriately: yourNetID_assignment3.py.

Part 2: Running simplistic prediction on many files

Refactor your assignment3.py program so that your main() function iterates over all files in both train/pos and train/neg directories, outputting a prediction for each file.
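The loop over one directory could look roughly like the sketch below; it is not the walkthrough's solution. The function name predict_directory, the label strings "POSITIVE", "NEGATIVE", and "NONE", and the UTF-8 encoding are all assumptions. The predictor is passed in as a parameter so that your own simplisticPrediction() from Assignment 2, whatever its exact signature, can be adapted to it.

```python
import os


def predict_directory(directory, gold_label, predict):
    """Run `predict` on every .txt file in `directory`.

    Returns a list of (filename, gold_label, predicted_label) tuples.
    `predict` is any function that maps review text to a label string;
    predict_directory itself is a hypothetical helper.
    """
    results = []
    for name in sorted(os.listdir(directory)):
        if not name.endswith(".txt"):
            continue  # ignore anything that is not a review file
        # Assumption: the review files are UTF-8 encoded.
        with open(os.path.join(directory, name), encoding="utf-8") as f:
            text = f.read()
        results.append((name, gold_label, predict(text)))
    return results


# Hypothetical use in main(), with pos_dir and neg_dir read from the command line:
#     results = predict_directory(pos_dir, "POSITIVE", my_predictor)
#     results += predict_directory(neg_dir, "NEGATIVE", my_predictor)
```

Keeping the actual label next to each prediction gives you the labeled data points needed later for computing the metrics.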

Part 3: Computing accuracy, and computing precision and recall

Compute accuracy, precision, and recall of your system. The accuracy is for the overall system while precision and recall are with respect to positive and negative review classes, separately. The NONE predictions should be counted as mistakes (think carefully here!).
At the end of your program, output 5 numbers, one per line, rounded to 4 decimal places, in the exact following order:
1. Overall system accuracy
2. Precision wrt positive (training) reviews
3. Recall wrt positive (training) reviews
4. Precision wrt negative (training) reviews
5. Recall wrt negative (training) reviews
Do not output any text; we need just the numbers, for faster grading.
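The three metrics can be sketched as follows, using the standard definitions: precision = TP / (TP + FP) and recall = TP / (TP + FN) for a given class. The function names, the label strings, and returning 0.0 when a denominator is zero are assumptions. Under this particular reading, a NONE prediction is never correct, so it lowers accuracy and counts as a false negative for the file's actual class; check that against your own reasoning about NONE.

```python
def accuracy(gold, predicted):
    """Fraction of items whose predicted label equals the gold label."""
    correct = sum(1 for g, p in zip(gold, predicted) if g == p)
    return correct / len(gold)


def precision_recall(gold, predicted, label):
    """Precision and recall with respect to one class `label`."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == label and p == label)
    fp = sum(1 for g, p in pairs if g != label and p == label)
    # Any other prediction (including "NONE") for a gold `label` item is a miss.
    fn = sum(1 for g, p in pairs if g == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy data, not taken from the assignment dataset.
gold = ["POSITIVE", "POSITIVE", "POSITIVE", "NEGATIVE", "NEGATIVE", "NEGATIVE"]
pred = ["POSITIVE", "NONE", "NEGATIVE", "NEGATIVE", "POSITIVE", "NONE"]
acc = accuracy(gold, pred)                              # 2 of 6 correct
pos_p, pos_r = precision_recall(gold, pred, "POSITIVE")  # 1/2 and 1/3
neg_p, neg_r = precision_recall(gold, pred, "NEGATIVE")  # 1/2 and 1/3
```

In the toy data, the two NONE predictions do not change either precision, but each one lowers the recall of its actual class and the overall accuracy.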

You can use Python's built-in round() function to round the floats. It accepts two arguments: the float and the number of decimal places.
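A short illustration of round() on made-up values (these are not results from the dataset):

```python
# Made-up example values, not results from the assignment data.
values = [0.275149, 0.5, 2 / 3]

# round(x, 4) rounds to the nearest value with 4 decimal places.
rounded = [round(v, 4) for v in values]

for r in rounded:
    print(r)  # one number per line, no extra text
```

Note that round() returns a float, so trailing zeros are not printed: 0.5 stays 0.5 rather than becoming 0.5000.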

The expected output is:
```
 0.2751
 0.5558
 0.3214
 0.8072
 0.2287
```

Part 4: Error analysis

Take note of 5 positive and 5 negative reviews which were classified incorrectly. (You can collect the filenames for which the prediction is incorrect, in a list, and then inspect it in the debugger.) Inspect the contents of these 10 files. What do you notice? What kind of phenomena (syntactic, semantic) can you observe which led to your system mistaking a good review for a bad review, and vice versa?
Why do you think the precision is so much better than the recall?
Why do you think the precision for negative reviews may be so much better than for positive reviews?
Summarize your findings in a text file called yourNetID_EA_assignment3.txt and add it to the repository you created for Assignment 1.
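For the first step above, collecting the misclassified filenames could be sketched like this. It assumes the predictions are stored as (filename, gold label, predicted label) tuples; that layout, the function name, and the label strings are assumptions, not requirements of the assignment.

```python
def misclassified(results, gold_label, limit=5):
    """Return up to `limit` filenames of class `gold_label` that were predicted wrongly.

    `results` is assumed to be a list of (filename, gold, predicted) tuples.
    """
    wrong = [name for name, gold, pred in results
             if gold == gold_label and pred != gold]
    return wrong[:limit]


# Toy data with made-up filenames.
results = [
    ("1_pos.txt", "POSITIVE", "NEGATIVE"),
    ("2_pos.txt", "POSITIVE", "POSITIVE"),
    ("3_neg.txt", "NEGATIVE", "NONE"),
    ("4_neg.txt", "NEGATIVE", "NEGATIVE"),
]
wrong_pos = misclassified(results, "POSITIVE")
wrong_neg = misclassified(results, "NEGATIVE")
```

The two resulting lists can then be inspected in the debugger, and the corresponding files opened by hand.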

Part 5: Submit

Link to the commit you want us to grade in the repository which you created for Assignment 1. In that repository, we should be able to find yourNetID_assignment3.py along with any files it imports, as well as yourNetID_EA_assignment3.txt.

You are now done with Assignment 3!

In the remaining couple of assignments, we will only be using our simplisticPrediction() as a baseline. In other words, we will use more sophisticated models (including off-the-shelf models) and compare their performance to each other and to our simplistic procedure. The purpose of writing simplisticPrediction() ourselves was mostly to practice basic programming.

Ling471

UW Spring 2021 Computational methods for linguists