- Install dependencies
- At a bare minimum, you need to have R installed. Installing RStudio is recommended.
- Must have
python3
installed. Install the required packages. Creating a virtual environment is recommended.
pip install -r requirements.txt
- Download the datasets
In the root directory, run the shell scripts.
./download_kaggle_datasets.sh
./download_torvik_mens_datasets.sh
./download_torvik_womens_datasets.sh
You may need to chmod u+x
.
To download the Kaggle datasets, you need to create and add a Kaggle API key to local environment to use the Kaggle CLI. Consult the Kaggle documentation.
-
Open
notebook.ipynb
in the Jupyter notebok interface and run the first 28 blocks. -
Then run the R scripts in the following order
M_OLRE.R
andW_ORLE.R
M_WinProb.R
andW_WinProb.R
This is enough to do interpretation of the model.
- To generate a csv file to submit to the Kaggle comptition, run the rest of the blocks in
notebook.ipynb