HypOptRL

We used a policy gradient method with multi-head MLP and RNN policy networks to optimize the hyperparameters of an MLP architecture. We obtained test losses very similar to the baseline model while optimizing four hyperparameters: learning rate, hidden size, weight decay, and batch size. The tasks optimized are regression and classification on tabular data: the Wine dataset from UCI and the Letter Recognition multi-class classification task. More experiments could be done on other data modalities and architectures.

Other hyperparameters can be added in the future; CNN architectures could also be experimented with, given more compute resources.
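
A rough, hypothetical sketch of the idea (not the repository's exact code): each policy head samples one value from a discrete search space, the chosen configuration is trained and evaluated, and the negative validation loss serves as the reward in a REINFORCE-style update. The search-space values and the policy/evaluate_config interfaces below are assumptions for illustration.

import torch

# Hypothetical discrete search space for the four optimized hyperparameters.
SEARCH_SPACE = {
    "learning_rate": [1e-4, 1e-3, 1e-2],
    "hidden_size":   [32, 64, 128, 256],
    "weight_decay":  [0.0, 1e-5, 1e-4],
    "batch_size":    [16, 32, 64, 128],
}

def reinforce_step(policy, optimizer, evaluate_config):
    """One policy-gradient step: sample a configuration, evaluate it, update the policy."""
    logits_per_head = policy()  # one logit vector per hyperparameter, in SEARCH_SPACE order
    log_probs, config = [], {}
    for (name, choices), logits in zip(SEARCH_SPACE.items(), logits_per_head):
        dist = torch.distributions.Categorical(logits=logits)
        idx = dist.sample()
        log_probs.append(dist.log_prob(idx))
        config[name] = choices[idx.item()]
    reward = -evaluate_config(config)              # e.g. negative validation loss of the trained MLP
    loss = -reward * torch.stack(log_probs).sum()  # REINFORCE objective
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return config, reward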

Major modules implemented in the code

  • Environment class
  • Multi-head MLP policy network (a minimal sketch follows this list)
  • RNN policy network
  • Building the neural architecture
  • Baseline using the grid search method
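
The multi-head MLP policy can be pictured as a shared trunk with one output head per hyperparameter, each head producing logits over that hyperparameter's candidate values. Below is a minimal, hypothetical PyTorch sketch (not the repository's exact class):

import torch
import torch.nn as nn

class MultiHeadMLPPolicy(nn.Module):
    def __init__(self, num_choices_per_head, input_dim=1, hidden_dim=64):
        super().__init__()
        # Shared trunk followed by one linear head per hyperparameter.
        self.trunk = nn.Sequential(nn.Linear(input_dim, hidden_dim), nn.ReLU())
        self.heads = nn.ModuleList(
            [nn.Linear(hidden_dim, n) for n in num_choices_per_head]
        )
        # Fixed dummy input: the policy is stateless and only learns its logits.
        self.register_buffer("dummy_input", torch.ones(1, input_dim))

    def forward(self):
        h = self.trunk(self.dummy_input)
        return [head(h).squeeze(0) for head in self.heads]

# Example: four heads sized to the number of candidate values per hyperparameter.
policy = MultiHeadMLPPolicy([3, 4, 3, 4])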

Article reference for more details

How to use the code

Set up and run an experiment as follows:

  • Perform any preprocessing of the data beforehand, except for label encoding of the target and scaling, which are already done in the code.
  • Add the URL of the data and the path to the saved model in the config file.
  • Results are saved in the results file, where you can visualize them later.
  • You can run the baseline model to compare results.
  • Below is how to set up the main function and get results.

Clone the repository

git clone https://github.com/AMNAALMGLY/HypOptRL.git

Set up a new environment using requirements.txt from the repo

pip3 install -r requirements.txt 

Set up the configuration in the config.py file

Go to src > config.py and edit the settings (data URL, saved-model path, task, etc.).
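
For illustration only, these are the kinds of entries you might set there (the actual variable names in src/config.py may differ):

# Hypothetical config entries; check src/config.py for the real names.
data_url = "https://example.com/wine.csv"   # URL of the dataset
model_path = "saved_models/policy.pt"       # path to the saved model
task = "regression"                         # or "classification"
model_type = "MLP"                          # architecture whose hyperparameters are tuned
policy = "MLP"                              # or "RNN"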

Run python main.py with command-line arguments or with the edited config file

e.g., to train the regression task with an MLP policy and an MLP neural architecture, run:

python main.py --task regression --model_type MLP --policy MLP 
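
Assuming the same flags accept the other documented options, the classification task with the RNN policy would be run as:

python main.py --task classification --model_type MLP --policy RNN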

To run the baseline that we compared our results to:

python -m src.baseline

TODO

  1. Evaluate on more datasets, hyperparameters, and architectures
  2. Experiment with longer training (more epochs)
  3. Experiment with the actor-critic (A2C) algorithm
  4. Improve documentation

Contributors

(names in alphabetical order)
