Skip to content

neel-singhania/CS698R-NEELABH-SINGHANIA-190538-ASSIGN-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CS698R-NEELABH-SINGHANIA-190538-ASSIGN-1

This is the first assignment of the courrse CS698R which consisted of two questions.

  • The first question was to implement various strategies like Exploration, Exploitation, Epsilon Greedy, decay-Epsilon greedy, Softmax and UCB to determine the how agents perform in a stochastic Environment.
  • The second question was to employ Monte-Carlo and Temporal Difference estimates to determine their relativistic performance of a symmetric random walk environment.

The code implementation is in the main.ipynb notebook. The notebook is well documented to let a user know the logic of codes I implemented.

The environment is in the dir myenv/myenv/env/.

There are three Environments, each for:

  • Two arm bandit problem
  • Ten arm bandit problem
  • Random Walk

To install the gym environment, run:

git clone git@github.com:neel-singhania/CS698R-NEELABH-SINGHANIA-190538-ASSIGN-1.git
cd myenv
pip install -e .

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published