Add stochastic taxi (rainy+fickle) #1315

foreverska · 2025-02-21T01:47:12Z

Description

Adds rainy transition probabilities and fickle passenger to align environment with paper.

Fixes #161

Type of change

New feature (non-breaking change which adds functionality)
This change requires a documentation update

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

pseudo-rnd-thoughts

Hey @foreverska, thanks for the PR.

Generally the PR looks good.
Could you change the np.random to self.np_random and revert the environment version increment (new features that don't affect default behaviour shouldn't require version bumps)

and a couple of questions before approving and merging

Is this backward compatible with default parameters? If it does currently, could we make it backward compatible.
Is the probability of the behaviour reported in step and reset correctly?

foreverska · 2025-02-22T03:03:18Z

@pseudo-rnd-thoughts

Addressed Comments.

Is this backward compatible with default parameters

Yes, default values for rainy and fickle are both False. This aligns functionality with pre-commit default behavior.

Is the probability of the behaviour reported in step and reset correctly?

It matches the other ToyText environments (Cliff/Frozen) with step returning the probability of the taken transition and reset always returning 1. I added lines in the unit test to guard against regression.

…table accuracy

gymnasium/envs/toy_text/taxi.py

tests/envs/test_env_implementation.py

pseudo-rnd-thoughts

With the tests and documentation updates, then should be good to merge.
Thanks for making the changes

gymnasium/envs/toy_text/taxi.py

Kallinteris-Andreas

Looks good to me,

tests/envs/test_env_implementation.py

gymnasium/envs/toy_text/taxi.py

foreverska · 2025-03-05T14:40:16Z

@pseudo-rnd-thoughts I think this needs one more review since there was a commit after your last one. Thanks.

pseudo-rnd-thoughts · 2025-03-08T20:39:01Z

Hey @foreverska, sorry I'm on holiday currently. Looking over the PR again, I'm a tad worried about the is_rainy change.
Is there a way of making the code only run if is_rainy is true?

foreverska · 2025-03-13T01:26:37Z

Hey @foreverska, sorry I'm on holiday currently. Looking over the PR again, I'm a tad worried about the is_rainy change. Is there a way of making the code only run if is_rainy is true?

Please enjoy vacation and ignore this until you're good and rested.

Is the ask to restore the original code and switch to it when it's not rainy and switch to this new code if it is? Or is there a more pointed change you'd like to see?

pseudo-rnd-thoughts · 2025-03-19T13:22:14Z

hi @foreverska, I'm back. Looking over the PR with a week or two rest, I agree that I would prefer if the new code was "disabled" by default and wouldn't run at some, unlike the current solution.
I know that this produces arguably less elegant code, I think it will be better maintenance and for people understanding the code.
Could you make that change?

foreverska · 2025-03-19T23:44:55Z

@pseudo-rnd-thoughts Not a problem at all. Pushed up a change that restores the old code when dry. Let me know what you think.

pseudo-rnd-thoughts · 2025-03-23T15:34:50Z

gymnasium/envs/toy_text/taxi.py

@@ -220,11 +212,148 @@ def __init__(self, render_mode: Optional[str] = None):
                            self.P[state][action].append(
                                (1.0, new_state, reward, terminated)
                            )
+
+    def __build_rainy_transitions(


To minimise changes, could this function, take row, col, pass_idx, dest_idx, action as arguments that we run the original code unless is_rainy then we call this function.
Then return the data.
It just means that we don't need to copy and paste the massive for loop and is clear what the differences between the current functions is

I believe I have done a reasonable job at making the code as reusable as possible. Please let me know if you had something else in mind.

Looks good @foreverska.

One last request is to change the function names to _{name} rather than double underscore to make the style of the rest of the project.
Then we should be good to merge

Roger, adjusted the function names, ready for final review.

Add stochastic taxi (rainy+fickle)

14a0560

pseudo-rnd-thoughts requested changes Feb 21, 2025

View reviewed changes

Address PR comments

68ecf6b

foreverska requested a review from pseudo-rnd-thoughts February 22, 2025 03:04

foreverska added 2 commits February 21, 2025 23:06

Simplified transition logic and implemented unit test for transition …

f4c2964

…table accuracy

documentation update

e537532

Kallinteris-Andreas reviewed Feb 22, 2025

View reviewed changes

gymnasium/envs/toy_text/taxi.py Outdated Show resolved Hide resolved

Adjusting version history to match format shown by cartpole

115fd3e

pseudo-rnd-thoughts requested changes Feb 23, 2025

View reviewed changes

Cleaner code and more complete testing

8d6154b

pseudo-rnd-thoughts reviewed Feb 24, 2025

View reviewed changes

gymnasium/envs/toy_text/taxi.py Show resolved Hide resolved

gymnasium/envs/toy_text/taxi.py Outdated Show resolved Hide resolved

Update documentation

875f7bb

Kallinteris-Andreas reviewed Feb 24, 2025

View reviewed changes

tests/envs/test_env_implementation.py Outdated Show resolved Hide resolved

gymnasium/envs/toy_text/taxi.py Outdated Show resolved Hide resolved

Efficient ignoring of parameters in tests and documentation update

ee9b5a8

pseudo-rnd-thoughts reviewed Feb 25, 2025

View reviewed changes

gymnasium/envs/toy_text/taxi.py Outdated Show resolved Hide resolved

Change to fickle seed and update documentation

116f6e8

foreverska requested a review from pseudo-rnd-thoughts February 26, 2025 14:57

Restore old logic as dry transitions, seperate out wet transitions logic

aad84e3

pseudo-rnd-thoughts requested changes Mar 23, 2025

View reviewed changes

foreverska and others added 5 commits March 23, 2025 15:15

make common code common again

434c8be

There was more code to make resuable

4ddd3e7

match existing style

78d2381

Update code format

2e93e0b

Update version data

d7895db

pseudo-rnd-thoughts approved these changes Mar 25, 2025

View reviewed changes

pseudo-rnd-thoughts merged commit 69471be into Farama-Foundation:main Mar 25, 2025
12 checks passed

Uh oh!

Add stochastic taxi (rainy+fickle) #1315

Add stochastic taxi (rainy+fickle) #1315

Uh oh!

Conversation

foreverska commented Feb 21, 2025

Description

Type of change

Checklist:

Uh oh!

pseudo-rnd-thoughts left a comment

Choose a reason for hiding this comment

Uh oh!

foreverska commented Feb 22, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pseudo-rnd-thoughts left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Kallinteris-Andreas left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

foreverska commented Mar 5, 2025

Uh oh!

pseudo-rnd-thoughts commented Mar 8, 2025

Uh oh!

foreverska commented Mar 13, 2025

Uh oh!

pseudo-rnd-thoughts commented Mar 19, 2025

Uh oh!

foreverska commented Mar 19, 2025

Uh oh!

pseudo-rnd-thoughts Mar 23, 2025

Choose a reason for hiding this comment

Uh oh!

foreverska Mar 25, 2025

Choose a reason for hiding this comment

Uh oh!

pseudo-rnd-thoughts Mar 25, 2025

Choose a reason for hiding this comment

Uh oh!

foreverska Mar 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!