Eis by eisDNV · Pull Request #12 · dnv-opensource/crane-controller

eisDNV · 2026-04-30T10:40:15Z

Improvements to q_agent
Addition of parameter report function in pendulum environment.
Trained start-pendulum model added.


aleksandarbabicdnv · 2026-04-30T12:49:27Z

+            with path.open(encoding="utf-8") as _f:
+                from_dump = json.load(_f)
+            self.previous_steps = int(from_dump["q_agent"]["steps"])
+            self.epsilon = float(from_dump["q_agent"].get("epsilon", 1.0))


@eisDNV read_dumped() restores epsilon (so exploration picks up where it left off) and previous_steps — but silently ignores epsilon-decay and final-epsilon. Those come from whatever was passed to the constructor instead.

So if someone trained with epsilon_decay=0.0005, saved, then loaded to continue — the continuation would run with the constructor default (1e-3) unless they remembered to pass the same value again. The saved value is right there in the file but unused.

That is true. We can change that so the saved epsilon_decay is also used when continuing a training run.

Feel free to implement this change before you approve.

@eisDNV I am not sure that I can commit to your PR, but the change is an addition of two lines after the existing line 313:

```python
# line 313 (existing)
self.epsilon = float(from_dump["q_agent"].get("epsilon", 1.0))

# add after:
self.epsilon_decay = float(from_dump["q_agent"].get("epsilon-decay", self.epsilon_decay))
self.final_epsilon = float(from_dump["q_agent"].get("final-epsilon", self.final_epsilon))
```
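For illustration, here is a minimal self-contained sketch of the restore logic suggested in this thread. `AgentSketch`, `demo()`, the `final_epsilon` default of 0.1 and the temporary dump file are hypothetical stand-ins, not the actual `q_agent` code; only the dump keys and the `read_dumped()` body follow the snippets quoted in this thread.

```python
import json
import tempfile
from pathlib import Path


class AgentSketch:
    """Hypothetical stand-in for the Q-agent, holding only the fields discussed here."""

    def __init__(self, epsilon_decay: float = 1e-3, final_epsilon: float = 0.1) -> None:
        # final_epsilon default is an assumption made for this sketch
        self.epsilon = 1.0
        self.epsilon_decay = epsilon_decay
        self.final_epsilon = final_epsilon
        self.previous_steps = 0

    def read_dumped(self, path: Path) -> None:
        with path.open(encoding="utf-8") as _f:
            from_dump = json.load(_f)
        saved = from_dump["q_agent"]
        self.previous_steps = int(saved["steps"])
        self.epsilon = float(saved.get("epsilon", 1.0))
        # Prefer the saved schedule; fall back to the constructor values
        # when the dump does not contain these keys.
        self.epsilon_decay = float(saved.get("epsilon-decay", self.epsilon_decay))
        self.final_epsilon = float(saved.get("final-epsilon", self.final_epsilon))


def demo() -> AgentSketch:
    """Write a small dump to a temporary file and load it into a fresh agent."""
    dump = {"q_agent": {"steps": 5000, "epsilon": 0.4, "epsilon-decay": 0.0005}}
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "dump.json"
        path.write_text(json.dumps(dump), encoding="utf-8")
        agent = AgentSketch()  # constructed with the default epsilon_decay of 1e-3
        agent.read_dumped(path)
    return agent
```

With this fallback pattern, a continued run picks up the saved `epsilon-decay` of 0.0005 instead of the constructor default, while a dump without `final-epsilon` leaves the constructor value in place.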

aleksandarbabicdnv · 2026-04-30T12:50:35Z

        The environment to be trained. Must provide `.reset()` and `.step()` methods.
    learning_rate : float, optional
        How quickly to update Q-values, in the range (0, 1] (default 0.1).
    initial_epsilon : float, optional


@eisDNV Should this be replaced with epsilon_decay?

Maybe I do not quite understand. Fresh training starts at initial_epsilon and reduces to final_epsilon over the episodes (using epsilon_decay). The learning rate relates to how the Q-values are updated (independent of epsilon_decay).

The constructor no longer has initial_epsilon as a parameter — it was replaced by epsilon_decay in this PR. The docstring still references it, so it needs updating to epsilon_decay.
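To make the distinction in this thread concrete, here is a hedged sketch of the two independent mechanisms. It assumes a linear per-episode decay clamped at `final_epsilon` and a standard tabular Q-learning update. The actual schedule and update in `q_agent.py` may differ, and the function names and the `discount` parameter are hypothetical.

```python
def decay_epsilon(epsilon: float, epsilon_decay: float, final_epsilon: float) -> float:
    """Exploration schedule: lower epsilon by a fixed step, never below final_epsilon.

    Assumes linear decay; a multiplicative schedule would be an equally valid choice.
    """
    return max(final_epsilon, epsilon - epsilon_decay)


def q_update(
    q_old: float,
    reward: float,
    best_next_q: float,
    learning_rate: float = 0.1,
    discount: float = 0.95,
) -> float:
    """Tabular Q-learning update for a single (state, action) entry.

    learning_rate controls how far q_old moves towards the TD target;
    it does not depend on epsilon or epsilon_decay.
    """
    td_target = reward + discount * best_next_q
    return q_old + learning_rate * (td_target - q_old)
```

In this sketch `epsilon_decay` only affects how often a random action is chosen, whereas `learning_rate` only affects the size of each Q-value correction.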


eisDNV added 4 commits April 30, 2026 10:43

Optimisations in q_agent. Added reporting of parameters to pendulum environment.

3162ee5

Fixed conflicts in controlled_crane_pendulum.py

36ec27d

Fixed quality issues

8d5cc93

Fixed remaining quality issues

7578bb9

aleksandarbabicdnv reviewed Apr 30, 2026


Comment thread src/crane_controller/q_agent.py

aleksandarbabicdnv reviewed Apr 30, 2026


Include read epsilon_decay when continuing a training, overwriting the value from the (new) agent. epsilon_decay default changed to 1e-4. New results from q-learning included.

b9ff6d9

eisDNV wants to merge 5 commits into main from eis


2 participants
