Notes
This function is in beta test. Please help improve it in the issues
here.
Traceback (most recent call last): File "autograder.py", line 369, in <module> display=getDisplay(True, options)) File "autograder.py", line 225, in runTest testCase.execute(grades, moduleDict, solutionDict) File "reinforcementTestClasses.py", line 621, in execute if self.testEpsilonGreedy(moduleDict): File "reinforcementTestClasses.py", line 642, in testEpsilonGreedy agent = self.runAgent(moduleDict) File "reinforcementTestClasses.py", line 638, in runAgent agent.update(*lastExperience) File "qlearningAgents.py", line 113, in update sample = reward + self.discount * self.computeValueFromQValues(nextState) File "qlearningAgents.py", line 56, in computeValueFromQValues return max([self.getQValue(state, action) for action in legalActions]) File "qlearningAgents.py", line 56, in <listcomp> return max([self.getQValue(state, action) for action in legalActions]) File "qlearningAgents.py", line 42, in getQValue return float(self.values[(state, action)]) Attribute
Hints
Your answer may be identical to the JOJ answer in the first several lines.
However, the main problem you meet now is Runtime Error. And the exit code of your program is 1, which should be 0.
Please double check your code to solve this problem and try again.
Your Answer
JOJ Answer
*** PASS: ./test_cases/q7/4-discountgrid.test