You are using a neural network to train a robot vacuum to navigate without bumping into objects. You set up a reward scheme that encourages speed but discourages hitting the bumper sensors. Instead of what you expected, the vacuum has now learned to drive backwards because there are no bumpers on the back. This is an example of what type of behavior?

Question

Accepted Answer

B. Reward-hacking

Answer

A. Error-shortcircuiting

Answer

C. Transparency

Answer

D. Interpretability

CT-AI Question #106: Real Exam Question with Answer & Explanation

Question

Options

Explanation

Community Discussion