Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

The Ministry of Love is interested in using reinforcement learning agents and serious gaming to ensure ideological consistency amongst the nation's increasingly online youth. These

The Ministry of Love is interested in using reinforcement learning agents and serious gaming
to ensure ideological consistency amongst the nation's increasingly online youth. These agents
must interact by means of large conversation trees. The ministry wishes to train these agents in
acceptable conversational pathways through a punishment and reward-based system. Choices
in the tree can be grouped into three broad categories namely badthink (i.e. those inconsistent
with current party policy), neutral, and goodthink (i.e. ideologically correct opinions).
ARTIFICIAL INTELLIGENCE
Describe how Q-Learning can be applied to this problem in terms of the following:
a. How rewards and punishments could be modelled (3 marks)
b. How expected values are updated (5 marks)
c. The two modes of operation of such a system (2 marks)
image text in transcribed

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database And Expert Systems Applications Dexa 2021 Workshops Biokdd Iwcfs Mlkgraphs Al Cares Protime Alsys 2021 Virtual Event September 27 30 2021 Proceedings

Authors: Gabriele Kotsis ,A Min Tjoa ,Ismail Khalil ,Bernhard Moser ,Atif Mashkoor ,Johannes Sametinger ,Anna Fensel ,Jorge Martinez-Gil ,Lukas Fischer

1st Edition

3030871002, 978-3030871000

More Books

Students also viewed these Databases questions