Assessments
In reinforcement learning lingo, we call the ______ a policy.
State whether the following statement is True or False: The goal of reinforcement learning is to discover a good strategy. One of the most common ways to do so is by observing the long-term consequences of actions in each state.
We need to train the agent by taking actions in the environment and receiving ______.
State whether the following statement is True or False: With contextual bandits, we cannot introduce the state and make proper use of it.
To retrieve stock prices, we can use the ______ library in Python.
get_prices
plot_prices
yahoo_finance
finance_yahoo