Sunday, June 14, 2020
Computer Science And Information Technology Research - 275 Words
Computer Science And Information Technology Research (Other (Not Listed) Sample) Content: Machine LearningBy NameInstitution of AffiliationCourseDate Machine Learning involves an agent taking actions within an environment that would enable maximization for a reward in the long term and find the policy that is capable of mapping states to the actions that the agent will take in those states. Question 1: Definitions State: It refers to the set of agents and environments that give a vivid illustration of where the action mapping will take place. Action: It refers to the characteristic way of identifying the basic problems of a computer system to initiate computational cognitive science and artificial intelligence. It is associated to the animats and intelligent agents. Reward: It refers to the interpretation of the action in a computer machine learning environment to give a representation of the entire state that is taken back to the agent of the machine learning. It is the scalar immediate outcome of the transition process in the machine learning for basic reinforcement. Question 2: A). Using Bellman Equation To Obtain The Values Of V To 2 Decimal Places. The state at the top left = 1V(s)=r(s)+sp(sÃ¢Ë £s)r(s)Maximization state is given by ttu(ct)The constraints of the state is given by at+1=1(at+ytct)Converting to a stochastic equation, we get 1=Et[u(ct+1)u(ct)] which is formulated by the t factor To give yt+1=yt+à µt Hence, the value of V is yt+1=yt+à µt = 10*2 + 1*2 = 22Top right = 2. the value of V is yt+1=yt+à µt = 0*2 + 0*2 = 0Bottom left = 3. the value of V is yt+1=yt+à µt = 0*2 + 5*2 = 10Bottom right = 4. the value of V is yt+1=yt+à µt = 1*2 + 1*2 = 4B). Using policy iteration to identify the optimal policyStep 1: Identify the optimal policy using policy iteration at a discounted factor of =0.1 and continue with a larger number from 0.99 to 0.9 and then using a smaller number such as 0.01. start with a random policy INCLUDEPICTURE "/afs/cs/project/jair/pub/volume4/kaelbling96a-html/img55.gif" \* MERGEFORMATINET fo r each iteration while evaluating each policy INCLUDEPICTURE "/afs/cs/project/jair/pub/vo...
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.