Computer Architecture > CHEAT SHEET > COMPSCI Final Cheat Sheet - SEARCH MDP II CSP II Iterative (All)

COMPSCI Final Cheat Sheet - SEARCH MDP II CSP II Iterative

Document Content and Description Below

University of California, Berkeley COMPSCI 188 Final CheatSheet SEARCH Cartoon of search tree b - branching factor m - max depth Search Problem - Heuristic h is admissible (optimistic) if 0 ≤... h(A) ≤ cost(A to G) - Heuristic h is consistent if h(A) - h(C) ≤ cost(A to C) - States (congurations of the world) - Actions and costs - Successor function (world dynamics) - Start state and goal state Admissibility vs Consistency CSP I A H C D E F Complete Guaranteed to nd a solution if one exists Optimal Guaranteed to nd the least cost path J - One variable at a time, x ordering - Check constraints as you go Backtracking Search A C D E F H Filtering: Forward Checking ! - Assigns variable, checks to see if ts within constraints Arc Consistency - An arc X -> Y is consistent i for every x in the tail there is y in the head which can be assigned without violating a constraint A C -Algorithm: While not solved: Choose a value that violates the fewest constraints Performance: - R = (# constraints) / (# variables) - Very inecient when at critical ratio R D E F H - Cutset conditioning: instantiate (in all ways) a set of variables such that the remaining constraint graph is a tree- - Cutset size c gives runtime O( (dc) (n-c) d2 ), fast for small c -Method: - Choose a cutset - Instantiate the cut set in all possible ways - Compute Residual CSP for each assignment - Solve the residual CSP. - If X loses a value, neighbors of X need to be rechecked - 1 Consistency (Node Consistency): Each single node’s domain has a value which meets that node’s unary constraints - 2 Consistency (Arc Consistency): For each pair of nodes, any consistent assignment to one can be extended to the other - K Consistency: For each k nodes, any consistent assignment to k-1 can be extended to the kth node. (More work) - Strong k-consistency: also k-1, k-2, … 1 consistent (no longer a need for back tracking) K - Consistency Cutset CSP II - improve a single option until you can’t make it better - faster, less memory, incomplete, suboptimal Iterative Algorithms def minV(state): initialize v = +∞ for each suc of state: v = minV(v, maxV(suc)) return v V(s) = min(V(s’)) for s’ in suc Local Search def maxV(state): initialize v = -∞ for each suc of state: v = maxV(v, minV(suc)) return v V(s) = max(V(s’)) for s’ in suc GAME TREES & UTILITIES Evaluation Functions - We’re computing the MIN-VALUE at some node n - We’re looping over n’s children - n’s estimate of the children's’ min is dropping - Who cares about n’s value? MAX - Let a be the best value that MAX can get at any choice point along the current path from the root - Ideal returns actual minimax value - Tries to estimate utility at a certain position in minimax tree Local Search (minimax) Alpha-Beta Pruning (Min version) - An MDP is dened by: - A set of states s in S - A set of actions a in A - Transition function T(s,a,s’) - Reward function R(s,a,s’) - A start state - Maybe terminal state - Prizes A, B - Lottery L: [p, A; (1-p), B] -Notation: - Preference: A B - Indierence: A ~ B Preferences and Lotteries - If n becomes worse than a, MAX will avoid it, so we can stop considering n’s other children (it’s already bad enough that it won’t be played) - A policy π gives an action for each state - Optimal policy (π*: S --> A) maximizes expected utility - Discount Factor of 0.5: 1*1 + 2*.5 + 3*.75 Optimal Policy MDP I MDP II The value (utility) of a state s: - V*(s) = expected utility starting in s and acting optimally The value (utility) of a q-state (s,a): - Q*(s,a) = expected utility starting out having taken action a from state s and (thereafter) acting optimally The optimal policy: π*(s) = optimal action from state s Values of States Value Iteration Value Iteration Convergence (MDP II) - Basic idea: approximations get rened towards optimal values. - Policy may converge long before values do. Computing Actions from Values Computing Actions from Q-Values Fixed Policy Policy Evaluation - For xed current policy π, nd values with policy evaluation - Still assume MDP - Now T(s,a,s’) and R(s,a,s’) are gone - Still looking for a policy π(s) - For xed values, get a better policy using policy extraction REINFORCEMENT LEARNING I Model-Based Learning - Model-Based Idea: - Learn an approximate model based on experiences - Solve for values as if the learned model were correct - Step 1: Learn empirical MDP model - Count outcomes s’ for each s, a - Normalize to give an estimate of - Discover each when we experience (s, a, s’) - Step 2: Solve the learned MDP - For example, use value iteration, as before Direct Evaluation - Goal: Compute values for each state under π - Idea: Average together observed sample values - Act according to π - Every time you visit a state, write down what the sum of discounted rewards turned out to be - Average those samples [Show More]

Last updated: 1 week ago

Preview 1 out of 4 pages

Reviews( 0 )

Recommended For You

 Financial Auditing> CHEAT SHEET > Capsim Cheat Sheet. CAPISM FOR: Research and Development, Marketing, Production, Finance. (All)

preview
Capsim Cheat Sheet. CAPISM FOR: Research and Development, Marketing, Production, Finance.

capsim cheat sheet draft 1. Avoid loans at all costs during the initail rounds. These will weigh your company down in later rounds and destroy your future margins. 2. Release products in low tech s...

By QuizMaster , Uploaded: Jul 31, 2022

$9

 Psychology> CHEAT SHEET > WJEC Psychology Unit 1 Summary Table (All)

preview
WJEC Psychology Unit 1 Summary Table

An easy-to-read comparison sheet between approaches: Assumptions, Treatment, Classical Research, and Conclusions. The cheat sheet is designed according to the structure of the course, can allow st...

By Mavis Siew , Uploaded: Jan 05, 2023

$7

 *NURSING> CHEAT SHEET > Diabetes Cheat Sheet. (All)

preview
Diabetes Cheat Sheet.

Diabetes Cheat Sheet Drug Interactions Diabetes Drug Interacting Drug(s) Mechanism Repaglinide 2C8 Inhibitors: Gemfibrozil Fluconazole Nicardipine Delvirdine Ketoconazole Repaglinide is a 2C...

By PAPERS UNLIMITED™ , Uploaded: Apr 26, 2023

$7

 *NURSING> CHEAT SHEET > Patient Positioning Cheat Sheet (All)

preview
Patient Positioning Cheat Sheet

Patient Positioning Cheat Sheet 2023

By EVERYSTUDY , Uploaded: Dec 21, 2023

$9.5

 *NURSING> CHEAT SHEET > Pharmacology Cheat Sheet: Bundle- Antibiotics, Antidepressants, Psych Meds, Cardiac Meds, Diuretics, Respiratory Meds, Insulin Guide, Analgesics, All the Most Commonly Used (All)

preview
Pharmacology Cheat Sheet: Bundle- Antibiotics, Antidepressants, Psych Meds, Cardiac Meds, Diuretics, Respiratory Meds, Insulin Guide, Analgesics, All the Most Commonly Used

Bundle includes all 8 pharmacology/ drug cheat sheets. Cheat sheets included: Antibiotics Antidepressants Analgesics Psych meds Cardiac meds Diuretics Respiratory meds Insulin Guide **...

By NurseGrade , Uploaded: Oct 27, 2021

$9.5

 *NURSING> CHEAT SHEET > Peds 1.02 Peds Clinical Cheat sheet (All)

preview
Peds 1.02 Peds Clinical Cheat sheet

Peds 1.02 Peds Clinical Cheat sheet

By EVERYSTUDY , Uploaded: Dec 21, 2023

$15

 Engineering> CHEAT SHEET > AXIALLY LOADED MEMBERS,MECHANICS OF MATERIAL (Tension, Compression, and Shear) (All)

preview
AXIALLY LOADED MEMBERS,MECHANICS OF MATERIAL (Tension, Compression, and Shear)

AXIALLY LOADED MEMBERS,MECHANICS OF MATERIAL (Tension, Compression, and Shear)

By Kimmich32 , Uploaded: Jul 29, 2023

$21.5

 Statistics> CHEAT SHEET > University of California, Davis MAE Statics and Strengths of Materials. Problems CHEAT SHEET. (All Calculations) (All)

preview
University of California, Davis MAE Statics and Strengths of Materials. Problems CHEAT SHEET. (All Calculations)

Problem 1 (10 points). Which of the following two truss designs (A or B) should a designer choose? Please explain by discussing forces in one or two members at Problem 3 (10 points). In the champagne...

By CourseWorks,Inc , Uploaded: Mar 23, 2023

$6

 Mathematics> CHEAT SHEET > NUR 316 - Medical Calculation Quizzes With Accurate Answers (Scored 98%) 2023 (All)

preview
NUR 316 - Medical Calculation Quizzes With Accurate Answers (Scored 98%) 2023

NUR 316 - Medical Calculation Quizzes With Accurate Answers (Scored 98%) 2023

By Academia1434 , Uploaded: Apr 11, 2023

$14

 Chemistry> CHEAT SHEET > OCR June 2022 GCSE (9–1) Chemistry A (Gateway Science) J248 01/02/03/04 Data Sheet (Insert) (All)

preview
OCR June 2022 GCSE (9–1) Chemistry A (Gateway Science) J248 01/02/03/04 Data Sheet (Insert)

June 2022 GCSE (9–1) Chemistry A (Gateway Science) J248 01/02/03/04 Data Sheet (Insert)

By AQA/A LEVEL MASTER , Uploaded: Mar 22, 2023

$5

$7.00

Add to cart

Instant download

Can't find what you want? Try our AI powered Search

OR

GET ASSIGNMENT HELP
57
0

Document information


Connected school, study & course



About the document


Uploaded On

Aug 11, 2022

Number of pages

4

Written in

Seller


seller-icon
QuizMaster

Member since 4 years

1084 Documents Sold


Additional information

This document has been written for:

Uploaded

Aug 11, 2022

Downloads

 0

Views

 57

THE BEST STUDY GUIDES

Avoid resits and achieve higher grades with the best study guides, textbook notes, and class notes written by your fellow students

custom preview

Avoid examination resits

Your fellow students know the appropriate material to use to deliver high quality content. With this great service and assistance from fellow students, you can become well prepared and avoid having to resits exams.

custom preview

Get the best grades

Your fellow student knows the best materials to research on and use. This guarantee you the best grades in your examination. Your fellow students use high quality materials, textbooks and notes to ensure high quality

custom preview

Earn from your notes

Get paid by selling your notes and study materials to other students. Earn alot of cash and help other students in study by providing them with appropriate and high quality study materials.

WHAT STUDENTS SAY ABOUT US


What is Browsegrades

In Browsegrades, a student can earn by offering help to other student. Students can help other students with materials by upploading their notes and earn money.

We are here to help

We're available through e-mail, Twitter, Facebook, and live chat.
 FAQ
 Questions? Leave a message!

Follow us on
 Twitter

Copyright © Browsegrades · High quality services·