feat: 2109s midterm cheatsheet

Yadunand Prem 2024-03-03 15:40:41 +08:00
parent 553439bba7
commit 4aaca9f487
4 changed files with 287 additions and 0 deletions

BIN cs2109s/midterm.pdf (new binary file, not shown)

cs2109s/midterm.typ (new file, 267 lines)

@@ -0,0 +1,267 @@
#import "template.typ": *
// Take a look at the file `template.typ` in the file panel
// to customize this template and discover how it works.
#show: project.with(
  title: "CS2109S",
  authors: (
    "Yadunand Prem",
  ),
)
= Introduction
An agent perceives the environment using *sensors*; percepts then feed into the *agent function*, which maps them to actions, and *actuators* perform the actions that modify the environment. A rational agent chooses the action that maximises its performance measure.
== Environment
- *Fully Observable vs Partially Observable* Agent's sensors have access to the complete state of the environment
- *Deterministic vs Stochastic* Next state is fully determined by the current state and action. If deterministic except for the actions of another agent, then *strategic*
- *Episodic vs Sequential* Each episode consists of the agent perceiving and then performing one action, and the choice of action depends only on the current episode. In sequential environments, the current action affects all future actions
- *Static vs Dynamic* Environment doesn't change while the agent is deliberating. *Semi-dynamic* if only the performance score changes with time
- *Discrete vs Continuous* Limited number of distinct, clearly defined percepts and actions
- *Single Agent vs Multi Agent* Agent operating by itself.
== Structure of Agents
- *Reflex Agents* Select actions based on the current percept, via condition-to-action rules
- *Model Based* Tracks the parts of the world it can't see, updating an internal model through transitions
- *Goal Based* Tracks goals and picks the action that brings it closer to a goal
- *Utility Based* Scores each possible next state and picks the action with the best score
Agent must choose between
- *Exploitation* Maximise utility according to current knowledge of the world
- *Exploration* Learn more about the world
= Solving Problems by Searching
*Problem-Solving Agents* _plan ahead_: they consider a sequence of actions that forms a path to a goal, found through *search*
== Problem Formulation
- *States* Set of possible states for the environment to be in
- *Initial State* Starting State
- *Goal State* End State
- *Actions* Given State `s`, `actions(s)` returns finite set of actions that can be executed in `s`.
- *Transition Model* `transition(s, a)` returns the next state after applying action `a` in state `s`
- *Action cost function* `cost(s, a, s')` gives the cost of applying action `a` to state `s` to reach state `s'`
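A minimal sketch of this formulation for a hypothetical 1-D grid world (the problem and all names are illustrative, not from the notes):
```python
# Hypothetical problem: walk along positions 0..4, from 0 to 4.
INITIAL_STATE, GOAL = 0, 4

def transition(s, a):
    # Next state after applying action a in state s.
    return s - 1 if a == "L" else s + 1

def actions(s):
    # Finite set of actions executable in state s.
    return [a for a in ("L", "R") if 0 <= transition(s, a) <= 4]

def cost(s, a, s_next):
    # Uniform step cost for this toy problem.
    return 1
```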
== Search Algorithms
Evaluation Criteria
- *Time Complexity* No of nodes expanded
- *Space Complexity* Max no of nodes in memory
- *Completeness* Does it always return a solution?
- *Optimality* Does it always find least cost solution?
Measure the above using: branching factor *`b`*, depth of the shallowest goal *`d`*, max depth *`m`*
`f(n)`: the evaluation function `f` applied to node `n`; it determines the order in which nodes are expanded.
=== Uninformed Search / Tree Search
- *Breadth First Search*
- Frontier: Queue
- *`f(n)`*: `d` (depth of next node)
- *Time*: $O(b^d)$
- *Space*: $O(b^d)$
- *Completeness*: Yes, if $b$ is finite
- *Optimality*: Yes, if all step costs are equal
- *Uniform Cost Search* (Dijkstra)
- Frontier: Priority Queue(cost from root to state)
- *`f(n)`*: `tc + c(s, a, s')` (total cost so far + cost of the edge to the next node)
- *Time*: $O(b^(C^(*)\/e))$, where $C^(*)$ is the optimal cost and $e$ is the min edge cost
- *Space*: $O(b^(C^(*)\/e))$
- *Completeness*: Yes, if $e > 0$ and $C^(*)$ is finite
- *Optimality*: Yes, if $e > 0$
- *Depth First Search*
- Frontier: Stack
- *Time*: $O(b^(m))$
- *Space*: $O(b m)$
- *Completeness*: No; fails in infinite-depth spaces or with loops
- *Optimality*: No
- *Depth Limited Search*
- Frontier: Stack, backtrack when depth limit *`l`* is reached
- *Time*: $O(b^(l))$
- *Space*: $O(b l)$
- *Completeness*: No
- *Optimality*: No
- $N_(d l s) = b^0 + b^1 + ... + b^l$
- *Iterative Deepening Search*
- Frontier: Stack; run DLS with depth limit $l = 0, 1, 2, ...$ until a solution is found
- *Time*: $O(b^(d))$
- *Space*: $O(b d)$
- *Completeness*: Yes
- *Optimality*: Yes if step cost is same.
- $N_(i d s) = (d+1)b^0 + (d)b^1 + ... + (1)b^(d)$
- *Bidirectional Search*
- Search from initial state and goal state at same time since $b^(d/2) + b^(d/2) < b^d$.
```python
def search(initial_state, goal):
    frontier = [initial_state]  # queue (BFS), stack (DFS) or priority queue (UCS)
    visited = set()             # if graph search
    while frontier:
        state = frontier.pop(0) # pop(0) for a queue, pop() for a stack
        if state in visited: continue # if graph search
        visited.add(state)            # if graph search
        for action in actions(state):
            next_state = transition(state, action)
            if next_state == goal: return next_state # solution
            frontier.append(next_state)
    return None # failure
```
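For Uniform Cost Search, the frontier becomes a priority queue keyed on path cost. A minimal sketch, assuming the same `actions`, `transition` and `cost` helpers as above:
```python
import heapq

def ucs(initial_state, goal):
    frontier = [(0, initial_state)]  # (path cost g, state)
    best = {}                        # cheapest g seen per state
    while frontier:
        g, state = heapq.heappop(frontier)
        if state == goal: return g   # goal test on pop, so g is optimal
        if state in best and best[state] <= g: continue
        best[state] = g
        for action in actions(state):
            next_state = transition(state, action)
            heapq.heappush(frontier, (g + cost(state, action, next_state), next_state))
    return None # failure
```
Note the goal test happens on pop rather than on generation; testing on generation can return a sub-optimal path.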
== Informed Search Algorithms
- *Greedy Best-First Search*
- Frontier: Priority Queue(`f(n)`)
- *`f(n)`*: `h(n)`: Heuristic: estimated cost from `n` to goal.
- *Time*: $O(b^m)$, Good heuristic improves
- *Space*: $O(b^m)$ Keeps all nodes in memory
- *Complete*: No
- *Optimal*: No (Doesn't consider cost so far)
- *A\**
- Frontier: Priority Queue(`f(n)`)
- *`f(n)`*: `g(n) + h(n)`, `g` : total cost, `h` : estimated cost
- *Time*: $O(b^m)$, Good heuristic improves
- *Space*: $O(b^m)$ Keeps all nodes in memory
- *Complete*: Yes
- *Optimal*: Yes, if `h` is admissible (tree search) / consistent (graph search)
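A* is the same loop with the priority keyed on $f(n) = g(n) + h(n)$. A minimal sketch, assuming a heuristic `h` and the helpers above:
```python
import heapq

def a_star(initial_state, goal, h):
    frontier = [(h(initial_state), 0, initial_state)]  # (f = g + h, g, state)
    visited = set()
    while frontier:
        f, g, state = heapq.heappop(frontier)
        if state == goal: return g
        if state in visited: continue # safe to skip revisits if h is consistent
        visited.add(state)
        for action in actions(state):
            next_state = transition(state, action)
            g2 = g + cost(state, action, next_state)
            heapq.heappush(frontier, (g2 + h(next_state), g2, next_state))
    return None # failure
```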
== Heuristics
=== Admissible
Admissible if for every node $n$, $h(n) lt.eq h^(*)(n)$, where $h^(*)(n)$ is the true cost to reach the goal state from $n$. An admissible heuristic never over-estimates the cost to reach the goal, i.e. it's a conservative estimate.
If $h(n)$ is admissible, A\* using *tree search* is optimal
=== Consistent
If it obeys the triangle inequality: $forall n, h(n) lt.eq c(n, a, n') + h(n')$, i.e. the estimated cost of reaching the goal from $n$ is at most the cost of moving to a neighbour $n'$ plus the estimated cost of reaching the goal from $n'$.
If our heuristic is consistent, $f(n) = g(n) + h(n)$ is non-decreasing along any path, so the first time we expand a node, the cost $g(n)$ of the path used to reach it is guaranteed to be smallest. Therefore we never need to visit a node twice, and graph search (i.e. tree search with memoisation) is optimal.
However, if our heuristic is not consistent, i.e. $h(n) > c(n, a, n') + h(n')$ for some $n$, a node may first be expanded via a more expensive path; when a cheaper path to it is found later, graph search has already marked the node visited and skips it, making graph search sub-optimal.
=== Dominance
If $forall n, h_2(n) gt.eq h_1(n)$ and both are admissible, then $h_2$ dominates $h_1$ and is better for search (it expands no more nodes than $h_1$).
=== Creating Admissible Heuristics
Cost of an optimal solution to a relaxed problem (problem with fewer restrictions) is an admissible heuristic for the original problem
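For example, relaxing the 8-puzzle so a tile may slide onto any adjacent square (occupied or not) yields the Manhattan-distance heuristic. A sketch, with an assumed state encoding that is not from the notes:
```python
def manhattan(state, goal):
    # state, goal: dicts mapping tile -> (row, col); tile 0 is the blank.
    total = 0
    for tile, (r, c) in state.items():
        if tile == 0: continue # the blank doesn't count
        gr, gc = goal[tile]
        total += abs(r - gr) + abs(c - gc)
    return total
```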
== Local Search
Used when the path to the solution doesn't matter and the state itself is the solution (chess, sudoku, bin packing).
Keep the current state and iteratively try to improve it using a heuristic we define. Local search can get stuck in local minima/maxima, so random restarts help find better results.
=== Trivial Algorithms
- *Random Sampling* Random sample a state until solution is found
- *Random walk* Go to random neighbours until solution found
=== Non-trivial Algorithms
- *Hill Climbing* Pick the best among the neighbours, repeat (see the sketch below)
- *Simulated Annealing* Hill climbing, but bad moves are allowed with some probability
- *Beam Search* $k$ hill-climbing searches in parallel, keeping the best $k$ successors
- *Genetic* Cross over the fittest, mutate, repeat
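A minimal hill-climbing sketch, assuming a user-supplied `neighbours` function and a heuristic `h` to maximise:
```python
def hill_climb(state, h, neighbours):
    # Greedily move to the best neighbour until no neighbour improves h.
    while True:
        best = max(neighbours(state), key=h, default=state)
        if h(best) <= h(state): return state # local maximum, possibly not global
        state = best
```
Wrapping this in a loop over random initial states gives the random-restart variant.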
== Adversarial Search
Used when trying to find moves against a rational opponent.
=== Minimax
A heuristic $h(n)$ defines the "goodness" of the current state. We want to _maximise_ $h(n)$, whereas the opponent wants to _minimise_ it.
We can optimise the minimax algorithm with $alpha\/beta$ pruning, where $alpha$ is the best value found so far for the max player and $beta$ is the best value for the min player. If at any node the two do not overlap ($beta lt.eq alpha$), we can prune that node's remaining children.
- *Time*: $O(b^m)$
- *Space*: $O(b m)$
- *Complete* Yes, if tree is finite
- *Optimal* Yes, against optimal opponent
#[
#set text(size: 8pt)
```python
def alpha_beta_search(state):
    v = max_value(state, -INF, INF)
    # return the action in successors(state) whose value equals v

def max_value(state, alpha, beta):
    if is_terminal(state): return utility(state)
    v = -INF
    for action, next_state in successors(state):
        v = max(v, min_value(next_state, alpha, beta))
        if v >= beta: return v # prune: min player avoids this node
        alpha = max(alpha, v)
    return v

def min_value(state, alpha, beta):
    if is_terminal(state): return utility(state)
    v = INF
    for action, next_state in successors(state):
        v = min(v, max_value(next_state, alpha, beta))
        if v <= alpha: return v # prune: max player avoids this node
        beta = min(beta, v)
    return v
```
]
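Pruning is most effective when the best moves are searched first: with perfect move ordering, alpha-beta examines only $O(b^(m\/2))$ nodes, roughly doubling the depth searchable within the same budget.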
= Machine Learning and Decision Trees
== Supervised Learning
Learns from being given the right answers
- *Regression*: Predict Continuous outputs
- *Classification*: Predict discrete outputs
Assumption: $y$ is generated by true mapping function $f: x arrow.r y$. We want to find a hypothesis $h:x arrow.r hat(y)$
== Performance Measure
We can measure error for regression:
- Absolute Error: $|hat(y) - y|$
- Squared Error: $(hat(y) - y)^2$
For a set of $N$ examples, we can compute average error for regression:
- Mean Squared Error: $1/N sum^N_(i=1)(hat(y)_i - y_i)^2$
- Mean Absolute Error: $1/N sum^N_(i=1)|hat(y)_i - y_i|$
Average correctness for classification:
$"Accuracy" = 1/N sum^N_(i=1)1_(hat(y)_i = y_i)$
Confusion Matrix:
$"Accuracy" = ("TP" + "TN")/("TP" + "FN" + "FP" + "TN")$
$"Precision P" = ("TP")/("TP" + "FP")$ (Maximise if FP is very costly, e.g. email spam)
$"Recall R" = ("TP")/("TP" + "FN")$ (Maximise if FN is bad, e.g. Cancer)
$"F1" = 2/((1/P) + (1/R))$ (Maximise if FN is bad, e.g. Cancer)
== Decision Trees
Choosing an attribute to split a decision tree: ideally, select the attribute that splits the examples into pure (all-positive or all-negative) groups.
The amount of information at a given node is the entropy, $I(P(v_1), ..., P(v_n)) = -sum^n_(i=1)P(v_i)log_2 P(v_i)$, where $v_i$ are the different classifications of the dataset.
For a binary classification (Positive, Negative): $I(p/(p+n), n/(p+n)) = -p/(p+n)log_2 p/(p+n) - n/(p+n)log_2 n/(p+n)$
When an attribute is divided into subsets, we can calculate the _Information Gain_ (reduction in entropy) by
$"IG"("Attrib") = I(P(p/(p+n)), P(n/(p+n))) - sum^v_(i=1)(p_i+n_i)/(p+n)I(p_i/(p_i+n_i), n_i/(p_i+n_i))$
= Misc
== Minimax Intuition
- In Maximiser:
- When $v >= alpha$, update #math.alpha.
- If $v >= beta$, prune the rest
- In Minimiser:
- When $v <= beta$, update #math.beta.
- If $v <= alpha$, prune the rest
== Proof of Admissibility
- If $h_a$ is the cost of an optimal solution to a relaxed version of the problem, then $h_a$ is an admissible heuristic for the original problem
- Check if $h_a ("Goal") > 0$. If it is, then it's inadmissible
- Compare the cost of a move against the maximum rate at which the heuristic can decrease. This is the intuition when checking consistency: if one move improves the heuristic "too much", it's likely inconsistent. For example, in the snake question, if eating the apple improves the heuristic by 5 while the move costs only 1, we can immediately say it's inconsistent.
- Prove that it fails by counterexample (or contradiction / contrapositive)
- Prove that it passes by contradiction / contrapositive
- Prove that it passes by abuse of inequalities
- Prove that it passes by proving correctness for the extreme cases, then arguing everything in between is also correct (dodgy proof technique)
- Lemma: consistency together with $h("Goal") = 0$ guarantees admissibility
- Contrapositive of this lemma: if inadmissible and $h("Goal") = 0$, then inconsistent
== Proof of Inadmissibility
- Show that $exists n, h_a (n) > h^(*)(n)$
== Proof of Inconsistency
- First show that $h_a ("Goal") = 0$
- Try to show that $h_a (n) > c(n, a, n') + h_a (n')$ for some $n$. Assuming $n'$ is the goal (so $h_a (n') = 0$) and $c$ is the true cost function, it suffices to show $h_a (n) > c(n, a, n')$, i.e. the heuristic exceeds the true cost of the move
- If every move costs 1, consistency requires the heuristic to drop by at most 1 per move, i.e. $h_a (n) - h_a (n') lt.eq 1$; look for a move that violates this
== Information Gain
- Pick the attribute with the lowest remainder (equivalently, the highest information gain)

BIN cs2109s/template.pdf (new binary file, not shown)

cs2109s/template.typ (new file, 20 lines)

@@ -0,0 +1,20 @@
// The project function defines how your document looks.
// It takes your content and some metadata and formats it.
// Go ahead and customize it to your liking!
#let project(title: "", authors: (), body) = {
  // Set the document's basic properties.
  set document(author: authors, title: title)
  set page(
    margin: (left: 10mm, right: 10mm, top: 10mm, bottom: 10mm),
    numbering: "1",
    number-align: center,
    flipped: true,
  )
  set text(font: "Source Sans Pro", lang: "en", size: 8pt)
  set heading(numbering: "1.1")
  set par(justify: true)
  show: columns.with(4, gutter: 1em)
  body
}