Skip to main content

[Machine Learning]: #4 Multi-Variables Linear Regression

[Machine Learning]: Multi-Variables Linear Regression



****************************************

[Write on front]:

lets us consider the house price prediction problem,  where the house price is related many parameters, i.e. house year, house size, number of bedrooms, house location......., the problem we are solving before is just one parameter. How to do the multi-variable linear regression is today's topic.

*********************************************************************************
[Multiple Features]:

We now introduce notation we gonna use in this chapter.
\(x_j^i\) is the value of feature \(j\) in the \({i^{th}}\) training example.
\(x^i\) is the input (features) of the  \({i^{th}}\) training example.
\(m\) is the number of training example.
\(n\) is the number of features.

Features are the parameters, of house size, number of bedrooms ......
*********************************************************************************

[Hypothesis Function of Multi-Variables]:

The hypothesis function is as follows:
$${H_\theta }(x) = {\theta _0}{x_0} + {\theta _1}{x_1} + {\theta _2}{x_2} + {\theta _3}{x_3} + .... + {\theta _n}{x_n}$$

To simplify the function, we use matrix to represent:
$${H_\theta }(x) = [{\theta _0},{\theta _1},{\theta _2}......{\theta _n}]\left[ {\matrix{
   {{x_0}}  \cr
   {{x_1}}  \cr
   {...}  \cr
   {{x_n}}  \cr

 } } \right] = {\theta ^T}X$$
*********************************************************************************

[How to come up with a good hypothesis function?]:

Note:
    if you forget Cost Function, please review:

Back to the question of how to solve the hypothesis function,  Gradient Descent, obviously!

repeat the following function until it converage:
\[{\theta _0} = {\theta _0} - \alpha \frac{1}{m}\sum\limits_{i = 1}^m {({H_\theta }({x^i}) - {y^i})x_0^i} \]
\[{\theta _1} = {\theta _1} - \alpha \frac{1}{m}\sum\limits_{i = 1}^m {({H_\theta }({x^i}) - {y^i})x_1^i} \]
\[{\theta _2} = {\theta _2} - \alpha \frac{1}{m}\sum\limits_{i = 1}^m {({H_\theta }({x^i}) - {y^i})x_2^i} \] ......
\[{\theta _n} = {\theta _n} - \alpha \frac{1}{m}\sum\limits_{i = 1}^m {({H_\theta }({x^i}) - {y^i})x_n^i} \]

In other words:
\[{\theta _j} = {\theta _j} - \alpha \frac{1}{m}\sum\limits_{i = 1}^m {({H_\theta }({x^i}) - {y^i})x_j^i} \]
where \(j\) is the number of parameters.

Please Note that we have to update \({\theta_j}\) simultaneously.
*********************************************************************************

[Some tips to make the Gradient Descent Faster]

#1. [Feature Scaling]:

The problem: may have when we decide to use the Gradient Descent to find the opt, is  as the picture shows:

\({\theta_j}\) will decrease quickly on the small range and slowly in the large range.

idea: to make the features are on the same scale sowe want to modify the range of our input variables so that they are roughly the same. Ideally:
\[ - 1 \le {x_i} \le 1\]
These are not the exact requirement, range from -3 to 3 is ok.

method: Feature Scaling and Mean Normalization, Feature Scaling involves dividing the input values by the range(i.e. max-min). Mean Normalization using:\[{x_i} = \frac{{{x_i} - {\mu _i}}}{{{s_i}}}\], where \({{\mu _i}}\) is the average, \({{s_i}}\) is the range value(max-min).

example: \({x_i}\) is the house price with the range of 100 to 2000 and the mean is 1000, then$${x_i} = {{price - 1000} \over {1900}}$$

#2. [Learning Rate \(\alpha\)]:

[Is the Gradient Descend Work?]:
As shown in the picture, if the Gradient Descend Algo works, the value of Cost Function \(J(\theta )\) has to decrease all the way. if the plot graph start to increase, change the value of \(\alpha\), this is so big. Similarly, if the Cost Function plot goes up and down, make your \(\alpha\) smaller.

[How the \(\alpha\) influent our Cost Function?]:
if \(\alpha\) is too small -> slow convergence;
if \(\alpha\) is too big, maybe cannot converge;

[How to choose \(\alpha\)?]
Try it!















Comments

Popular posts from this blog

[LeetCode Solution 230]: Kth Smallest Element in a BST

Question: Given a binary search tree, write a function  kthSmallest  to find the  k th smallest element in it. ************************************************************************************************************************************ Write Infront To read to a tutorial, please to read the tutorial of in-order traversal of BST, please check: LeetCode Solution 94: Binary Tree Inorder Traversal We are going to solve this question using the following 4 methods: ->Binary Search ->Recursive ->Iterative ->Morris  Approach #1 Binary Search [Accepted] Detail Explanation The first method to solve this problem is using Binary Search. The idea is very easy and extremely to think. We use BST's property that the left child of the root is smaller than the root while the right child of the root is always bigger. We consider that the root is the pivot, and find the number of the nodes in the left subtree and the number of ...

[LeetCode Solution 145] Binary Tree Postorder Traversal

[LeetCode Solution 145]: Binary Tree Postorder Traversal Question: Given a binary tree, return the  postorder  traversal of its nodes' values. For example: Given binary tree  {1,#,2,3} , 1 \ 2 / 3 return  [3,2,1] . Approach #1 Recursive [Accepted] Detail Explanation The first method to solve this problem is using recursive. This is the classical method and straightforward. we can define a helper function to implement recursion. The java code is as follows: Java public class Solution { public List<Integer> postorderTraversal (TreeNode root) { List<Integer> res = new ArrayList<>(); helper(root, res); return res; } public void helper (TreeNode root, List<Integer> res) { if (root != null ) { if (root.left != null ) { helper(root.left, res); } if (root.right != null ) { helper(root.right, res); } res.add(root.val); } } } Complexity Analysis Ti...

[Interview]: URLify

[Interview]  URLify: -------------------------------------------------------------------------------------------------------------------------- Question: URLify: Write a method to replace all spaces in a string with ‘%20’, you may assume that the string has sufficient space at the end to hold the additional characters. Example  input: ' mr john smith '  output: ' mr %20john%20smith' --------------------------------------------------------------------------------------------------------------------------   Idea 1:  Start from the back and start replacing until the character is not ' ', and replace the characters in reverse order. Solution 1: public class Solution{ public String replace(char[] str) { boolean flag = false; StringBuffer sb = new StringBuffer(); for (int i = str.length - 1; i >= 0; i--) { if (str[i] != ' ') flag = true; if (flag == true) { if (str[i] == ' ') { s...