Commit 643cce5

Update interview_prep.md

Author: Amogh Singhal

1 parent 2bec118, commit 643cce5

1 file changed: +11 -0 lines changed
interview_prep.md

@@ -4,6 +4,17 @@ When two or more predictors are highly correlated to each other such that one pr
can be derived using a linear combination of the other predictors, then the predictors are said to be collinear.

### 2. What is the difference between standardisation and normalization? Why is it useful?
Standardisation is a scaling technique in which values are shifted and rescaled so that the mean is 0 and the variance is 1.

Normalization is a scaling technique in which values are shifted and rescaled so that they end up ranging between 0 and 1. It is also known as Min-Max scaling.
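As a minimal sketch of the two definitions above, assuming NumPy and a non-constant one-dimensional input, both rescalings can be written in a few lines. The function names `standardise` and `min_max_normalise` and the sample values are illustrative only, not taken from the original notes.

```python
import numpy as np


def standardise(x):
    """Shift and rescale so the result has mean 0 and variance 1."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()


def min_max_normalise(x):
    """Shift and rescale so the result lies between 0 and 1."""
    x = np.asarray(x, dtype=float)
    return (x - x.min()) / (x.max() - x.min())


# Made-up sample data, chosen only for illustration.
values = np.array([10.0, 20.0, 30.0, 40.0, 50.0])

z = standardise(values)        # mean ~0, variance ~1
m = min_max_normalise(values)  # smallest value maps to 0, largest to 1

print(z.mean(), z.var())
print(m)
```

For this sample, `m` holds 0, 0.25, 0.5, 0.75 and 1, while `z` keeps the same spacing but is centred on 0. Both functions divide by zero if every input value is identical, so a real pipeline would need to guard against that case.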

* Algorithms trained with gradient-descent-based optimisation (linear regression, logistic regression, neural networks) benefit from scaled features, because optimisation is faster and convergence is more stable.
* **Having features on a similar scale helps gradient descent converge more quickly towards the minimum.**
* Distance-based algorithms such as KNN, K-means, and SVM are the most affected by the range of features, because they use distances between data points to determine their similarity.
* **Therefore, we scale our data before using a distance-based algorithm so that all the features contribute equally to the result.**

![](https://i.pinimg.com/originals/1c/16/04/1c160466f8bfd26ca66a44f79514fb5d.jpg)
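The gradient descent point in the list above can be checked with a small experiment. This is a rough sketch under stated assumptions, not a benchmark: the data is synthetic, the helper `gd_steps` is a hypothetical name, and the step size is set to one over the largest Hessian eigenvalue so that both runs use a comparable, stable learning rate.

```python
import numpy as np


def gd_steps(X, y, tol=1e-8, max_steps=10_000):
    """Run batch gradient descent on mean squared error.

    Returns the number of steps taken until the loss is within `tol`
    of the least-squares optimum, or `max_steps` if that never happens.
    """
    n = len(y)
    hessian = 2.0 * X.T @ X / n
    # Stable step size: 1 / largest eigenvalue of the Hessian.
    lr = 1.0 / np.linalg.eigvalsh(hessian).max()

    # Reference optimum, used only as a stopping criterion.
    w_best = np.linalg.lstsq(X, y, rcond=None)[0]
    best_loss = np.mean((X @ w_best - y) ** 2)

    w = np.zeros(X.shape[1])
    for step in range(max_steps):
        residual = X @ w - y
        if np.mean(residual ** 2) - best_loss < tol:
            return step
        w -= lr * (2.0 / n) * (X.T @ residual)
    return max_steps


# Synthetic data: one feature on a scale of ~1, one on a scale of ~1000.
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(0.0, 1.0, n)
x2 = rng.normal(0.0, 1000.0, n)
X = np.column_stack([x1, x2])
y = 3.0 * x1 + 0.002 * x2 + rng.normal(0.0, 0.1, n)

X_centred = X - X.mean(axis=0)         # centred, but scales left as they are
X_std = X_centred / X.std(axis=0)      # standardised
y_centred = y - y.mean()

steps_raw = gd_steps(X_centred, y_centred)
steps_scaled = gd_steps(X_std, y_centred)
print(steps_raw, steps_scaled)
```

With the unscaled features, the large-scale column forces a tiny learning rate, so the weight on the small-scale column barely moves and the run hits the step limit. After standardisation the loss surface is close to round and the same procedure converges in a handful of steps.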
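The point about distance-based algorithms can be illustrated with a tiny nearest-neighbour lookup. The sketch below uses made-up rows of age and income, and the helper `nearest_to_first` is hypothetical. It shows how the large-range feature decides the nearest neighbour until the data is standardised.

```python
import numpy as np

# Hypothetical data: columns are [age in years, income in dollars].
X = np.array([
    [25.0, 50_000.0],  # row 0: the query point
    [60.0, 50_100.0],  # row 1: very different age, almost the same income
    [26.0, 51_000.0],  # row 2: almost the same age, slightly different income
    [40.0, 20_000.0],  # row 3
    [35.0, 90_000.0],  # row 4
])


def nearest_to_first(data):
    """Return the index of the row closest to row 0 (Euclidean distance)."""
    dists = np.linalg.norm(data[1:] - data[0], axis=1)
    return int(np.argmin(dists)) + 1  # +1 because row 0 was skipped


# Standardise each column: mean 0, variance 1.
X_scaled = (X - X.mean(axis=0)) / X.std(axis=0)

raw_neighbour = nearest_to_first(X)
scaled_neighbour = nearest_to_first(X_scaled)

print(raw_neighbour)     # 1: income dominates, so the 60-year-old looks closest
print(scaled_neighbour)  # 2: both features count, so the 26-year-old is closest
```

On the raw data an income gap of 100 dollars outweighs an age gap of 35 years simply because income is measured in larger numbers. After scaling, each feature contributes in proportion to how unusual the difference is within its own column.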
### 3. What is the central limit theorem? Why is it useful?
### 4. What is the interquartile range? Why is it useful?
### 5. What is the difference between a t-test and a z-test? Why is it useful?
