Image data
- methods to represent image data:
- RGB(r,g,b)
- r,g,b → [0, 255]
- White: (255,255,255)
- Grayscale
- Image size
- Size = Width * Height * #Channels (see the sketch after this list)
- Image datasets:
- ImageNet
- MNIST
- CIFAR-10
- …
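As a quick illustration of the size formula above, a minimal Python sketch (the 224×224 resolution is just an assumed example, not from the notes):

```python
# Raw (uncompressed) image size = Width * Height * #Channels
# Assumed example: a 224x224 RGB image, 1 byte per channel value (0-255)
width, height, channels = 224, 224, 3
size_in_bytes = width * height * channels
print(size_in_bytes)  # 150528 bytes, roughly 147 KB
```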
Unsupervised: Image clustering
- Basics:
- Cluster images based on their “similarity”
- Visual, distance, correlation, …
- Extract features with any suitable method and use k-means to cluster (see the sketch after this list)
- Useful when labels are unknown and labeling is expensive
- Can be subjective and inaccurate
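A minimal sketch of this clustering pipeline, assuming NumPy and scikit-learn are available and using flattened raw pixels as the features (any other feature extractor could be substituted):

```python
import numpy as np
from sklearn.cluster import KMeans

# Toy stand-in data: 100 grayscale 28x28 images with no labels
# (shapes chosen to resemble MNIST; replace with real image data).
images = np.random.rand(100, 28, 28)

# "Extract features": here we simply flatten each image into a vector;
# any other feature extractor (e.g. a pretrained network) could be used instead.
features = images.reshape(len(images), -1)

# Cluster the feature vectors into 10 groups with k-means.
kmeans = KMeans(n_clusters=10, n_init=10, random_state=0)
cluster_ids = kmeans.fit_predict(features)  # one cluster id per image
```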
Supervised: Deep Feedforward Network
- Basics:
- AKA Multilayer perceptrons
- Feedforward because information flows in only one direction
- Connections that also run in the reverse direction are called feedback connections
- Deep because there are many hidden layers
- Depth: Number of hidden layers
- Width: Number of nodes on ONE hidden layer
- Why deep, not wide: a single layer may need to be impractically large to achieve what multiple layers can, among other reasons
- Weights & bias: associated with each neuron
- Activation function
- $\sigma(z)$
- Essentially f(z), but written with $\sigma$ by convention
- How a neuron computes its output:
- On each neuron, it takes all the $a$ from the previous layer
- It then computes a value $z$ from those incoming $a$ values
- $z = a_1w_1 + a_2w_2 + \dots + a_kw_k + b$
- $w$ and $b$ are constantly adjusted during training
- A $w$ is assigned to each edge
- Use an activation function to convert $z$ into $a$
- $z \rightarrow \sigma(z) \rightarrow a$
- That $a$ is then passed to the next layer (see the sketch after this list)
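A minimal NumPy sketch of one neuron's computation as described above, using the sigmoid as the activation function $\sigma$ (all numeric values are arbitrary examples):

```python
import numpy as np

def sigmoid(z):
    # Activation function sigma(z): squashes any real z into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

# Activations a1..ak coming from the previous layer (arbitrary example values)
a_prev = np.array([0.5, -1.2, 0.3])
# One weight per incoming edge, plus the neuron's bias
w = np.array([0.4, 0.1, -0.7])
b = 0.05

# z = a1*w1 + a2*w2 + ... + ak*wk + b
z = np.dot(a_prev, w) + b

# a = sigma(z); this a is what the next layer receives
a = sigmoid(z)
```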
Model training
- Parameter types
- Parametric: logistic classifier
- Non-Parametric: KNN
- Hyperparameters: predefined / fixed
- Parameters: updated during model training
- Loss function
- Measures how far the predicted result deviates from the actual result; the smaller, the better
- $\large{L_{data} = \sum_{i=1}^{N} L_i}$
- For classification tasks, the loss function can be defined using cross-entropy (see the sketch below)
- Need to control overfitting using regularization methods
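A small sketch of a cross-entropy data loss, summing $L_i$ over the examples as in the $L_{data}$ formula above (the probabilities and labels are made-up values):

```python
import numpy as np

def cross_entropy_loss(probs, labels):
    # probs: (N, C) predicted class probabilities; labels: (N,) true class indices.
    # Per-example loss L_i = -log p_i(true class); L_data sums L_i over all N examples.
    eps = 1e-12  # guard against log(0)
    return -np.sum(np.log(probs[np.arange(len(labels)), labels] + eps))

# Two examples, three classes; both predictions are confident and correct,
# so the loss is small.
probs = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.8, 0.1]])
labels = np.array([0, 1])
print(cross_entropy_loss(probs, labels))
```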
- Gradient descent
- how to find the best:
- Calculate L
- Update W in next step as:
- $W_{k+1} \leftarrow W_k - \alpha \nabla_W L$
- $b_{k+1} \leftarrow b_k - \beta \nabla_b L$
- Repeat until the gradient is small enough or $k$ reaches a preset limit (see the sketch below)
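A minimal sketch of these update rules, using a tiny linear model with a squared-error loss so the gradients can be written by hand (the learning rates and data are arbitrary assumptions):

```python
import numpy as np

alpha, beta = 0.1, 0.1              # learning rates for W and b (assumed values)
W, b = np.zeros(2), 0.0             # parameters to learn

x = np.array([1.0, 2.0])            # a single training example
y = 3.0                             # its target value

for k in range(1000):
    pred = W @ x + b
    # Squared-error loss L = (pred - y)^2, with hand-derived gradients:
    grad_W = 2 * (pred - y) * x     # gradient of L w.r.t. W
    grad_b = 2 * (pred - y)         # gradient of L w.r.t. b
    # Update rules: W_{k+1} <- W_k - alpha * grad_W, b_{k+1} <- b_k - beta * grad_b
    W = W - alpha * grad_W
    b = b - beta * grad_b
    if np.linalg.norm(grad_W) < 1e-6 and abs(grad_b) < 1e-6:
        break                       # stop once the gradient is small enough
```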
- Number of parameters:
- param number = input_shape x layer width (W) + layer width (b)
- 1st hidden layer: input_shape = shape of input
- later hidden layers & output layer: input_shape = layer width of prev layer
- for the output layer, layer width = number of output classes (see the worked example below)
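A worked example of the parameter-count formula, assuming a hypothetical network with 784 inputs (28×28 images), hidden layers of width 128 and 64, and 10 output classes:

```python
# Assumed architecture: 784 inputs (28x28 image), hidden widths 128 and 64, 10 output classes
layer_widths = [784, 128, 64, 10]

total = 0
for in_shape, width in zip(layer_widths[:-1], layer_widths[1:]):
    # params per layer = input_shape * layer_width (weights) + layer_width (biases)
    total += in_shape * width + width

print(total)  # 100480 + 8256 + 650 = 109386
```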