What do you mean by saturation in neural network training? Discuss the problems associated with saturation
What is an activation function? What are the different types of activation functions? Discuss their pros and cons