recategorized by
809 views
0 votes
0 votes
Let $f: \mathbb{R} \rightarrow \mathbb{R}$ be the function $f(x)=\frac{1}{1+e^{-x}}$.

The value of the derivative of $f$ at $x$ where $f(x)=0.4$ is $\_\_\_\_\_\_\_$. (rounded off to two decimal places).

Note: $\mathbb{R}$ denotes the set of real numbers.
recategorized by

1 Answer

2 votes
2 votes
Here, $f(z)=\frac{1}{1+e^{-z}}$

$f'(z) =\frac{e^{-z}}{(1+e^{-z})(1+e^{-z})}=\frac{1}{(1+e^{-z})}\frac{1+e^{-z}-1}{(1+e^{-z})}$

$f'(z)=\frac{1}{(1+e^{-z})} \left(1-\frac{1}{1+e^{-z}}\right)$

$f'(z)=f(z)(1-f(z))$

Say at $z_0,$ $f(z_0)=0.4$ then

$f'(z_0)=f(z_0)(1-f(z_0))=0.4 \times 0.6= 0.24$

$\textbf{Note:}$

$(1)$ This function $f(z)$ is present in most of textbooks because it is a very famous type of sigmoid activation function which is called as logistic function. This logistic function is highly non-linear and used in places like logistic regression and the artificial neural networks.

$(2)$ The way we have written the derivative $f'(z)=f(z)(1-f(z))$ has a special purpose. For large $z$, $f'(z) \rightarrow 0$ and we get very slow weight updates using the gradient descent and it creates a problem vanishing gradient while minimizing the loss function. ReLU (Rectified Linear Unit) is resistant to this problem.

$(3)$ To remove the $f'(z)$ from the picture, we use the chain rule and write the loss function ing a different way which is called as Binary Cross Entropy Loss in the context of logistic regression.
Answer:

Related questions

958
views
2 answers
0 votes
Arjun asked Feb 16
958 views
The sample average of $50$ data points is $40$. The updated sample average after including a new data point taking the value of $142$ is $\_\_\_\_\_\_\_\_$.
1.1k
views
1 answers
0 votes
Arjun asked Feb 16
1,136 views
Consider a joint probability density function of two random variables $X$ and $Y$\[f_{X, Y}(x, y)=\left\{\begin{array}{rll}2 x y, & 0<x<2, & 0<y<x \\ 0, & \text { otherwise ... array}\right.\]Then, $E[Y \mid X=1.5]$ is $\_\_\_\_\_\_\_\_\_$
1.2k
views
1 answers
0 votes
Arjun asked Feb 16
1,174 views
Let $\text{u}=\left[\begin{array}{l}1 \\ 2 \\ 3 \\ 4 \\ 5\end{array}\right]$, and let $\sigma_{1}, \sigma_{2}, \sigma_{3}, \sigma_{4}, \sigma_{5}$ be the singular values ... . The value of $\sum_{i=1}^{5} \sigma_{i}$ is $\_\_\_\_\_\_\_\_\_$
996
views
1 answers
0 votes
Arjun asked Feb 16
996 views
Details of ten international cricket games between two teams "Green" and "Blue" are given in Table $\mathrm{C}$. This table consists of matches played on different pitches, across ... Blue$7$S$O$Green$8$F$T$Blue$9$F$O$Blue$10$S$O$Green