Understanding Softmax

When running an inference server, you can choose settings like temperature, top-p, and top-k. To understand these values, we really just need an understanding of the softmax activation function. I couldn’t really find one single ...