| ▲ | Approximating Hyperbolic Tangent(jtomschroeder.com) | |
| 37 points by jtomschroeder 6 hours ago | 4 comments | ||
| ▲ | raphlinus 4 hours ago | parent | next [-] | |
A different approach, refining the square root based sigmoid with a polynomial, is in my blog post "a few of my favorite sigmoids" [1]. I'm not sure which is faster without benchmarking, but I'm pretty sure its worst case error is better than any of the fast approximations. [1]: https://raphlinus.github.io/audio/2018/09/05/sigmoid.html | ||
| ▲ | AlotOfReading 4 hours ago | parent | prev | next [-] | |
A different floating point hack makes exp() easier to compute in hardware (and consequently tanh). You cast the input to an int and take the first 2 bits of what would be the mantissa. LUT[Index] and LUT[Index+1] from your 5-entry table are used to either lerp or poly approx. the function, with the remaining mantissa bits to help. | ||
| ▲ | agalunar 5 hours ago | parent | prev | next [-] | |
There’s an analysis of the Schraudolph approximation of the exponential function (along with an improvement upon it) that someone might find interesting at https://typ.dev/attention#affine-cast | ||
| ▲ | mjcohen 5 hours ago | parent | prev [-] | |
Looks interesting. Should start with a definition of the Hyperbolic Tangent. It is only about 2/3 of the way that the definition occurs in a discussion of computing exp(x). | ||