Witth a precalculated inverse table, division by inverse multiplication is roughly 20% faster at n=33m+. This method eats a LOT of memory for the 20% gain. Timed, (125/128)s, precalculated table, (75/64)s, using 32-bit operands and double prescision tables.
At single precision table, the difference is quite a bit more significant, being 11/128s, versus (75/64)s, nearly 14 times faster. I'd use the single-precision inverse table *IFF* speed is your passion. Take my word for it. I've tested it already. These numbers apply whether assignment is to double or single.
Assignment to integer using single-precision inverse table is 25% faster than division.
Single prescision inverse table seems to be a very good bet for speed AND accuracy. Oh yeah, some changes to the size of the table had to be made because I wanted to reduce effects of background processes on timings, so those are for 33m+. My answer still stands: single-precision inverse table, regardless of precision of operands.