For the absolute FASTEST filled circle routine, what one could do is simply pre-calculate the start and end points for each line of a circle of X size, and then save that data into an array. Then, instead of the program having to do any math, it'd just lookup the plot points directly from memory, and insta-draw the circle.
Take all the math out ahead of time, utilize a lookup table, and it'd be much quicker than anything anybody has shared previously. Only drawback would be the memory required to store the lookup table, which, in regards to modern OS memory limits, really shouldn't be that big an issue at all, honestly.