Author Topic: numerically stable average (Read 6807 times)

codeguy · « **on:** October 11, 2018, 07:23:33 pm »

In this post are two methods for calculating the average of a dataset. The standard method proves to be problematic for some cases, where the element values are large or the range is large. These two algorithms agree UNTIL n becomes large, then the truncation errors start creeping in for the standard method, using a running sum of all the elements in the range. The numerically stable method avoids this error quite well. In this particular instance, the numerically stable version lags the standard method by 50% or so, but the fact that overflow is completely avoided may be of benefit in some cases when dealing with large datasets or numbers that are large or perhaps even both.

Code: QB64: [Select]

ntest& = 0
DO
    ntest& = ntest& + 1
    REDIM a(0 TO ntest&) AS DOUBLE
    FOR q& = LBOUND(a) TO UBOUND(a)
        a(q&) = ntest& / 2 + .125
    NEXT
    s0! = TIMER(.001)
    AverageArrayNS a(), LBOUND(a), ntest&, average#
    f0! = TIMER(.001)
    PRINT "numerically stable "; ntest&; average#; f0! - s0!
    s1! = TIMER(.001)
    AverageStandardX a(), LBOUND(a), ntest&, averagx#
    f1! = TIMER(.001)
    PRINT "standard not stable"; ntest&; averagx#; f1! - s1!
    ntest& = ntest& * 2
LOOP
 
'****************
'* this method is subject to truncation/overflow errors -- fails at large N.
'****************
SUB AverageStandardX (a() AS DOUBLE, start AS LONG, finish AS LONG, a#)
sum# = 0
FOR c& = start TO finish
    sum# = sum# + a(c&)
NEXT
a# = sum# / (finish - start + 1)
END SUB
 
'****************
'* a numerically stable way to calculate the average of elements in an array without the dreaded overflow.
'****************
SUB AverageArrayNS (a() AS DOUBLE, start AS LONG, finish AS LONG, average AS DOUBLE)
DIM aa_int AS DOUBLE: aa_int = 0
DIM aa_temp AS DOUBLE: aa_temp = 0
DIM StatN AS LONG: StatN = finish - start + 1
 
FOR c& = start TO finish
    aa_temp = a(c&) + aa_temp
    DO
        IF aa_temp < StatN THEN
            EXIT DO
        ELSE
            aa_int = aa_int + 1
            aa_temp = aa_temp - StatN
        END IF
    LOOP
NEXT
average = aa_int + aa_temp / StatN
END SUB
 

Just another perhaps boring but useful algo to add to your code arsenal if overflow is to be avoided at all costs and results MUST be accurate.

News:

Author Topic: numerically stable average (Read 6807 times)

codeguy

numerically stable average