Author Topic: Better Bench by Ed Davis (Read 21991 times)

bplus · « **on:** April 11, 2022, 04:15:53 pm »

Generic Basic code convert to QB64 by me for timing and Types:

Code: QB64: [Select]

$Checking:Off
DefLng A-Z
Dim As Double x, y, xx, yy, start
start = Timer(.001)
accum = 0
count = 0
While count < 1545
    leftedge = -420
    rightedge = 300
    topedge = 300
    bottomedge = -300
    xstep = 7
    ystep = 15
 
    maxiter = 200
 
    y0 = topedge
    While y0 > bottomedge
        x0 = leftedge
        While x0 < rightedge
            y = 0
            x = 0
            thechar = 32
            xx = 0
            yy = 0
            i = 0
            While i < maxiter And xx + yy <= 800
                xx = Int((x * x) / 200)
                yy = Int((y * y) / 200)
                If xx + yy > 800 Then
                    thechar = 48 + i
                    If i > 9 Then
                        thechar = 64
                    End If
                Else
                    temp = xx - yy + x0
                    If (x < 0 And y > 0) Or (x > 0 And y < 0) Then
                        y = (-1 * Int((-1 * x * y) / 100)) + y0  ' << this line was revised in later post
                    Else
                        y = Int(x * y / 100) + y0
                    End If
                    x = temp
                End If
 
                i = i + 1
            Wend
            x0 = x0 + xstep
            accum = accum + thechar
        Wend
        y0 = y0 - ystep
    Wend
 
    If count Mod 300 = 0 Then
        Print accum,
    End If
    count = count + 1
Wend
 
Print accum
Print Timer(.001) - start; " seconds"
 
'This is the output:
 
' 200574 60372774 120544974 180717174 240889374 301061574 309886830
 
 

I added DEFLNG A-Z and Dim as Double x, y, xx, yy
and I got below 10 secs on Timer with $Checking:Off, 9.61 was best time with everything turned off including Wifi.

Ed has QB64 down for 9.35 secs maybe on better machine...
http://basic4all.epizy.com/index.php?topic=21.msg166#msg166

Can anyone make a significant drop? FreeBasic at top at 1.09 secs

SMcNeill · « **Reply #1 on:** April 11, 2022, 04:41:57 pm »

y = (-1 * Int((-1 * x * y) / 100)) + y0 <-- isn't a negative * negative = positive? How is this any different from the ELSE case?

bplus · « **Reply #2 on:** April 11, 2022, 04:57:54 pm »

Honestly I haven't done any analysis of code yet. I just know the first time I tried I didn't get what I was supposed to and when I made the substitution, the output came out right.

Can FB be so much faster?

Hey! Ed's here!

bplus · « **Reply #3 on:** April 11, 2022, 05:01:12 pm »

Quote from: SMcNeill on April 11, 2022, 04:41:57 pm

y = (-1 * Int((-1 * x * y) / 100)) + y0 <-- isn't a negative * negative = positive? How is this any different from the ELSE case?

looks to me like this forces neg when one or the other is out of bounds but not both.

EDIT

Ed Davis · « **Reply #4 on:** April 11, 2022, 05:05:04 pm »

Quote from: SMcNeill on April 11, 2022, 04:41:57 pm

y = (-1 * Int((-1 * x * y) / 100)) + y0 <-- isn't a negative * negative = positive? How is this any different from the ELSE case?

Note this funky code:

Code: [Select]

    if (x < 0 and y > 0) or (x > 0 and y < 0) then
        y = (-1 * Int((-1 * x * y) / 100)) + y0
    else
        y = int(x * y / 100) + y0
    end if

When I originally developed this, I could not get the same results with Python that I got with C.

It turns out that there is no consensus as to whether integer division, when one of the operands is negative, should round towards zero, negative infinity, or positive infinity.

Python and Ruby both round towards negative infinity. C rounds towards zero.

In order to get the same output from each language (to verify that each language was essentially computing the same thing and doing similar work), I had to figure out how to preclude one of the operands from being negative.

The code checks, and if either x or y is < 0 (but not both), then it multiplies by minus one to force positive division, and then by minus one again at the end to restore the sign.

And I agree, there is probably a better way to do this. But at the time, I just wanted a solution that worked, and that is what my little mind came up with.

Note that later on, I found out that Python had a integer divide operator, so there was no need to go through the hoops I did. But, since I had trouble with my "and" and "or" logic and precedence in my own interpreter, I left it in as a sort of unit test :) And, it does give whatever language is doing this more work to chew on, and more opportunities for optimization.

jack · « **Reply #5 on:** April 11, 2022, 05:19:07 pm »

Hi bplus and Ed Davis
here's my result

Code: [Select]

QB64
 200574   60372774            120544974           180717174           240889374
 301061574          309886830
 4.637000000002445  seconds

FreeBasic
 200574        60372774      120544974     180717174     240889374     301061574     309886830
 2.118056500097737 seconds

well done bplus :)
note: QB64 includes a lot of runtime checks which slowdown performance, even with $Checking:Off

SMcNeill · « **Reply #6 on:** April 11, 2022, 05:20:05 pm »

Best I can come up with, without trying to revert to mem or compiler flags, is about 4.9 seconds on my laptop.

Code: QB64: [Select]

$Checking:Off
'DefLng A-Z
'Dim As Double x, y, xx, yy, start
_Define A-Z As DOUBLE
start = Timer(.001)
accum = 0
count = 0
While count < 1545
    leftedge = -420
    rightedge = 300
    topedge = 300
    bottomedge = -300
    xstep = 7
    ystep = 15
 
    maxiter = 200
 
    y0 = topedge
    While y0 > bottomedge
        x0 = leftedge
        While x0 < rightedge
            y = 0
            x = 0
            thechar = 32
            xx = 0
            yy = 0
            i = 0
            While i < maxiter And xx + yy <= 800
                xx = Int((x * x) / 200)
                yy = Int((y * y) / 200)
                If xx + yy > 800 Then
                    thechar = 48 + i
                    If i > 9 Then
                        thechar = 64
                    End If
                Else
                    temp = xx - yy + x0
                    If (x < 0 And y > 0) Or (x > 0 And y < 0) Then
                        y = (-Int((-x * y) / 100)) + y0 ' << this line was revised in later post
                    Else
                        y = Int(x * y / 100) + y0
                    End If
                    x = temp
                End If
 
                i = i + 1
            Wend
            x0 = x0 + xstep
            accum = accum + thechar
        Wend
        y0 = y0 - ystep
    Wend
 
    If count Mod 300 = 0 Then
        Print accum,
    End If
    count = count + 1
Wend
 
Print accum
Print Timer(.001) - start; " seconds"
 
'This is the output:
 
Print "Compared to expected output of: "
Print "200574 60372774 120544974 180717174 240889374 301061574 309886830"

Only 2 real changes here:

1) I made all variables _DOUBLE type. When it comes to optimizing speed, QB64 always seems to run better if you can stick with all variables of the same type -- either go all integers, or all floats, if you want speed. Mixing them seems to always slow things down.

2) I took out the negative multiplication and swapped to negation, which we always process fast. (-1 * x isn't as efficient as -x, for example.)

Nothing else stares out obviously to me as a means to run the code faster, without going to _MEM access or tweaking compiler options for -fastmath.

George McGinn · « **Reply #7 on:** April 11, 2022, 05:39:12 pm »

I get 5.533 seconds.

Ed Davis · « **Reply #8 on:** April 11, 2022, 05:43:41 pm »

Quote from: SMcNeill on April 11, 2022, 05:20:05 pm

Only 2 real changes here:

...

2) I took out the negative multiplication and swapped to negation, which we always process fast. (-1 * x isn't as efficient as -x, for example.)

But that change makes it a slightly different comparison between different languages.

I purposefully did not try to optimize it for any single language, but rather wanted the same code for each language.

For example, I could have used "i++" instead of "i = i + 1". Some really poor byte-code interpreters (some of mine :( ) do not recognize that these are the same, and generate:

Code: QB64: [Select]

inc [i]

For the former, and:

Code: QB64: [Select]

push [i]
add
pop [i]
 

For the latter. Definitely slower, as it has to go two extra times through the byte code interpreter loop.

I could have used "for" loops, which some languages optimize better than "while" loops.

But since all languages I was testing did not offer those things, I avoided them.

So the goal was not to write the fastest algorithm - but to write to write something to test that could be easily converted to many languages.

But of course, you are free to do what you want :) However, if you're going to compare it with another language processor, it would be more fair to optimize the code for that processor also.

Finally, not trying to cause trouble or anything like that. Just trying to explain the purpose of the code.

It is at best a silly diversion, and like all bench marks, should be taken with a grain of salt, or just completely ignored!

SMcNeill · « **Reply #9 on:** April 11, 2022, 05:49:55 pm »

Quote from: Ed Davis on April 11, 2022, 05:43:41 pm

I purposefully did not try to optimize it for any single language, but rather wanted the same code for each language.

All I was doing was as I was asked here:

Quote from: bplus on April 11, 2022, 04:15:53 pm

Can anyone make a significant drop?

It's hard to make a drop in speeds, without making any changes to the code to help optimize for speed. ;)

But, I agree -- you can't compare apples to pineapples. Speed tests between systems/languages should match so you can compare the performance between those systems/languages without skewing the results.

Ed Davis · « **Reply #10 on:** April 11, 2022, 06:10:07 pm »

Quote from: SMcNeill on April 11, 2022, 05:49:55 pm

All I was doing was as I was asked here:

Just to make sure: I appreciate your posts, and help that you so graciously freely provide.

And I appreciate this forum - lots of knowledgeable, helpful folks around here! And all the great example programs!

Thanks!

jack · « **Reply #11 on:** April 11, 2022, 08:30:11 pm »

in Linux x64 same PC I get
3.79 seconds with no optimization
3.58 seconds with -O2 optimization
3.06 seconds with -Ofast optimization

Aurel · « **Reply #12 on:** April 15, 2022, 06:49:12 am »

Here is my results with qb64 1.5v on win7 32bit

News:

Author Topic: Better Bench by Ed Davis (Read 21991 times)

bplus

Better Bench by Ed Davis

SMcNeill

Re: Better Bench by Ed Davis

bplus

Re: Better Bench by Ed Davis

bplus

Re: Better Bench by Ed Davis

Ed Davis

Re: Better Bench by Ed Davis

jack

Re: Better Bench by Ed Davis

SMcNeill

Re: Better Bench by Ed Davis

George McGinn

Re: Better Bench by Ed Davis

Ed Davis

Re: Better Bench by Ed Davis

SMcNeill

Re: Better Bench by Ed Davis

Ed Davis

Re: Better Bench by Ed Davis

jack

Re: Better Bench by Ed Davis

Aurel

Re: Better Bench by Ed Davis