Welcome to MilkyWay@home

79XX Dont Run

Message boards : Number crunching : 79XX Dont Run
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 7 · Next

AuthorMessage
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 52427 - Posted: 12 Jan 2012, 19:04:23 UTC

Another impressive factor .... I just had a driver reset, my fault I was messing with a few things. It all paused as the driver reset itself, and carried on like nothing happened - all WUs running at time of reset validated, and all back to normal without me doing a thing.

If thats not a one off fluke ... thats brilliant, no more getting up in the morning to find never ending WUs cycling for nothing for hours on end in the night as a result of a reset in the night.

Regards
Zy
ID: 52427 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Send message
Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 52432 - Posted: 12 Jan 2012, 22:26:12 UTC - in response to Message 52424.  


BTW, the page below needs to be corrected and updated:

http://milkyway.cs.rpi.edu/milkyway/gpu_list.php

The HD 69xx series isn't even listed (unless the top AMD listing of HD5800 is a typo) and no the GTX480 is not faster than the GTX580 according to the WU completion times in "Top Computers".
I'm guessing that since the ATI branding was dropped for 6xxx and the BOINC script seems to look for things with "ATI" in the name it never finds it.
ID: 52432 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile kashi

Send message
Joined: 30 Dec 07
Posts: 311
Credit: 149,490,184
RAC: 0
Message 52434 - Posted: 12 Jan 2012, 23:26:54 UTC - in response to Message 52424.  


It's interesting that even at this point the HD 7970 is running the NVidia optimized app at 3.5x faster than the top GTX580 card that I could find in the database. Pretty impressive and the HD 7970 also uses less power.....

Also interesting is the low CPU usage. Some other OpenCL GPU applications use a lot more CPU.
ID: 52434 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[boinc.at] Nowi

Send message
Joined: 22 Mar 09
Posts: 99
Credit: 503,422,495
RAC: 0
Message 52443 - Posted: 13 Jan 2012, 7:19:24 UTC - in response to Message 52424.  
Last modified: 13 Jan 2012, 7:28:47 UTC

Run Time is 52 secs with GPU set to 1125/1575, the max speed inside the driver, power slider is +5

I hope this is only the beginning without optimized applications. This is small better than the performance of a 6970 at stock, look here: http://milkyway.cs.rpi.edu/milkyway/results.php?hostid=357618&offset=0&show_names=0&state=3&appid= (Thanks to joe!)

It's interesting that even at this point the HD 7970 is running the NVidia optimized app at 3.5x faster than the top GTX580 card that I could find in the database. Pretty impressive and the HD 7970 also uses less power.


I´m not surprised. The customer versions of Fermi are crippled on the DP-performance. Good for this project! I´m interested in the comparison on SP-projects like collatz or primegrid or maybe in a long, long, long future GPUGrid.
ID: 52443 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 52445 - Posted: 13 Jan 2012, 8:09:05 UTC
Last modified: 13 Jan 2012, 8:09:46 UTC

Would be interesting to see how the NVidia OpenCL app runs on a 6970 .... that would give an early level playing field comparison. I suspect it would be very significantly slower.

Regards
Zy
ID: 52445 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 52449 - Posted: 13 Jan 2012, 10:55:32 UTC

I ran the cards overnight @1115/300 to see if low memory worked (I run my 5970s @770/175). The invalids came in, whilst still a very small number, they were still there.

At 1030am I changed it to 1125/1000 (default is 1125/1375), and the invalids have pretty well stopped - insignificant now. So its not a happy bunny down at 300, probably not surprising given the 4.3Bn transistor count.

So there is a sweet spot between 300 & 1000 on memory for MW on this NVidia app, dont yet know where, not worth fiddling more for the moment, going to just let it run @1125/1000. The latter is a safe start point for anyone getting a 7970.

For sure, staying down at 925 gpu is a travesty on this card, wack it up to 1125 immediately, and power slider to 20%. Then play with memory as needed. The power slider only raises the maximum TDP, it does not mean the extra power is always on just makes it available, so may as well wack it to the +20 and leave it there, that eliminates low power levels as a point of failure when dealing inside the driver settings.

Regards
Zy
ID: 52449 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Beyond
Avatar

Send message
Joined: 15 Jul 08
Posts: 383
Credit: 729,293,740
RAC: 0
Message 52458 - Posted: 13 Jan 2012, 16:02:50 UTC - in response to Message 52449.  

I ran the cards overnight @1115/300 to see if low memory worked (I run my 5970s @770/175). The invalids came in, whilst still a very small number, they were still there.

At 1030am I changed it to 1125/1000 (default is 1125/1375), and the invalids have pretty well stopped - insignificant now. So its not a happy bunny down at 300, probably not surprising given the 4.3Bn transistor count. Zy

Sometimes it takes a while for the new invalids to show up. All my machines are getting a 2-3% invalid rate. When I start MW on a new box there are no invalids for a while, then they start dribbling in. They're all of the sderr bug type and I think a lot of them get held in "pending" until they're finally marked as invalid. Even Matt's machines show the same problem and it's one of the 3 reasons I dropped MW for a while (along with the tiny queue and unreliable servers). Since the queue size is up I'm back to give it another try but the false invalids are unique to MW and quite irritating.
ID: 52458 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[PNT] Obix007

Send message
Joined: 31 Jul 09
Posts: 5
Credit: 228,708,123
RAC: 0
Message 52462 - Posted: 13 Jan 2012, 22:10:08 UTC

sesef managed to finish MW WU in 35-36 sec. we are testing now.
ID: 52462 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
sesef

Send message
Joined: 14 Nov 08
Posts: 16
Credit: 1,753,367
RAC: 0
Message 52464 - Posted: 14 Jan 2012, 0:04:24 UTC

with these drivers http://forums.guru3d.com/showthread.php?t=356972 i have significant speedup, but there is bad news :( After drivers update I can't control card with Afterburner :(
ID: 52464 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 52465 - Posted: 14 Jan 2012, 0:09:33 UTC

Try Sapphire Trixx - works on reference AMD cards

https://www.sapphireselectclub.com/ssc/TriXX/TriXX.aspx

Regards
Zy
ID: 52465 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [SETI.USA]Tank_Master

Send message
Joined: 2 Sep 07
Posts: 24
Credit: 26,244,596
RAC: 0
Message 52466 - Posted: 14 Jan 2012, 0:43:55 UTC

why not use the official driver from AMD's site dated 1/9/2012?
http://support.amd.com/us/kbarticles/Pages/radeon-7900-series.aspx
ID: 52466 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile cenit

Send message
Joined: 16 Mar 09
Posts: 58
Credit: 1,129,612
RAC: 0
Message 52472 - Posted: 14 Jan 2012, 11:16:32 UTC - in response to Message 52466.  

I'm trying to run the cuda_opencl app on my 6970 just for comparison.

I used the app_info described here and downloaded the app from the link posted, but boinc says

1/14/2012 12:19:06 PM | Milkyway@home | Sending scheduler request: To fetch work.
1/14/2012 12:19:06 PM | Milkyway@home | Requesting new tasks for ATI
1/14/2012 12:19:08 PM | Milkyway@home | Scheduler request failed: HTTP file not found
ID: 52472 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
288larsson
Avatar

Send message
Joined: 8 Dec 09
Posts: 14
Credit: 902,727,796
RAC: 0
Message 52478 - Posted: 14 Jan 2012, 13:01:54 UTC - in response to Message 52472.  

I'm trying to run the cuda_opencl app on my 6970 just for comparison.

I used the app_info described here and downloaded the app from the link posted, but boinc says

1/14/2012 12:19:06 PM | Milkyway@home | Sending scheduler request: To fetch work.
1/14/2012 12:19:06 PM | Milkyway@home | Requesting new tasks for ATI
1/14/2012 12:19:08 PM | Milkyway@home | Scheduler request failed: HTTP file not found


You can try it zydor wrote in another thread.
A detatch (not reset) and reattatch will do it.
ID: 52478 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile cenit

Send message
Joined: 16 Mar 09
Posts: 58
Credit: 1,129,612
RAC: 0
Message 52479 - Posted: 14 Jan 2012, 13:13:39 UTC - in response to Message 52478.  
Last modified: 14 Jan 2012, 13:35:57 UTC

I'm trying to run the cuda_opencl app on my 6970 just for comparison.

I used the app_info described here and downloaded the app from the link posted, but boinc says

1/14/2012 12:19:06 PM | Milkyway@home | Sending scheduler request: To fetch work.
1/14/2012 12:19:06 PM | Milkyway@home | Requesting new tasks for ATI
1/14/2012 12:19:08 PM | Milkyway@home | Scheduler request failed: HTTP file not found


You can try it zydor wrote in another thread.
A detatch (not reset) and reattatch will do it.



thanks

Renaming the app into _ati14 also did the trick (done also when detaching, so i'm not sure what benefitted more)

Anyway, this is my result: 1m 20s - 1m 25s, while also doing some cpu computation. CPU usage is 0, screen is a bit lagging.

Link to my host:
http://milkyway.cs.rpi.edu/milkyway/results.php?hostid=364107

On the contrary, I resumed working on milkyway for some wus and the times with stock (CAL) app are around 45-55 seconds, with 0 cpu usage and the screen absolutely better than when using the opencl app.

Let's hope that some parameter tuning within the opencl app could bring similar results to the cal app. Surely it will not be better than that, but at least maybe closer... Let's hope, also because the cal interface is deprecated and so this old app will soon stop working with newer drivers...
ID: 52479 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 52485 - Posted: 14 Jan 2012, 15:12:47 UTC
Last modified: 14 Jan 2012, 15:16:10 UTC

let's hope that some parameter tuning within the opencl app could bring similar results to the cal app


Sadly, in pure terms of OpenCL, that will not happen to the level past experiences may lead you to believe is possible, because OpenCL code is compiled by Vendor/OpenCL compilers. That means little detailed tweeking - if any - is possible such as diving in with assembler coding, as it is all brought back to a level playing field by the compiler. The latter makes sense when looking at what OpenCL is for - cross platform compatibility - but in the short term presents some issues in niche areas such as BOINC. Fundamental speed improvements will be (as such given tight coding in the first place) dependent on the effort and development of the compiler by the Vendors ......

You saw the difference in speed on the 6970 run compared to CAL, and the fact that the 7970 is doing it in 49-52 seconds, compared to 90 seconds+ with the 6970. That amply demonstrates the hardware leap forward by 7XXX as well as starkly showing the transition issues in terms of speed CAL/OpenCL.

As to the future .... well the new GCN architecture is not yet even started being properly brought into play, and it (realistically) promises much. It will however take time for developers to wrap their heads around the new Architecture and code around the whole new mix.

Its likely therefore that new BOINC AMD apps will take a speed hit as they will be OpenCL. I wouldnt rush off and knee jerk an NVidia buy just on the back of that however, as they will also face similar issues as the industry is moving to OpenCL as the "standard". Will there be "cheating" and "exceptions", of course there will be - its the nature of the beast - but such short term grabs for glory will only serve to mask the longer term underlying trend, the move to OpenCL across the industry.

All that together with the 28nm hardware from Red and Green Teams will make this year somewhat turbulent methinks :)

Regards
Zy
ID: 52485 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile cenit

Send message
Joined: 16 Mar 09
Posts: 58
Credit: 1,129,612
RAC: 0
Message 52487 - Posted: 14 Jan 2012, 15:28:55 UTC - in response to Message 52485.  

the situation is not so dark as you're depicting! AMD is surely pushing a lot on their compiler development, but also something else can be done by the projects, if they have the time and the ability (Matt seems a talented programmer!)
Surely, in the case of milkyway, where the code seems not so difficult, if the compiler is not giving best results, the kernel compilation can be done offline and then you can distribute a "hand-tuned" IL code. This was also something done by Gipsel back in the days, with the first version of the milkyway gpu app (Brook was not so good in those days, history is back ;) )

Anyway, a push back of 50 % with an app absolutely not tuned to AMD hardware seems not so bad. Even if without distributing IL tuned code, I think that some opencl compiler revisions and some tuning will cut this to no more than 20% without much effort.
ID: 52487 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 52488 - Posted: 14 Jan 2012, 15:34:04 UTC

We shall see what we shall see ..... hard to read the runes on this in absolute terms, as you point out talented programmers can do much to alleviate situations in many cases.

The underlying message should not be ignored though, whatever is done in the short term (my definition of that is circa 18-24 months), long term software trend is OpenCL in all its glory - a new set of goal posts we have to get used to like it or not.

Gonna be fun :)

Regards
Zy
ID: 52488 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Beyond
Avatar

Send message
Joined: 15 Jul 08
Posts: 383
Credit: 729,293,740
RAC: 0
Message 52490 - Posted: 14 Jan 2012, 15:42:01 UTC - in response to Message 52465.  

Try Sapphire Trixx - works on reference AMD cards
https://www.sapphireselectclub.com/ssc/TriXX/TriXX.aspx
Regards Zy

Zy, thanks for the link. Question: just tried this (4.1.0) on a box with 2 5850 cards (different brands) and it mostly seems to work except that whenever I set the custom fan profile for one card the other resets to automatic. Tried everything I can think of, any ideas? Afterburner 2.10 does not exhibit this problem.
ID: 52490 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Zydor
Avatar

Send message
Joined: 24 Feb 09
Posts: 620
Credit: 100,587,625
RAC: 0
Message 52492 - Posted: 14 Jan 2012, 16:16:32 UTC - in response to Message 52490.  

None - sorry - its a generalised utility that covers essentials, a throw away vender bribe rofl - so its kind of as is. Cant think of any workaround it might have for that.

Regards
Zy
ID: 52492 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile cenit

Send message
Joined: 16 Mar 09
Posts: 58
Credit: 1,129,612
RAC: 0
Message 52494 - Posted: 14 Jan 2012, 16:41:28 UTC - in response to Message 52488.  

We shall see what we shall see ..... hard to read the runes on this in absolute terms, as you point out talented programmers can do much to alleviate situations in many cases.

The underlying message should not be ignored though, whatever is done in the short term (my definition of that is circa 18-24 months), long term software trend is OpenCL in all its glory - a new set of goal posts we have to get used to like it or not.

Gonna be fun :)

Regards
Zy


in 18-24 months I think many more surprises can happen.
C++ AMP & similars can deliver much, if they keep up to promises!
ID: 52494 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 . . . 7 · Next

Message boards : Number crunching : 79XX Dont Run

©2024 Astroinformatics Group