79XX Dont Run

Author	Message
Zydor Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0	Message 52427 - Posted: 12 Jan 2012, 19:04:23 UTC Another impressive factor .... I just had a driver reset, my fault I was messing with a few things. It all paused as the driver reset itself, and carried on like nothing happened - all WUs running at time of reset validated, and all back to normal without me doing a thing. If thats not a one off fluke ... thats brilliant, no more getting up in the morning to find never ending WUs cycling for nothing for hours on end in the night as a result of a reset in the night. Regards Zy ID: 52427 · Rating: 0 · rate: / Reply Quote

Matt Arsenault Volunteer moderator Project developer Project tester Project scientist Send message Joined: 8 May 10 Posts: 576 Credit: 15,979,383 RAC: 0	Message 52432 - Posted: 12 Jan 2012, 22:26:12 UTC - in response to Message 52424. BTW, the page below needs to be corrected and updated: http://milkyway.cs.rpi.edu/milkyway/gpu_list.php The HD 69xx series isn't even listed (unless the top AMD listing of HD5800 is a typo) and no the GTX480 is not faster than the GTX580 according to the WU completion times in "Top Computers". I'm guessing that since the ATI branding was dropped for 6xxx and the BOINC script seems to look for things with "ATI" in the name it never finds it. ID: 52432 · Rating: 0 · rate: / Reply Quote

kashi Send message Joined: 30 Dec 07 Posts: 311 Credit: 149,490,184 RAC: 0	Message 52434 - Posted: 12 Jan 2012, 23:26:54 UTC - in response to Message 52424. It's interesting that even at this point the HD 7970 is running the NVidia optimized app at 3.5x faster than the top GTX580 card that I could find in the database. Pretty impressive and the HD 7970 also uses less power..... Also interesting is the low CPU usage. Some other OpenCL GPU applications use a lot more CPU. ID: 52434 · Rating: 0 · rate: / Reply Quote

[boinc.at] Nowi Send message Joined: 22 Mar 09 Posts: 99 Credit: 503,422,495 RAC: 0	Message 52443 - Posted: 13 Jan 2012, 7:19:24 UTC - in response to Message 52424. Last modified: 13 Jan 2012, 7:28:47 UTC Run Time is 52 secs with GPU set to 1125/1575, the max speed inside the driver, power slider is +5 I hope this is only the beginning without optimized applications. This is small better than the performance of a 6970 at stock, look here: http://milkyway.cs.rpi.edu/milkyway/results.php?hostid=357618&offset=0&show_names=0&state=3&appid= (Thanks to joe!) It's interesting that even at this point the HD 7970 is running the NVidia optimized app at 3.5x faster than the top GTX580 card that I could find in the database. Pretty impressive and the HD 7970 also uses less power. IÂ´m not surprised. The customer versions of Fermi are crippled on the DP-performance. Good for this project! IÂ´m interested in the comparison on SP-projects like collatz or primegrid or maybe in a long, long, long future GPUGrid. ID: 52443 · Rating: 0 · rate: / Reply Quote

Zydor Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0	Message 52445 - Posted: 13 Jan 2012, 8:09:05 UTC Last modified: 13 Jan 2012, 8:09:46 UTC Would be interesting to see how the NVidia OpenCL app runs on a 6970 .... that would give an early level playing field comparison. I suspect it would be very significantly slower. Regards Zy ID: 52445 · Rating: 0 · rate: / Reply Quote

Zydor Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0	Message 52449 - Posted: 13 Jan 2012, 10:55:32 UTC I ran the cards overnight @1115/300 to see if low memory worked (I run my 5970s @770/175). The invalids came in, whilst still a very small number, they were still there. At 1030am I changed it to 1125/1000 (default is 1125/1375), and the invalids have pretty well stopped - insignificant now. So its not a happy bunny down at 300, probably not surprising given the 4.3Bn transistor count. So there is a sweet spot between 300 & 1000 on memory for MW on this NVidia app, dont yet know where, not worth fiddling more for the moment, going to just let it run @1125/1000. The latter is a safe start point for anyone getting a 7970. For sure, staying down at 925 gpu is a travesty on this card, wack it up to 1125 immediately, and power slider to 20%. Then play with memory as needed. The power slider only raises the maximum TDP, it does not mean the extra power is always on just makes it available, so may as well wack it to the +20 and leave it there, that eliminates low power levels as a point of failure when dealing inside the driver settings. Regards Zy ID: 52449 · Rating: 0 · rate: / Reply Quote

Beyond Send message Joined: 15 Jul 08 Posts: 384 Credit: 735,133,302 RAC: 37,116	Message 52458 - Posted: 13 Jan 2012, 16:02:50 UTC - in response to Message 52449. I ran the cards overnight @1115/300 to see if low memory worked (I run my 5970s @770/175). The invalids came in, whilst still a very small number, they were still there. At 1030am I changed it to 1125/1000 (default is 1125/1375), and the invalids have pretty well stopped - insignificant now. So its not a happy bunny down at 300, probably not surprising given the 4.3Bn transistor count. Zy Sometimes it takes a while for the new invalids to show up. All my machines are getting a 2-3% invalid rate. When I start MW on a new box there are no invalids for a while, then they start dribbling in. They're all of the sderr bug type and I think a lot of them get held in "pending" until they're finally marked as invalid. Even Matt's machines show the same problem and it's one of the 3 reasons I dropped MW for a while (along with the tiny queue and unreliable servers). Since the queue size is up I'm back to give it another try but the false invalids are unique to MW and quite irritating. ID: 52458 · Rating: 0 · rate: / Reply Quote

[PNT] Obix007 Send message Joined: 31 Jul 09 Posts: 5 Credit: 228,708,123 RAC: 0	Message 52462 - Posted: 13 Jan 2012, 22:10:08 UTC sesef managed to finish MW WU in 35-36 sec. we are testing now. ID: 52462 · Rating: 0 · rate: / Reply Quote

sesef Send message Joined: 14 Nov 08 Posts: 16 Credit: 1,753,367 RAC: 0	Message 52464 - Posted: 14 Jan 2012, 0:04:24 UTC with these drivers http://forums.guru3d.com/showthread.php?t=356972 i have significant speedup, but there is bad news :( After drivers update I can't control card with Afterburner :( ID: 52464 · Rating: 0 · rate: / Reply Quote

Zydor Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0	Message 52465 - Posted: 14 Jan 2012, 0:09:33 UTC Try Sapphire Trixx - works on reference AMD cards https://www.sapphireselectclub.com/ssc/TriXX/TriXX.aspx Regards Zy ID: 52465 · Rating: 0 · rate: / Reply Quote

[SETI.USA]Tank_Master Send message Joined: 2 Sep 07 Posts: 24 Credit: 26,244,596 RAC: 0	Message 52466 - Posted: 14 Jan 2012, 0:43:55 UTC why not use the official driver from AMD's site dated 1/9/2012? http://support.amd.com/us/kbarticles/Pages/radeon-7900-series.aspx ID: 52466 · Rating: 0 · rate: / Reply Quote

cenit Send message Joined: 16 Mar 09 Posts: 58 Credit: 1,129,612 RAC: 0	Message 52472 - Posted: 14 Jan 2012, 11:16:32 UTC - in response to Message 52466. I'm trying to run the cuda_opencl app on my 6970 just for comparison. I used the app_info described here and downloaded the app from the link posted, but boinc says 1/14/2012 12:19:06 PM \| Milkyway@home \| Sending scheduler request: To fetch work. 1/14/2012 12:19:06 PM \| Milkyway@home \| Requesting new tasks for ATI 1/14/2012 12:19:08 PM \| Milkyway@home \| Scheduler request failed: HTTP file not found ID: 52472 · Rating: 0 · rate: / Reply Quote

288larsson Send message Joined: 8 Dec 09 Posts: 14 Credit: 902,727,796 RAC: 0	Message 52478 - Posted: 14 Jan 2012, 13:01:54 UTC - in response to Message 52472. I'm trying to run the cuda_opencl app on my 6970 just for comparison. I used the app_info described here and downloaded the app from the link posted, but boinc says 1/14/2012 12:19:06 PM \| Milkyway@home \| Sending scheduler request: To fetch work. 1/14/2012 12:19:06 PM \| Milkyway@home \| Requesting new tasks for ATI 1/14/2012 12:19:08 PM \| Milkyway@home \| Scheduler request failed: HTTP file not found You can try it zydor wrote in another thread. A detatch (not reset) and reattatch will do it. ID: 52478 · Rating: 0 · rate: / Reply Quote

cenit Send message Joined: 16 Mar 09 Posts: 58 Credit: 1,129,612 RAC: 0	Message 52479 - Posted: 14 Jan 2012, 13:13:39 UTC - in response to Message 52478. Last modified: 14 Jan 2012, 13:35:57 UTC I'm trying to run the cuda_opencl app on my 6970 just for comparison. I used the app_info described here and downloaded the app from the link posted, but boinc says 1/14/2012 12:19:06 PM \| Milkyway@home \| Sending scheduler request: To fetch work. 1/14/2012 12:19:06 PM \| Milkyway@home \| Requesting new tasks for ATI 1/14/2012 12:19:08 PM \| Milkyway@home \| Scheduler request failed: HTTP file not found You can try it zydor wrote in another thread. A detatch (not reset) and reattatch will do it. thanks Renaming the app into _ati14 also did the trick (done also when detaching, so i'm not sure what benefitted more) Anyway, this is my result: 1m 20s - 1m 25s, while also doing some cpu computation. CPU usage is 0, screen is a bit lagging. Link to my host: http://milkyway.cs.rpi.edu/milkyway/results.php?hostid=364107 On the contrary, I resumed working on milkyway for some wus and the times with stock (CAL) app are around 45-55 seconds, with 0 cpu usage and the screen absolutely better than when using the opencl app. Let's hope that some parameter tuning within the opencl app could bring similar results to the cal app. Surely it will not be better than that, but at least maybe closer... Let's hope, also because the cal interface is deprecated and so this old app will soon stop working with newer drivers... ID: 52479 · Rating: 0 · rate: / Reply Quote

Zydor Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0	Message 52485 - Posted: 14 Jan 2012, 15:12:47 UTC Last modified: 14 Jan 2012, 15:16:10 UTC let's hope that some parameter tuning within the opencl app could bring similar results to the cal app Sadly, in pure terms of OpenCL, that will not happen to the level past experiences may lead you to believe is possible, because OpenCL code is compiled by Vendor/OpenCL compilers. That means little detailed tweeking - if any - is possible such as diving in with assembler coding, as it is all brought back to a level playing field by the compiler. The latter makes sense when looking at what OpenCL is for - cross platform compatibility - but in the short term presents some issues in niche areas such as BOINC. Fundamental speed improvements will be (as such given tight coding in the first place) dependent on the effort and development of the compiler by the Vendors ...... You saw the difference in speed on the 6970 run compared to CAL, and the fact that the 7970 is doing it in 49-52 seconds, compared to 90 seconds+ with the 6970. That amply demonstrates the hardware leap forward by 7XXX as well as starkly showing the transition issues in terms of speed CAL/OpenCL. As to the future .... well the new GCN architecture is not yet even started being properly brought into play, and it (realistically) promises much. It will however take time for developers to wrap their heads around the new Architecture and code around the whole new mix. Its likely therefore that new BOINC AMD apps will take a speed hit as they will be OpenCL. I wouldnt rush off and knee jerk an NVidia buy just on the back of that however, as they will also face similar issues as the industry is moving to OpenCL as the "standard". Will there be "cheating" and "exceptions", of course there will be - its the nature of the beast - but such short term grabs for glory will only serve to mask the longer term underlying trend, the move to OpenCL across the industry. All that together with the 28nm hardware from Red and Green Teams will make this year somewhat turbulent methinks :) Regards Zy ID: 52485 · Rating: 0 · rate: / Reply Quote

cenit Send message Joined: 16 Mar 09 Posts: 58 Credit: 1,129,612 RAC: 0	Message 52487 - Posted: 14 Jan 2012, 15:28:55 UTC - in response to Message 52485. the situation is not so dark as you're depicting! AMD is surely pushing a lot on their compiler development, but also something else can be done by the projects, if they have the time and the ability (Matt seems a talented programmer!) Surely, in the case of milkyway, where the code seems not so difficult, if the compiler is not giving best results, the kernel compilation can be done offline and then you can distribute a "hand-tuned" IL code. This was also something done by Gipsel back in the days, with the first version of the milkyway gpu app (Brook was not so good in those days, history is back ;) ) Anyway, a push back of 50 % with an app absolutely not tuned to AMD hardware seems not so bad. Even if without distributing IL tuned code, I think that some opencl compiler revisions and some tuning will cut this to no more than 20% without much effort. ID: 52487 · Rating: 0 · rate: / Reply Quote

Zydor Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0	Message 52488 - Posted: 14 Jan 2012, 15:34:04 UTC We shall see what we shall see ..... hard to read the runes on this in absolute terms, as you point out talented programmers can do much to alleviate situations in many cases. The underlying message should not be ignored though, whatever is done in the short term (my definition of that is circa 18-24 months), long term software trend is OpenCL in all its glory - a new set of goal posts we have to get used to like it or not. Gonna be fun :) Regards Zy ID: 52488 · Rating: 0 · rate: / Reply Quote

Beyond Send message Joined: 15 Jul 08 Posts: 384 Credit: 735,133,302 RAC: 37,116	Message 52490 - Posted: 14 Jan 2012, 15:42:01 UTC - in response to Message 52465. Try Sapphire Trixx - works on reference AMD cards https://www.sapphireselectclub.com/ssc/TriXX/TriXX.aspx Regards Zy Zy, thanks for the link. Question: just tried this (4.1.0) on a box with 2 5850 cards (different brands) and it mostly seems to work except that whenever I set the custom fan profile for one card the other resets to automatic. Tried everything I can think of, any ideas? Afterburner 2.10 does not exhibit this problem. ID: 52490 · Rating: 0 · rate: / Reply Quote

Zydor Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0	Message 52492 - Posted: 14 Jan 2012, 16:16:32 UTC - in response to Message 52490. None - sorry - its a generalised utility that covers essentials, a throw away vender bribe rofl - so its kind of as is. Cant think of any workaround it might have for that. Regards Zy ID: 52492 · Rating: 0 · rate: / Reply Quote

cenit Send message Joined: 16 Mar 09 Posts: 58 Credit: 1,129,612 RAC: 0	Message 52494 - Posted: 14 Jan 2012, 16:41:28 UTC - in response to Message 52488. We shall see what we shall see ..... hard to read the runes on this in absolute terms, as you point out talented programmers can do much to alleviate situations in many cases. The underlying message should not be ignored though, whatever is done in the short term (my definition of that is circa 18-24 months), long term software trend is OpenCL in all its glory - a new set of goal posts we have to get used to like it or not. Gonna be fun :) Regards Zy in 18-24 months I think many more surprises can happen. C++ AMP & similars can deliver much, if they keep up to promises! ID: 52494 · Rating: 0 · rate: / Reply Quote