Welcome to MilkyWay@home

Optimized Nvidia Applications

Message boards : Number crunching : Optimized Nvidia Applications
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile [AF>HFR>RR] Gaulois952

Send message
Joined: 28 Sep 08
Posts: 4
Credit: 110,206,704
RAC: 0
Message 14722 - Posted: 10 Mar 2009, 13:45:56 UTC

Everything is in the title, what is the state of progress of this optimized application?
ID: 14722 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 29 Aug 07
Posts: 486
Credit: 576,548,171
RAC: 43
Message 14740 - Posted: 10 Mar 2009, 15:05:44 UTC

Good Question, I have 10 200 Series GTX's ready to go when the Application is ready to go ... :)
ID: 14740 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
jedirock
Avatar

Send message
Joined: 8 Nov 08
Posts: 178
Credit: 6,140,854
RAC: 0
Message 14741 - Posted: 10 Mar 2009, 15:06:27 UTC - in response to Message 14722.  

As I understand, even if a CUDA application were to be released, it wouldn't provide nearly the same crunching power as the current ATI cards. I've read statements by both Cluster Physik and the admin of GPUGRID saying that. Essentially, as I understand it, ATI cards have many more units that can handle double-precision calculations than the comparable GT200 based cards. As for the actual status of the application... Travis?
ID: 14741 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Cluster Physik

Send message
Joined: 26 Jul 08
Posts: 627
Credit: 94,940,203
RAC: 0
Message 14748 - Posted: 10 Mar 2009, 16:50:46 UTC - in response to Message 14741.  

As I understand, even if a CUDA application were to be released, it wouldn't provide nearly the same crunching power as the current ATI cards. I've read statements by both Cluster Physik and the admin of GPUGRID saying that. Essentially, as I understand it, ATI cards have many more units that can handle double-precision calculations than the comparable GT200 based cards. As for the actual status of the application... Travis?

My estimate would be that a GTX280 runs at about 40% the speed of a HD4870. To get above 50% one would have to do some magic ;)
But maybe nvidia handles the exp and sqrt functions exceptionally well (or the implementation on ATI is suboptimal), we will see.
ID: 14748 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Westsail and *Pyxey*
Avatar

Send message
Joined: 22 Mar 08
Posts: 65
Credit: 15,715,071
RAC: 0
Message 14848 - Posted: 11 Mar 2009, 1:01:35 UTC

Well, lets's get an app that will run on both and have a go. My money is still on Nvdia at the finish line.
ID: 14848 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Zanth
Avatar

Send message
Joined: 18 Feb 09
Posts: 158
Credit: 110,699,054
RAC: 0
Message 14856 - Posted: 11 Mar 2009, 2:09:04 UTC - in response to Message 14748.  

Then why do the folding@home nVidia apps absolutely crush the ATI apps? My 4850 could barely chug out 1000 ppd there, while my 9800GTX+ does around 5000 ppd. Also, if the ATI cards are so much more powerful for this, why does GPUGRID not have something like what you guys have for ATI cards reather than just crunching on the PS3(nVidia) and nVidia GPUs? I'm not trying to criticize or question your knowlegde, I genuinely want to understand. :)
ID: 14856 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Bruce
Avatar

Send message
Joined: 28 Apr 08
Posts: 1415
Credit: 2,716,428
RAC: 0
Message 14859 - Posted: 11 Mar 2009, 2:15:36 UTC
Last modified: 11 Mar 2009, 2:22:13 UTC

I'm hoping for an nVidia stock App.Would it be a bit faster than stock cpu App?
Seti-beta's gpu is aprox. 4 x faster than stock cpu app.
ID: 14859 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
jedirock
Avatar

Send message
Joined: 8 Nov 08
Posts: 178
Credit: 6,140,854
RAC: 0
Message 14864 - Posted: 11 Mar 2009, 2:40:14 UTC - in response to Message 14856.  

Then why do the folding@home nVidia apps absolutely crush the ATI apps? My 4850 could barely chug out 1000 ppd there, while my 9800GTX+ does around 5000 ppd. Also, if the ATI cards are so much more powerful for this, why does GPUGRID not have something like what you guys have for ATI cards reather than just crunching on the PS3(nVidia) and nVidia GPUs? I'm not trying to criticize or question your knowlegde, I genuinely want to understand. :)

Because both Folding@home and GPUGRID probably use single-precision calculations. While Nvidia packed their latest GT200 GPUs and even their last-gen G92 GPUs with shaders, most are capable of only single-precision calculations. So while they can probably do more single-precision calculations, ATI cards rock out over double-precision, which is what's used at MW.

Note that all the info above is just gathered by my reading stuff. Anyone else feel free to correct me.
ID: 14864 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Cluster Physik

Send message
Joined: 26 Jul 08
Posts: 627
Credit: 94,940,203
RAC: 0
Message 14875 - Posted: 11 Mar 2009, 3:57:37 UTC - in response to Message 14856.  

Then why do the folding@home nVidia apps absolutely crush the ATI apps? My 4850 could barely chug out 1000 ppd there, while my 9800GTX+ does around 5000 ppd. Also, if the ATI cards are so much more powerful for this, why does GPUGRID not have something like what you guys have for ATI cards reather than just crunching on the PS3(nVidia) and nVidia GPUs? I'm not trying to criticize or question your knowlegde, I genuinely want to understand. :)

AFAIK the ATIs over at folding are held back by software issues. The resources of the GPU are not used that much (same as with SETIs CUDA app now, GPUGrid reaches about 3 times the flops on the same card :o).

The general problem of ATI cards for GPGPU computing is not the shader power, they have plenty of it (for single precision roughly the same or even slightly more than nvidia). The problem is that it's not easy to program for ATI with the Stream SDK and its documentation.

It's sometimes quite hard to implement an efficient solution. Some hardware features of the ATI GPUs can only be utilized if one resorts to a low level programming approach as they are not accesible through the high level Brook. For a somewhat larger project this might be too much effort and the sometimes patchy documentation doesn't help either. One has to admit that nvidia has done a good job with CUDA.

Hopefully ATI will do better with newer incarnations of the Stream SDK and the support of OpenCL arriving in 3 months or so.
ID: 14875 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile Zanth
Avatar

Send message
Joined: 18 Feb 09
Posts: 158
Credit: 110,699,054
RAC: 0
Message 14931 - Posted: 11 Mar 2009, 19:29:18 UTC - in response to Message 14875.  

Ahhh, that makes sense. Thank you very much for expalining things to me. :)
ID: 14931 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile [AF>HFR>RR] Gaulois952

Send message
Joined: 28 Sep 08
Posts: 4
Credit: 110,206,704
RAC: 0
Message 15178 - Posted: 13 Mar 2009, 10:49:56 UTC

Come on my GTX285 wait for the optimized application ^^
ID: 15178 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Optimized Nvidia Applications

©2024 Astroinformatics Group