Welcome to MilkyWay@home

MilkyWay CUDA takes up a whole core

Message boards : Number crunching : MilkyWay CUDA takes up a whole core

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51497 - Posted: 25 Oct 2011, 14:45:00 UTC

I noticed that MilkyWay CUDA takes up almost a whole core as you can see in this image (the 'img' tag didn't work with windows-skydrive).

Has anybody else seen this behaviour?

When another MilkyWay task runs on this system (it only runs the GPU tasks), I'll check again to see if it's a recurring problem.

Anthony.

The longer I live, the more reasons I develop for wanting to die.

Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist
Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 51503 - Posted: 25 Oct 2011, 15:25:03 UTC - in response to Message 51497.  

It's a known recent Nvidia driver problem where OpenCL things take up huge amounts of CPU time for no reason.

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51506 - Posted: 25 Oct 2011, 17:11:04 UTC - in response to Message 51503.  

If this is really a driver problem, do you know if Nvidia is working on it?
Or else would it be possible for MilkyWay to make different apps, 1 type for CUDA and 1 type for OpenCL?

Anthony.

The longer I live, the more reasons I develop for wanting to die.

colasanto
Joined: 10 Oct 11
Posts: 1
Credit: 278,285
RAC: 0
Message 51526 - Posted: 27 Oct 2011, 16:38:48 UTC

I'm having the exact same issue.

I don't allow any of my projects to use CPU time, since they have full use of my GPU. So, as you mention above, this comes down to an Nvidia driver bug?

I'm on the latest driver, 285.62. Has anyone seen or heard of any updates on this?

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51527 - Posted: 27 Oct 2011, 16:49:21 UTC

I noticed that it doesn't do this on every WU, but of the 4 or 5 I checked, 3 or 4 showed this behaviour.

Anthony.

The longer I live, the more reasons I develop for wanting to die.

Stephan Volkmann
Joined: 17 Mar 09
Posts: 7
Credit: 1,563,090
RAC: 0
Message 51530 - Posted: 28 Oct 2011, 16:36:04 UTC

Driver 280.13 shows this effect on all WUs, using 200% CPU (2 full cores) on my 2500K.

Danny Heinz
Joined: 4 May 08
Posts: 1
Credit: 4,919,985
RAC: 0
Message 51531 - Posted: 28 Oct 2011, 21:15:17 UTC

I had the same problem with nvidia driver 280 and 285. CUDA takes at least one whole core (if the cpu idles CUDA even takes 2 cores).

There are no problems with driver 266.

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51535 - Posted: 29 Oct 2011, 12:24:32 UTC - in response to Message 51531.  

I couldn't install driver 266 because it doesn't support both of my video cards (GF8600GT & GTX550Ti), so I tried driver 270.61, and that one seems to work too. No problems so far after checking about 5 WUs.

Anthony.

The longer I live, the more reasons I develop for wanting to die.

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51635 - Posted: 8 Nov 2011, 1:52:14 UTC

I just submitted a support request on NVIDIA's website about this problem and another one I have: the driver crashes when I suspend Collatz on my GF-8600GT, which is in the same system as my GTX-550Ti (the GTX-550Ti runs all my other GPU projects).

Anthony.

The longer I live, the more reasons I develop for wanting to die.

Amauri
Joined: 30 Jan 09
Posts: 21
Credit: 13,256,888
RAC: 0
Message 51686 - Posted: 13 Nov 2011, 6:13:39 UTC - in response to Message 51531.  

> I had the same problem with nvidia driver 280 and 285. CUDA takes at least one whole core (if the cpu idles CUDA even takes 2 cores).
>
> There are no problems with driver 266.


+1

Is there anything that could be changed in MW app to fix this?

arkayn
Joined: 14 Feb 09
Posts: 999
Credit: 74,932,619
RAC: 0
Message 51689 - Posted: 13 Nov 2011, 21:18:51 UTC - in response to Message 51686.  

>> I had the same problem with nvidia driver 280 and 285. CUDA takes at least one whole core (if the cpu idles CUDA even takes 2 cores).
>>
>> There are no problems with driver 266.
>
> +1
>
> Is there anything that could be changed in MW app to fix this?


Not as long as it is an OpenCL app.

Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist
Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 51716 - Posted: 18 Nov 2011, 22:01:36 UTC - in response to Message 51686.  

>> I had the same problem with nvidia driver 280 and 285. CUDA takes at least one whole core (if the cpu idles CUDA even takes 2 cores).
>>
>> There are no problems with driver 266.
>
> +1
>
> Is there anything that could be changed in MW app to fix this?

I'm pretty sure I can write a workaround for it.

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51720 - Posted: 18 Nov 2011, 22:17:28 UTC - in response to Message 51716.  

That would be great :)

BTW, how much memory does a GPU WU take, because maybe I could run more than 1 WU on a 1GB GTX-550Ti. But then I would also need an app_info.xml for it. On SETI and Einstein I already run 3 WUs concurrently.

Thanks in advance for the rewriting.

Anthony.

The longer I live, the more reasons I develop for wanting to die.

Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist
Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 51721 - Posted: 18 Nov 2011, 22:26:50 UTC - in response to Message 51720.  

> BTW, how much memory does a GPU WU take, because maybe I could run more than 1 WU on a 1GB GTX-550Ti.

It should take (mu * r * (number of streams + 1)) * (1 or 2) * sizeof(double) bytes. For a typical work unit this would be (1600 * 1400 * (3 + 1)) * 8 = 71,680,000 bytes, or about 68 MB (counting 1 MB as 2^20 bytes).
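
As a quick sanity check of that arithmetic, here is a minimal Python sketch of the formula. The function and parameter names are made up for illustration and are not taken from the MilkyWay@home source; it assumes sizeof(double) is 8 bytes and treats the "(1 or 2)" term as a plain multiplier that defaults to 1.

```python
def estimate_wu_gpu_bytes(mu, r, streams, buffer_factor=1, sizeof_double=8):
    """Estimate GPU memory for one work unit, in bytes.

    Hypothetical helper: implements
    (mu * r * (streams + 1)) * buffer_factor * sizeof(double),
    where buffer_factor stands in for the "(1 or 2)" term.
    """
    return mu * r * (streams + 1) * buffer_factor * sizeof_double


# Typical work unit from the post: mu = 1600, r = 1400, 3 streams.
typical_bytes = estimate_wu_gpu_bytes(1600, 1400, 3)
print(typical_bytes)                      # 71680000
print(round(typical_bytes / 2**20, 2))    # 68.36 (MB, with 1 MB = 2^20 bytes)
```

With the multiplier set to 2, the same work unit would need twice as much, roughly 137 MB.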

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51722 - Posted: 18 Nov 2011, 22:31:19 UTC - in response to Message 51721.  
Last modified: 18 Nov 2011, 22:39:28 UTC

So I would be able to run 14 WUs at once on my GPU, but I would limit it to 10 just in case. Now to get an app_info.xml.
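
The figure of 14 can be reproduced with the rough sketch below. It assumes the 71,680,000-byte estimate for a typical work unit from the previous post and that the full 1 GB (taken as 2^30 bytes) of the card is free, which will not quite be true in practice. The function name and the reserved_bytes parameter are hypothetical.

```python
def max_concurrent_wus(gpu_bytes, wu_bytes, reserved_bytes=0):
    """How many work units fit in GPU memory, by memory alone.

    Hypothetical helper: reserved_bytes is memory assumed to be taken
    by the driver, display, or other applications.
    """
    usable = max(gpu_bytes - reserved_bytes, 0)
    return usable // wu_bytes


WU_BYTES = 1600 * 1400 * (3 + 1) * 8   # typical work unit, 71,680,000 bytes
GPU_BYTES = 1024 ** 3                  # 1 GB card, assumed fully available

print(max_concurrent_wus(GPU_BYTES, WU_BYTES))  # 14
```

This only bounds the count by memory; it says nothing about whether the GPU has spare compute capacity for that many tasks.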

Anthony.

The longer I live, the more reasons I develop for wanting to die.

arkayn
Joined: 14 Feb 09
Posts: 999
Credit: 74,932,619
RAC: 0
Message 51725 - Posted: 18 Nov 2011, 23:08:08 UTC

You also have to take into account how much load 1 WU puts on the GPU. If you are already at around 80 to 90 percent, 2 at a time would be the max.

Otherwise you are slowing down all WUs needlessly.

This will have the appropriate app_info to start.
http://www.arkayn.us/forum/index.php?action=downloads;sa=view;down=33

Beyond
Joined: 15 Jul 08
Posts: 383
Credit: 729,293,740
RAC: 0
Message 51729 - Posted: 19 Nov 2011, 6:14:26 UTC

It's possible to achieve a small benefit by running 2 WUs per GPU because there is a small pause in GPU usage each time a WU nears completion. If 2 WUs are run in tandem, then the second one will use all available GPU during this period. Running more than 2 at a time will yield no additional benefit and has the potential to cause additional problems.

S@NL - Mellowman
Joined: 9 Dec 07
Posts: 65
Credit: 8,015,709
RAC: 0
Message 51829 - Posted: 4 Dec 2011, 17:49:07 UTC - in response to Message 51716.  

>> Is there anything that could be changed in MW app to fix this?
> I'm pretty sure I can write a workaround for it.


Any news on the workaround?

Anthony.


The longer I live, the more reasons I develop for wanting to die.

Norman_RKN
Joined: 2 Feb 08
Posts: 3
Credit: 6,013,119
RAC: 0
Message 51837 - Posted: 5 Dec 2011, 18:00:12 UTC

The GPU apps could run much, much longer than only a few minutes.
It's not good to hand out such short WUs. That's a very heavy load for the server(s), since a lot of WUs are being returned every minute :/

Please increase the calculation time for (1a) ATI and (1b) Nvidia by 50x to 100x; then the servers and databases will be very stable and can do the work step by step without trouble ;)

Norman

Amauri
Joined: 30 Jan 09
Posts: 21
Credit: 13,256,888
RAC: 0
Message 51853 - Posted: 7 Dec 2011, 6:58:09 UTC - in response to Message 51716.  

> I'm pretty sure I can write a workaround for it.


And I'm pretty sure you can do it. But... are you doing it?

©2024 Astroinformatics Group