Welcome to MilkyWay@home

(unknown error) - exit code -5 (0xfffffffb)

rebirthman

Joined: 12 Jun 16
Posts: 3
Credit: 6,858,927
RAC: 0
Message 66142 - Posted: 28 Jan 2017, 10:56:42 UTC

Hello,

More than 150 WUs failed immediately (after 2-3 CPU seconds) with an unknown error during the last few days on one of my PCs.

Example:

http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=1951525908

May I kindly ask the project team or experts for support?

br Michael
mikey
Joined: 8 May 09
Posts: 3319
Credit: 520,341,059
RAC: 21,816
Message 66143 - Posted: 28 Jan 2017, 12:33:55 UTC - in response to Message 66142.  
Last modified: 28 Jan 2017, 12:36:39 UTC

Error code -5 means BOINC can't open a file. The problem could be on your end, or you may never have received the file it is looking for, so it couldn't be opened. If everything is working now, it probably wasn't on your end.

The error code 0xfffffffb is explained here:

http://serverfault.com/questions/197206/scheduled-task-last-run-result-0xfffffffb-works-when-running-from-command-p
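
As a side note on the two numbers in the thread title: -5 and 0xfffffffb are the same 32-bit value, read once as a signed and once as an unsigned integer. A minimal Python sketch of the conversion (the helper names are illustrative and not part of BOINC):

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as an unsigned value."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Reinterpret an unsigned 32-bit value as a signed exit code."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value is negative in two's complement.
    return value - 0x100000000 if value & 0x80000000 else value


print(hex(to_unsigned32(-5)))   # 0xfffffffb
print(to_signed32(0xFFFFFFFB))  # -5
```

So the hexadecimal form carries no extra information; it is just the way some tools display a negative exit code.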
rebirthman

Message 66144 - Posted: 29 Jan 2017, 0:57:01 UTC

Hello,

Which file are you referring to?

From the WU details I see the following:

Found 1 CL device
Device 'GeForce GTX 980 Ti' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board:
Driver version: 378.49
Version: OpenCL 1.2 CUDA
Compute capability: 5.2
Max compute units: 22
Clock frequency: 1240 Mhz
Global mem size: 6442450944
Local mem size: 49152
Max const buf size: 65536
Double extension: cl_khr_fp64
Error creating context (-5): CL_OUT_OF_RESOURCES
Error getting device and context (-5): CL_OUT_OF_RESOURCES
Failed to calculate likelihood

I'm not sure what this means, but it may not be a missing file after all.
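
For anyone reading similar output: the number in parentheses on the "Error ..." lines is an OpenCL status code, and -5 is the value the OpenCL headers assign to CL_OUT_OF_RESOURCES. That suggests the task's -5 exit code comes from OpenCL context creation. A minimal Python sketch for pulling these codes out of a task's stderr text follows. The function name and the small lookup table are illustrative, and the line format is assumed from the excerpt above, not taken from the MilkyWay@home source:

```python
import re

# Subset of status codes as defined in the Khronos OpenCL header (CL/cl.h).
OPENCL_STATUS = {
    0: "CL_SUCCESS",
    -1: "CL_DEVICE_NOT_FOUND",
    -2: "CL_DEVICE_NOT_AVAILABLE",
    -4: "CL_MEM_OBJECT_ALLOCATION_FAILURE",
    -5: "CL_OUT_OF_RESOURCES",
    -6: "CL_OUT_OF_HOST_MEMORY",
}

# Assumed line shape: "Error <what happened> (<code>): <name>"
ERROR_LINE = re.compile(r"^Error (?P<what>.+?) \((?P<code>-?\d+)\): (?P<name>\w+)$")


def find_opencl_errors(stderr_text):
    """Return (description, code, name) for each matching error line."""
    found = []
    for line in stderr_text.splitlines():
        match = ERROR_LINE.match(line.strip())
        if match:
            code = int(match.group("code"))
            # Prefer the table; fall back to the name printed in the log.
            name = OPENCL_STATUS.get(code, match.group("name"))
            found.append((match.group("what"), code, name))
    return found


sample = """Double extension: cl_khr_fp64
Error creating context (-5): CL_OUT_OF_RESOURCES
Error getting device and context (-5): CL_OUT_OF_RESOURCES
Failed to calculate likelihood"""

for what, code, name in find_opencl_errors(sample):
    print(f"{code} {name}: {what}")
```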

Thanks in advance for any help.

br Michael
alanb1951

Joined: 16 Mar 10
Posts: 210
Credit: 105,951,744
RAC: 24,881
Message 66145 - Posted: 29 Jan 2017, 5:47:01 UTC - in response to Message 66142.  

It appears you had a driver update from version 376.33 (which shows up in your valid results from earlier on 26 January, UTC) to version 378.49 (which shows up in your invalid results from later on 26 January and thereafter).

Your other machine listed here appears to be running an even earlier driver (version 372.90) and is not showing errors.

I notice that at SETI@Home the "problem" machine is using the Intel GPU rather than your NVIDIA GPU, so that's not showing the same problem. However, I wonder if something in BOINC is getting confused by having a pair of active GPUs that don't use the same driver for computing.

I'm not a Windows user myself, so I've no personal knowledge of the current state of NVIDIA drivers for Windows 10, or potential problems with multiple OpenCL providers and their [different?] drivers - the above is, therefore, a combination of observation and speculation. My only NVIDIA GPU is in a Linux laptop where it is the only OpenCL device enabled, and that has no problems (using a much older driver, though!)

Hope you manage to get it sorted soon.

Good luck - Al.
mikey
Message 66146 - Posted: 29 Jan 2017, 12:22:17 UTC - in response to Message 66144.  

That's the rub: it doesn't say which file is missing!

I would try rolling back the GPU driver to an earlier one that worked for you.
rebirthman

Message 66147 - Posted: 29 Jan 2017, 17:24:18 UTC

Hello,

I am back on driver 376.33, and WUs have been calculated correctly since then.

Thanks for the hint.

The project team might want to look into this for a general assessment, as I assume most participants will update sooner rather than later.

br Michael

©2024 Astroinformatics Group