Questions and Answers :
Windows :
Nvidia GPU tasks crashing after 2 seconds
Message board moderation
Author | Message |
---|---|
Send message Joined: 2 Oct 16 Posts: 2 Credit: 4,660,281 RAC: 0 |
Hi, I've recently tried to crunch a bit for Milkyway but encountered only issues. I got a lot of tasks, but except the 8CPU ones, they all failed. To be more precise, all the Nvidia tasks failed with the following error : <core_client_version>7.14.2</core_client_version> <![CDATA[ <message> Les caract�res g�n�riques (* ou�?) ont �t� sp�cifi�s de mani�re incorrecte ou en trop grand nombre. (0xd0) - exit code 208 (0xd0)</message> <stderr_txt> <search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application> Reading preferences ended prematurely BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 5 </number_WUs> <number_params_per_WU> 20 </number_params_per_WU> Using SSE4.1 path Found 1 platform Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.150 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 980M' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.74 Version: OpenCL 1.2 CUDA Compute capability: 5.2 Max compute units: 12 Clock frequency: 1126 Mhz Global mem size: 4294967296 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Error creating contextTrying to show unknown cl_int 208 (208): Unknown cl_int Error getting device and contextTrying to show unknown cl_int 208 (208): Unknown cl_int Failed to calculate likelihood Using SSE4.1 path Found 1 platform Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.150 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 980M' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.74 Version: OpenCL 1.2 CUDA Compute capability: 5.2 Max compute units: 12 Clock frequency: 1126 Mhz Global mem size: 4294967296 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Error creating contextTrying to show unknown cl_int 208 (208): Unknown cl_int Error getting device and contextTrying to show unknown cl_int 208 (208): Unknown cl_int Failed to calculate likelihood Using SSE4.1 path Found 1 platform Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.150 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 980M' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.74 Version: OpenCL 1.2 CUDA Compute capability: 5.2 Max compute units: 12 Clock frequency: 1126 Mhz Global mem size: 4294967296 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Error creating contextTrying to show unknown cl_int 208 (208): Unknown cl_int Error getting device and contextTrying to show unknown cl_int 208 (208): Unknown cl_int Failed to calculate likelihood Using SSE4.1 path Found 1 platform Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.150 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 980M' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.74 Version: OpenCL 1.2 CUDA Compute capability: 5.2 Max compute units: 12 Clock frequency: 1126 Mhz Global mem size: 4294967296 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Error creating contextTrying to show unknown cl_int 208 (208): Unknown cl_int Error getting device and contextTrying to show unknown cl_int 208 (208): Unknown cl_int Failed to calculate likelihood Using SSE4.1 path Found 1 platform Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.150 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 980M' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.74 Version: OpenCL 1.2 CUDA Compute capability: 5.2 Max compute units: 12 Clock frequency: 1126 Mhz Global mem size: 4294967296 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Error creating contextTrying to show unknown cl_int 208 (208): Unknown cl_int Error getting device and contextTrying to show unknown cl_int 208 (208): Unknown cl_int Failed to calculate likelihood 09:00:56 (22696): called boinc_finish(208) </stderr_txt> ]]> |
Send message Joined: 13 Oct 16 Posts: 112 Credit: 1,174,293,644 RAC: 0 |
Nevermind, I see it has some ~100 GFLOPS. |
Send message Joined: 2 Oct 16 Posts: 2 Credit: 4,660,281 RAC: 0 |
I think it has something to do with CUDA/Driver version but I have no way to be sure... |
Send message Joined: 8 May 09 Posts: 3315 Credit: 519,947,535 RAC: 22,188 |
I think it has something to do with CUDA/Driver version but I have no way to be sure... Roll it back to an older version and see if that works, I would go back a couple of versions. If the Server doesn't know about the driver version the units will crash, MW is not the fastest to update things. |
Send message Joined: 9 Dec 11 Posts: 38 Credit: 1,497,896,956 RAC: 0 |
I think it has something to do with CUDA/Driver version but I have no way to be sure... Well, seti@home has the same problem with the most recent drivers under Windows. I would roll the driver back. If memory serves from the seti project 436.x was the most recent driver that worked for crunching with Nvidia on Windows. |
Send message Joined: 24 Jan 11 Posts: 696 Credit: 540,041,007 RAC: 86,688 |
There was a problem with the most recent Nvidia drivers. Then we got Nvidia to fix them. Then Nvidia again released new drivers that don't have the previous required fix apparently. Rollback. |
Send message Joined: 7 May 14 Posts: 57 Credit: 201,115,648 RAC: 24,604 |
hi all made vid on youtube for multiple instances instruction's and at full load on a Radeon VII RADEON VII GIGABYTE// 3 Instances_ Milkyway@home WUs BOINC_ 3_instances https://www.youtube.com/watch?v=4xKy9wGKmz4 all the best and welcome to earth |
Send message Joined: 8 May 09 Posts: 3315 Credit: 519,947,535 RAC: 22,188 |
hi all made vid on youtube for multiple instances instruction's and at full load on a Radeon VII SPAM |
©2024 Astroinformatics Group