All tasks end with Computation error or Validation error
Author | Message |
---|---|
Joined: 15 Jul 13 Posts: 3 Credit: 253,079 RAC: 0 |
I'm running this project on GPU only (set in the project settings on the site), since I've reserved all CPUs for Rosetta@home. All my tasks running on computer 1 (Win 8.1 x64, i3-4360, 16GB, GTX1060 6GB, BOINC 7.16.5) seem to end with errors. Tasks on computer 2 (Debian 4.19.98-1 amd64, Xeon2650Lv2, 32GB, P106-090 (same as GTX1050 AFAIK) 3GB, boinctui 2.5.0) are validating OK. GPUgrid.net and Einstein@home tasks are producing proper results. I have tried resetting the GPU voltages and clocks, updating and downgrading the GeForce drivers, and an earlier BOINC version (7.14.2). Nothing works. What could be the cause of this? One example of the results: <core_client_version>7.16.5</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1)</message> <stderr_txt> <search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application> Reading preferences ended prematurely BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation' Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Switching to Parameter File 'astronomy_parameters.txt' <number_WUs> 4 </number_WUs> <number_params_per_WU> 26 </number_params_per_WU> Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.131 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 1.2 Vendor: Intel(R) Corporation Extensions: cl_khr_icd cl_khr_global_int32_base_atomics 
cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.19 Version: OpenCL 1.2 CUDA Compute capability: 6.1 Max compute units: 10 Clock frequency: 1759 Mhz Global mem size: 6442450944 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 1126 SP GFLOP/s, 141 DP FLOP/s Using a target frequency of 60.0 Using a block size of 2560 with 21 blocks/chunk Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 10 Num chunks: 11 Chunk size: 53760 Added area: 31360 Effective area: 591360 Initial wait: 12 ms Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move 
file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to update checkpoint file ('separation_checkpoint_tmp' to 'separation_checkpoint') (0): No error Integration time: 59.597172 s. Average time per iteration = 186.241163 ms Integral 0 time = 65.089794 s Failed to calculate integral 0 Failed to calculate likelihood Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.131 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 1.2 Vendor: Intel(R) Corporation Extensions: cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.19 Version: OpenCL 1.2 CUDA Compute capability: 6.1 Max compute units: 10 Clock frequency: 1759 Mhz Global mem size: 6442450944 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- 
Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 1126 SP GFLOP/s, 141 DP FLOP/s Using a target frequency of 60.0 Using a block size of 2560 with 21 blocks/chunk Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 10 Num chunks: 11 Chunk size: 53760 Added area: 31360 Effective area: 591360 Initial wait: 12 ms Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied Integration time: 59.586858 s. Average time per iteration = 186.208933 ms Integral 0 time = 65.441162 s Failed to calculate integral 0 Failed to calculate likelihood Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.131 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 1.2 Vendor: Intel(R) Corporation Extensions: cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 
'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.19 Version: OpenCL 1.2 CUDA Compute capability: 6.1 Max compute units: 10 Clock frequency: 1759 Mhz Global mem size: 6442450944 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 1126 SP GFLOP/s, 141 DP FLOP/s Using a target frequency of 60.0 Using a block size of 2560 with 21 blocks/chunk Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 10 Num chunks: 11 Chunk size: 53760 Added area: 31360 Effective area: 591360 Initial wait: 12 ms Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied Integration time: 59.187078 s. 
Average time per iteration = 184.959619 ms Integral 0 time = 64.536701 s Failed to calculate integral 0 Failed to calculate likelihood Using SSE4.1 path Found 2 platforms Platform 0 information: Name: NVIDIA CUDA Version: OpenCL 1.2 CUDA 10.2.131 Vendor: NVIDIA Corporation Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics Profile: FULL_PROFILE Platform 1 information: Name: Intel(R) OpenCL Version: OpenCL 1.2 Vendor: Intel(R) Corporation Extensions: cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU) Board: Driver version: 442.19 Version: OpenCL 1.2 CUDA Compute capability: 6.1 Max compute units: 10 Clock frequency: 1759 Mhz Global mem size: 6442450944 Local mem size: 49152 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- Estimated Nvidia GPU GFLOP/s: 1126 SP 
GFLOP/s, 141 DP FLOP/s Using a target frequency of 60.0 Using a block size of 2560 with 21 blocks/chunk Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0) Range: { nu_steps = 320, mu_steps = 800, r_steps = 700 } Iteration area: 560000 Chunk estimate: 10 Num chunks: 11 Chunk size: 53760 Added area: 31360 Effective area: 591360 Initial wait: 12 ms Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied Integration time: 59.486543 s. Average time per iteration = 185.895446 ms Integral 0 time = 65.173279 s Failed to calculate integral 0 Failed to calculate likelihood 08:33:40 (6728): called boinc_finish(1) </stderr_txt> ]]> What does this "Incorrect function." mean? Is this "Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied" related? |
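The failure messages buried in a stderr dump like the one in this post can be tallied with a short script, which makes the repeated checkpoint errors easier to spot. This is only a minimal sketch and not part of BOINC or MilkyWay@home: the function name `summarize_stderr` and the signature labels are hypothetical, and the regular expressions assume the exact message wording seen in the log quoted here.

```python
import re
from collections import Counter

# Failure signatures copied from the wording of the quoted stderr output.
# The labels on the left are invented for this illustration.
SIGNATURES = {
    "checkpoint_move_failed": re.compile(
        r"Failed to move file '([^']+)' to '([^']+)' \((\d+)\)"
    ),
    "permission_denied": re.compile(
        r"Opening checkpoint '([^']+)' \(13\): Permission denied"
    ),
    "integral_failed": re.compile(r"Failed to calculate integral \d+"),
}


def summarize_stderr(text: str) -> Counter:
    """Count how often each known failure signature occurs in the text."""
    counts = Counter()
    for name, pattern in SIGNATURES.items():
        counts[name] = len(pattern.findall(text))
    return counts


# A shortened excerpt in the same shape as the quoted log.
sample = (
    "Failed to move file 'separation_checkpoint_tmp' to "
    "'separation_checkpoint' (317): (null)"
    "Failed to move file 'separation_checkpoint_tmp' to "
    "'separation_checkpoint' (317): (null)"
    "Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied "
    "Failed to calculate integral 0 Failed to calculate likelihood"
)

for name, count in summarize_stderr(sample).items():
    print(f"{name}: {count}")
```

When the file-related signatures dominate the counts, as they do in this log, the file system is a more likely suspect than the GPU or its driver.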
Joined: 15 Jul 13 Posts: 3 Credit: 253,079 RAC: 0 |
Any ideas, anyone? |
Joined: 24 Jan 11 Posts: 712 Credit: 553,380,058 RAC: 54,931 |
The problem is apparent right there in the stderr.txt output: Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to update checkpoint file ('separation_checkpoint_tmp' to 'separation_checkpoint') (0): You have a permission problem. Most likely your antivirus (AV) software is blocking access to BOINC's slot directories. Whitelist the BOINC data directory and all of its subdirectories. |
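The file names in the log suggest the common write-to-a-temporary-file-then-rename checkpoint pattern, which is the step that fails when another program holds or blocks the files. The following is a minimal sketch of that pattern under stated assumptions, not the actual milkyway_separation code: the function `write_checkpoint`, its retry count, and its delay are hypothetical; only the two file names come from the log.

```python
import os
import time


def write_checkpoint(directory: str, data: bytes,
                     retries: int = 3, delay: float = 0.1) -> bool:
    """Write data to a temporary file, then move it over the real checkpoint.

    Returns True on success and False if every attempt fails.
    """
    tmp_path = os.path.join(directory, "separation_checkpoint_tmp")
    final_path = os.path.join(directory, "separation_checkpoint")

    for attempt in range(1, retries + 1):
        try:
            # Step 1: write the complete new state to the temporary file.
            with open(tmp_path, "wb") as fh:
                fh.write(data)
                fh.flush()
                os.fsync(fh.fileno())
            # Step 2: replace the old checkpoint in a single operation.
            # If another process (for example a security scanner) blocks
            # either file, this or the open() above raises OSError.
            os.replace(tmp_path, final_path)
            return True
        except OSError as exc:  # PermissionError is a subclass of OSError
            print(f"checkpoint attempt {attempt} failed: {exc}")
            time.sleep(delay)
    return False
```

Calling `write_checkpoint(slot_dir, state_bytes)` on a directory the process may freely write to should return True; on a directory where file access is blocked it prints one message per attempt and returns False, which resembles the run of repeated "Failed to move file" lines in the log.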
Joined: 15 Jul 13 Posts: 3 Credit: 253,079 RAC: 0 |
That error could've been a bit clearer... I had no clue that "checkpoint" referred to a file on the filesystem. Nevertheless, that was the problem: Comodo was silently blocking access. Thanks for helping. Crunching is OK now (while waiting for the next GPUgrid.net batch). |