Welcome to MilkyWay@home

All tasks end with Computation error or Validation error


Advanced search

Message boards : Number crunching : All tasks end with Computation error or Validation error
Message board moderation

To post messages, you must log in.

AuthorMessage
draken_p

Send message
Joined: 15 Jul 13
Posts: 3
Credit: 253,079
RAC: 0
100 thousand credit badge9 year member badge
Message 69777 - Posted: 10 May 2020, 7:42:52 UTC

I'm running this project on GPU only(set in project settings on site), since I've reserved all CPUs for Rosetta@home.
All my tasks running on computer 1 (Win 8.1 x64, i3-4360, 16GB, GTX1060 6GB, BOINC 7.16.5) seem to end with errors.
Tasks on computer 2 (Debian 4.19.98-1 amd64, Xeon2650Lv2, 32GB, P106-090(same as GTX1050 AFAIK) 3GB, boinctui 2.5.0) are validating OK.
GPUgrid.net and Einstein@home tasks are producing proper results.

I have tried resetting voltages and clockings on the GPU, updating/downgrading geforce-drivers, earlier BOINC version(7.14.2).
Nothing works.

What could be the cause of this?

One example of results:
<core_client_version>7.16.5</core_client_version>
<![CDATA[
<message>
Incorrect function.
 (0x1) - exit code 1 (0x1)</message>
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.2.131
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       Intel(R) OpenCL
  Version:    OpenCL 1.2 
  Vendor:     Intel(R) Corporation
  Extensions: cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      442.19
Version:             OpenCL 1.2 CUDA
Compute capability:  6.1
Max compute units:   10
Clock frequency:     1759 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1126 SP GFLOP/s, 141 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2560 with 21 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53760
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to update checkpoint file ('separation_checkpoint_tmp' to 'separation_checkpoint') (0): No error
Integration time: 59.597172 s. Average time per iteration = 186.241163 ms
Integral 0 time = 65.089794 s
Failed to calculate integral 0
Failed to calculate likelihood
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.2.131
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       Intel(R) OpenCL
  Version:    OpenCL 1.2 
  Vendor:     Intel(R) Corporation
  Extensions: cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      442.19
Version:             OpenCL 1.2 CUDA
Compute capability:  6.1
Max compute units:   10
Clock frequency:     1759 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1126 SP GFLOP/s, 141 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2560 with 21 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53760
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied
Integration time: 59.586858 s. Average time per iteration = 186.208933 ms
Integral 0 time = 65.441162 s
Failed to calculate integral 0
Failed to calculate likelihood
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.2.131
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       Intel(R) OpenCL
  Version:    OpenCL 1.2 
  Vendor:     Intel(R) Corporation
  Extensions: cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      442.19
Version:             OpenCL 1.2 CUDA
Compute capability:  6.1
Max compute units:   10
Clock frequency:     1759 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1126 SP GFLOP/s, 141 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2560 with 21 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53760
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied
Integration time: 59.187078 s. Average time per iteration = 184.959619 ms
Integral 0 time = 64.536701 s
Failed to calculate integral 0
Failed to calculate likelihood
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.2.131
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       Intel(R) OpenCL
  Version:    OpenCL 1.2 
  Vendor:     Intel(R) Corporation
  Extensions: cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_spir cl_intel_exec_by_local_thread cl_khr_depth_images cl_khr_3d_image_writes cl_khr_fp64 cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'GeForce GTX 1060 6GB' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      442.19
Version:             OpenCL 1.2 CUDA
Compute capability:  6.1
Max compute units:   10
Clock frequency:     1759 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 1126 SP GFLOP/s, 141 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 2560 with 21 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 12 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 10
Num chunks:     11
Chunk size:     53760
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied
Integration time: 59.486543 s. Average time per iteration = 185.895446 ms
Integral 0 time = 65.173279 s
Failed to calculate integral 0
Failed to calculate likelihood
08:33:40 (6728): called boinc_finish(1)

</stderr_txt>
]]>


What does this "Incorrect function." mean? Is this "Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied" related?
ID: 69777 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
draken_p

Send message
Joined: 15 Jul 13
Posts: 3
Credit: 253,079
RAC: 0
100 thousand credit badge9 year member badge
Message 69823 - Posted: 15 May 2020, 16:05:13 UTC

Any ideas, anyone?
ID: 69823 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileKeith Myers
Avatar

Send message
Joined: 24 Jan 11
Posts: 640
Credit: 495,769,652
RAC: 166,433
300 million credit badge11 year member badgeextraordinary contributions badge
Message 69824 - Posted: 15 May 2020, 19:30:40 UTC

The problem is apparent right there in the stderr.txt output.

Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to move file 'separation_checkpoint_tmp' to 'separation_checkpoint' (317): (null)Failed to update checkpoint file ('separation_checkpoint_tmp' to 'separation_checkpoint') (0):

Opening checkpoint 'separation_checkpoint_tmp' (13): Permission denied


You have a permission problem. Probably your AV is blocking access to BOINC's slot directories. Whitelist the BOINC data directory and all its sub-directories.
ID: 69824 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
draken_p

Send message
Joined: 15 Jul 13
Posts: 3
Credit: 253,079
RAC: 0
100 thousand credit badge9 year member badge
Message 69825 - Posted: 16 May 2020, 8:03:20 UTC - in response to Message 69824.  

That error could've been a bit clearer... No clue that "checkpoint" is a reference to the filesystem.

Nevertheless, that was the problem; Comodo was blocking access silently.
Thanks for helping. Crunching now OK(while waiting for the next GPUgrid.net batch).
ID: 69825 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : All tasks end with Computation error or Validation error

©2022 Astroinformatics Group