Welcome to MilkyWay@home

Most of my WUs are getting Computation Error

Message boards : Number crunching : Most of my WUs are getting Computation Error
Message board moderation

To post messages, you must log in.

AuthorMessage
Mitch

Send message
Joined: 4 Dec 09
Posts: 6
Credit: 497,277,946
RAC: 0
Message 51157 - Posted: 21 Sep 2011, 8:28:02 UTC

Although it seems like some of my WUs complete ok, the majority get Computation Errors.

Any Ideas?

Stderr output

<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Found 1 platform(s)
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.1 CUDA 4.0.1
Vendor: NVIDIA Corporation
Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 2 CL device(s)
Device GeForce GTX 295 (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Driver version: 280.26
Version: OpenCL 1.0 CUDA
Compute capability: 1.3
Image support: CL_TRUE
Max compute units: 30
Clock frequency: 1476 Mhz
Global mem size: 919994368
Local mem size: 16384
Max const buf size: 65536
Double extension: MW_CL_KHR_FP64
Found a compute capability 1.3 device. Using -cl-nv-maxrregcount=32

Compiler flags:
-cl-mad-enable -cl-no-signed-zeros -cl-strict-aliasing -cl-finite-math-only -cl-nv-verbose -cl-nv-maxrregcount=32 -DDOUBLEPREC=1 -DFAST_H_PROB=1 -DAUX_BG_PROFILE=0 -DNSTREAM=3 -DCONVOLVE=120 -DR0=5.800000000000000 -DSUN_R0=8.500000000000000 -DQ_INV_SQR=3.429355281207133 -DUSE_IMAGES=0

Build status: CL_BUILD_SUCCESS
Build log:


Kernel work group info:
Work group size = 512
Kernel local mem size = 64
Compile work group size = { 0, 0, 0 }
Keeping chunk boundaries as multiples of 1920
Range: { nu_steps = 640, mu_steps = 1600, r_steps = 1400 }
Iteration area: 2240000
Chunk estimate: 146
Num chunks: 146
Added area: 640
Effective area: 2240640
Integration time: 683.318201 s. Average time per iteration = 1067.684689 ms
Integral 0 time = 684.528568 s
Kernel work group info:
Work group size = 512
Kernel local mem size = 64
Compile work group size = { 0, 0, 0 }
Keeping chunk boundaries as multiples of 1920
Range: { nu_steps = 640, mu_steps = 400, r_steps = 1400 }
Iteration area: 560000
Chunk estimate: 37
Num chunks: 37
Added area: 640
Effective area: 560640
Integration time: 311.750670 s. Average time per iteration = 487.110421 ms
Integral 1 time = 312.290463 s
Kernel work group info:
Work group size = 512
Kernel local mem size = 64
Compile work group size = { 0, 0, 0 }
Keeping chunk boundaries as multiples of 1920
Range: { nu_steps = 640, mu_steps = 400, r_steps = 1400 }
Iteration area: 560000
Chunk estimate: 37
Num chunks: 37
Added area: 640
Effective area: 560640
Failed to finish: CL_INVALID_COMMAND_QUEUE
Failed to run nu step: CL_INVALID_COMMAND_QUEUE
Integral 2 time = 51.434438 s
Failed to calculate integral 2
02:39:11 (3196): called boinc_finish

</stderr_txt>
]]>


ID: 51157 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matt Arsenault
Volunteer moderator
Project developer
Project tester
Project scientist

Send message
Joined: 8 May 10
Posts: 576
Credit: 15,979,383
RAC: 0
Message 51199 - Posted: 23 Sep 2011, 21:49:18 UTC - in response to Message 51157.  

The latest Nvidia drivers (280.26) seem to have some OpenCL issues. Try 275.33. Not sure if it's the driver's fault or mine yet.
ID: 51199 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mitch

Send message
Joined: 4 Dec 09
Posts: 6
Credit: 497,277,946
RAC: 0
Message 51202 - Posted: 24 Sep 2011, 13:10:25 UTC - in response to Message 51199.  

The latest Nvidia drivers (280.26) seem to have some OpenCL issues. Try 275.33. Not sure if it's the driver's fault or mine yet.


Although going to 275.33 has helped, I now get more successful completions, I am still getting 50% error rate.

Stderr output

<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Found 1 platform(s)
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.0 CUDA 4.0.1
Vendor: NVIDIA Corporation
Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device(s)
Device GeForce GTX 260 (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Driver version: 275.33
Version: OpenCL 1.0 CUDA
Compute capability: 1.3
Image support: CL_TRUE
Max compute units: 27
Clock frequency: 1408 Mhz
Global mem size: 883228672
Local mem size: 16384
Max const buf size: 65536
Double extension: MW_CL_KHR_FP64
Found a compute capability 1.3 device. Using -cl-nv-maxrregcount=32

Compiler flags:
-cl-mad-enable -cl-no-signed-zeros -cl-strict-aliasing -cl-finite-math-only -cl-nv-verbose -cl-nv-maxrregcount=32 -DDOUBLEPREC=1 -DFAST_H_PROB=1 -DAUX_BG_PROFILE=0 -DNSTREAM=2 -DCONVOLVE=120 -DR0=29.830428334172062 -DSUN_R0=8.500000000000000 -DQ_INV_SQR=9.412153878997035 -DUSE_IMAGES=0

Build status: CL_BUILD_SUCCESS
Build log:

ptxas info : Compiling entry function 'mu_sum_kernel' for 'sm_13'
ptxas info : Used 30 registers, 336+0 bytes lmem, 48+16 bytes smem, 22 bytes cmem[0], 60 bytes cmem[1], 4 bytes cmem[2], 4 bytes cmem[3], 4 bytes cmem[4], 4 bytes cmem[5], 4 bytes cmem[6]
Kernel work group info:
Work group size = 512
Kernel local mem size = 64
Compile work group size = { 0, 0, 0 }
Keeping chunk boundaries as multiples of 1728
Range: { nu_steps = 640, mu_steps = 1600, r_steps = 1400 }
Iteration area: 2240000
Chunk estimate: 163
Num chunks: 163
Added area: 1216
Effective area: 2241216
Integration time: 563.813601 s. Average time per iteration = 880.958752 ms
Integral 0 time = 564.994666 s
Likelihood time = 2.348108 s
Non-finite result
Failed to calculate likelihood
<background_integral> 0.000054630029497 </background_integral>
<stream_integral> 818.416014116782890 10.920919621834059 </stream_integral>
<background_likelihood> -1.#IND00000000000 </background_likelihood>
<stream_only_likelihood> 1.#QNAN0000000000 1.#QNAN0000000000 </stream_only_likelihood>
<search_likelihood> -3.324976656157873 </search_likelihood>
<search_application> milkywayathome_client separation 0.82 Windows x86_64 double OpenCL </search_application>
07:27:16 (4332): called boinc_finish

</stderr_txt>
]]>
ID: 51202 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Profile arkayn
Avatar

Send message
Joined: 14 Feb 09
Posts: 999
Credit: 74,932,619
RAC: 0
Message 51203 - Posted: 24 Sep 2011, 13:34:08 UTC

Try 266.58 instead, CUDA 4.0 is the most likely culprit.
ID: 51203 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mitch

Send message
Joined: 4 Dec 09
Posts: 6
Credit: 497,277,946
RAC: 0
Message 51220 - Posted: 24 Sep 2011, 22:32:51 UTC - in response to Message 51203.  

Try 266.58 instead, CUDA 4.0 is the most likely culprit.


Tried 266.58 and still getting errors, I really want to donate to this project, but only half my WUs seem to be successful.

Stderr output

<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Found 1 platform(s)
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.0 CUDA 3.2.1
Vendor: NVIDIA Corporation
Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 2 CL device(s)
Device GeForce GTX 295 (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Driver version: 266.58
Version: OpenCL 1.0 CUDA
Compute capability: 1.3
Image support: CL_TRUE
Max compute units: 30
Clock frequency: 1476 Mhz
Global mem size: 919994368
Local mem size: 16384
Max const buf size: 65536
Double extension: MW_CL_KHR_FP64
Found a compute capability 1.3 device. Using -cl-nv-maxrregcount=32

Compiler flags:
-cl-mad-enable -cl-no-signed-zeros -cl-strict-aliasing -cl-finite-math-only -cl-nv-verbose -cl-nv-maxrregcount=32 -DDOUBLEPREC=1 -DFAST_H_PROB=1 -DAUX_BG_PROFILE=0 -DNSTREAM=3 -DCONVOLVE=120 -DR0=8.000000000000000 -DSUN_R0=8.500000000000000 -DQ_INV_SQR=3.783304006281294 -DUSE_IMAGES=0

Build status: CL_BUILD_SUCCESS
Build log:

: Considering profile 'compute_13' for gpu='sm_13' in 'cuModuleLoadDataEx_4'
: Retrieving binary for 'cuModuleLoadDataEx_4', for gpu='sm_13', usage mode=' --verbose --maxrregcount 32 '
: Considering profile 'compute_13' for gpu='sm_13' in 'cuModuleLoadDataEx_4'
: Control flags for 'cuModuleLoadDataEx_4' disable search path
: Ptx binary found for 'cuModuleLoadDataEx_4', architecture='compute_13'
: Ptx compilation for 'cuModuleLoadDataEx_4', for gpu='sm_13', ocg options=' --verbose --maxrregcount 32 '
ptxas info : Compiling entry function 'mu_sum_kernel' for 'sm_13'
ptxas info : Used 31 registers, 484+0 bytes lmem, 48+16 bytes smem, 22 bytes cmem[0], 60 bytes cmem[1], 4 bytes cmem[2], 4 bytes cmem[3], 4 bytes cmem[4], 4 bytes cmem[5], 4 bytes cmem[6]
Kernel work group info:
Work group size = 512
Kernel local mem size = 64
Compile work group size = { 0, 0, 0 }
Keeping chunk boundaries as multiples of 1920
Range: { nu_steps = 640, mu_steps = 1600, r_steps = 1400 }
Iteration area: 2240000
Chunk estimate: 146
Num chunks: 146
Added area: 640
Effective area: 2240640
Failed to finish: CL_INVALID_COMMAND_QUEUE
Failed to run nu step: CL_INVALID_COMMAND_QUEUE
Integral 0 time = 66.026411 s
Failed to calculate integral 0
15:10:24 (3460): called boinc_finish

</stderr_txt>
]]>


Home | My Account | Message Boards


Copyright © 2011 AstroInformatics Group
ID: 51220 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mitch

Send message
Joined: 4 Dec 09
Posts: 6
Credit: 497,277,946
RAC: 0
Message 51224 - Posted: 26 Sep 2011, 1:54:32 UTC - in response to Message 51220.  

I shifted to Collatz and 100% are error free under 266.58, but few are successful under milky way.
ID: 51224 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
SupeRNovA

Send message
Joined: 12 Apr 11
Posts: 1
Credit: 253,867
RAC: 0
Message 51225 - Posted: 26 Sep 2011, 15:49:55 UTC

i use 266.58 no problem at all 4 x gtx 295
ID: 51225 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Mitch

Send message
Joined: 4 Dec 09
Posts: 6
Credit: 497,277,946
RAC: 0
Message 51228 - Posted: 26 Sep 2011, 23:55:48 UTC - in response to Message 51225.  

Still getting errors on milky way only. Win 7 Pro, Boinc ver 12.33,Nvidia 266.58.

It must me something on this machine. Other machines are fine and I changed video cards from a GTX 285 to a GTX 275 and I am still getting a lot of errors on milky way.

Stderr output

<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Found 1 platform(s)
Platform 0 information:
Name: NVIDIA CUDA
Version: OpenCL 1.0 CUDA 3.2.1
Vendor: NVIDIA Corporation
Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device(s)
Device GeForce GTX 275 (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Driver version: 266.58
Version: OpenCL 1.0 CUDA
Compute capability: 1.3
Image support: CL_TRUE
Max compute units: 30
Clock frequency: 1404 Mhz
Global mem size: 915800064
Local mem size: 16384
Max const buf size: 65536
Double extension: MW_CL_KHR_FP64
Found a compute capability 1.3 device. Using -cl-nv-maxrregcount=32

Compiler flags:
-cl-mad-enable -cl-no-signed-zeros -cl-strict-aliasing -cl-finite-math-only -cl-nv-verbose -cl-nv-maxrregcount=32 -DDOUBLEPREC=1 -DFAST_H_PROB=1 -DAUX_BG_PROFILE=0 -DNSTREAM=2 -DCONVOLVE=120 -DR0=7.143159167262644 -DSUN_R0=8.500000000000000 -DQ_INV_SQR=2.825765961912116 -DUSE_IMAGES=0

Build status: CL_BUILD_SUCCESS
Build log:

: Considering profile 'compute_13' for gpu='sm_13' in 'cuModuleLoadDataEx_4'
: Retrieving binary for 'cuModuleLoadDataEx_4', for gpu='sm_13', usage mode=' --verbose --maxrregcount 32 '
: Considering profile 'compute_13' for gpu='sm_13' in 'cuModuleLoadDataEx_4'
: Control flags for 'cuModuleLoadDataEx_4' disable search path
: Ptx binary found for 'cuModuleLoadDataEx_4', architecture='compute_13'
: Ptx compilation for 'cuModuleLoadDataEx_4', for gpu='sm_13', ocg options=' --verbose --maxrregcount 32 '
ptxas info : Compiling entry function 'mu_sum_kernel' for 'sm_13'
ptxas info : Used 30 registers, 336+0 bytes lmem, 48+16 bytes smem, 22 bytes cmem[0], 60 bytes cmem[1], 4 bytes cmem[2], 4 bytes cmem[3], 4 bytes cmem[4], 4 bytes cmem[5], 4 bytes cmem[6]
Kernel work group info:
Work group size = 512
Kernel local mem size = 64
Compile work group size = { 0, 0, 0 }
Keeping chunk boundaries as multiples of 1920
Range: { nu_steps = 640, mu_steps = 1600, r_steps = 1400 }
Iteration area: 2240000
Chunk estimate: 146
Num chunks: 146
Added area: 640
Effective area: 2240640
Integration time: 511.316939 s. Average time per iteration = 798.932718 ms
Integral 0 time = 512.492557 s
star points: No such file or directory
Failed to read star points item
Failed to read star points file
Failed to calculate likelihood
<background_integral> 0.000710532391470 </background_integral>
<stream_integral> 780.534422071872200 1918.594803171270100 </stream_integral>
<background_likelihood> 1.#QNAN0000000000 </background_likelihood>
<stream_only_likelihood> 1.#QNAN0000000000 1.#QNAN0000000000 </stream_only_likelihood>
<search_likelihood> 1.#QNAN0000000000 </search_likelihood>
<search_application> milkywayathome_client separation 0.82 Windows x86_64 double OpenCL </search_application>
18:37:15 (4332): called boinc_finish

</stderr_txt>
]]>


Home | My Account | Message Boards


Copyright © 2011 AstroInformatics Group
ID: 51228 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Most of my WUs are getting Computation Error

©2024 Astroinformatics Group