Welcome to MilkyWay@home

Posts by bob

1) Message boards : News : New Separation Runs (Message 62806)
Posted 9 Dec 2014 by bob
Post:
It is not consistent but some of these runs crash the display driver, which the OS will catch and will then restart, the run will continue to execute, and finally post a result with a computational error.
2) Message boards : Number crunching : (Modified Fit) v 1.34 (opencl_ati_101) Large amount with Validate error or Completed, can't validate (Message 62407)
Posted 26 Sep 2014 by bob
Post:
Ditto. 54 of 57 of my error out are Modified Fit 1.34 Opencl
3) Message boards : Number crunching : Feeder Down (Message 62375)
Posted 23 Sep 2014 by bob
Post:
See the follow thread in the News area

New Version of Separation Modfit 1.34
4) Message boards : News : testing work generation with 'ps_separation_14_2s_null_3' (Message 54652)
Posted 3 Jun 2012 by bob
Post:
Per the very thoughful suggestion on another thread. Does any one have the faintest idea of the issue.

Here is the dump on just one of the 21 that ended the same way.

Does anyone know what this really means? Because I have no idea, but 21 task all pretty much aborted in the same manner.

Boinc 7.0.25
ATI 6590 card with 8.961.0.0 Driver
Windows XP Service Pack 3, 32 Bit.

Stderr output

<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
BOINC: parse gpu_opencl_dev_index 0
<search_application> milkyway_separation 1.02 Windows x86 double OpenCL </search_application>
Unrecognized XML in project preferences: max_gfx_cpu_pct
Skipping: 20
Skipping: /max_gfx_cpu_pct
Unrecognized XML in project preferences: allow_non_preferred_apps
Skipping: 1
Skipping: /allow_non_preferred_apps
Unrecognized XML in project preferences: nbody_graphics_poll_period
Skipping: 30
Skipping: /nbody_graphics_poll_period
Unrecognized XML in project preferences: nbody_graphics_float_speed
Skipping: 5
Skipping: /nbody_graphics_float_speed
Unrecognized XML in project preferences: nbody_graphics_textured_point_size
Skipping: 250
Skipping: /nbody_graphics_textured_point_size
Unrecognized XML in project preferences: nbody_graphics_point_point_size
Skipping: 40
Skipping: /nbody_graphics_point_point_size
BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.'
Using SSE3 path
Found 1 platform
Platform 0 information:
Name: ATI Stream
Version: OpenCL 1.1 ATI-Stream-v2.3 (451)
Vendor: Advanced Micro Devices, Inc.
Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'Cayman' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU)
Driver version: CAL 1.4.1546
Version: OpenCL 1.1 ATI-Stream-v2.3 (451)
Compute capability: 0.0
Max compute units: 22
Clock frequency: 825 Mhz
Global mem size: 1073741824
Local mem size: 32768
Max const buf size: 65536
Double extension: cl_amd_fp64
Build log:
--------------------------------------------------------------------------------
C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(201): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(243): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(272): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(279): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(287): error: invalid unroll
factor
#pragma unroll NSTREAM
^

5 errors detected in the compilation of "C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl".
&#208;@&#128;&#155;&#253;
--------------------------------------------------------------------------------
clBuildProgram: Build failure (-11): CL_BUILD_PROGRAM_FAILURE
Error building program from source (-11): CL_BUILD_PROGRAM_FAILURE
Error creating integral program from source
Failed to calculate likelihood
<background_integral> 1.#QNAN0000000000 </background_integral>
<stream_integral> 1.#QNAN0000000000 1.#QNAN0000000000 </stream_integral>
<background_likelihood> 1.#QNAN0000000000 </background_likelihood>
<stream_only_likelihood> 1.#QNAN0000000000 1.#QNAN0000000000 </stream_only_likelihood>
<search_likelihood> 1.#QNAN0000000000 </search_likelihood>
06:25:53 (3284): called boinc_finish

</stderr_txt>
]]>
5) Message boards : Number crunching : Computation error on new workunits - ps_seperation_14_2s_null_3_ .......... (Message 54642)
Posted 3 Jun 2012 by bob
Post:
Does anyone know what this really means? Because I have no idea, but 21 task all pretty much aborted in the same manner.

Stderr output

<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
BOINC: parse gpu_opencl_dev_index 0
<search_application> milkyway_separation 1.02 Windows x86 double OpenCL </search_application>
Unrecognized XML in project preferences: max_gfx_cpu_pct
Skipping: 20
Skipping: /max_gfx_cpu_pct
Unrecognized XML in project preferences: allow_non_preferred_apps
Skipping: 1
Skipping: /allow_non_preferred_apps
Unrecognized XML in project preferences: nbody_graphics_poll_period
Skipping: 30
Skipping: /nbody_graphics_poll_period
Unrecognized XML in project preferences: nbody_graphics_float_speed
Skipping: 5
Skipping: /nbody_graphics_float_speed
Unrecognized XML in project preferences: nbody_graphics_textured_point_size
Skipping: 250
Skipping: /nbody_graphics_textured_point_size
Unrecognized XML in project preferences: nbody_graphics_point_point_size
Skipping: 40
Skipping: /nbody_graphics_point_point_size
BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.'
Using SSE3 path
Found 1 platform
Platform 0 information:
Name: ATI Stream
Version: OpenCL 1.1 ATI-Stream-v2.3 (451)
Vendor: Advanced Micro Devices, Inc.
Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Profile: FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'Cayman' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU)
Driver version: CAL 1.4.1546
Version: OpenCL 1.1 ATI-Stream-v2.3 (451)
Compute capability: 0.0
Max compute units: 22
Clock frequency: 825 Mhz
Global mem size: 1073741824
Local mem size: 32768
Max const buf size: 65536
Double extension: cl_amd_fp64
Build log:
--------------------------------------------------------------------------------
C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(201): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(243): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(272): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(279): error: invalid unroll
factor
#pragma unroll NSTREAM
^

C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl(287): error: invalid unroll
factor
#pragma unroll NSTREAM
^

5 errors detected in the compilation of "C:\DOCUME~1\Beverly1\LOCALS~1\Temp\OCL3CD.tmp.cl".
&#208;@&#128;&#155;&#253;
--------------------------------------------------------------------------------
clBuildProgram: Build failure (-11): CL_BUILD_PROGRAM_FAILURE
Error building program from source (-11): CL_BUILD_PROGRAM_FAILURE
Error creating integral program from source
Failed to calculate likelihood
<background_integral> 1.#QNAN0000000000 </background_integral>
<stream_integral> 1.#QNAN0000000000 1.#QNAN0000000000 </stream_integral>
<background_likelihood> 1.#QNAN0000000000 </background_likelihood>
<stream_only_likelihood> 1.#QNAN0000000000 1.#QNAN0000000000 </stream_only_likelihood>
<search_likelihood> 1.#QNAN0000000000 </search_likelihood>
06:25:53 (3284): called boinc_finish

</stderr_txt>
]]>
6) Message boards : Number crunching : App_info.xml (Message 52526)
Posted 15 Jan 2012 by bob
Post:
I had an issue with racing fans on my ATI cards. After much trial and fail efforts, I decided that the only course of action was to manually set the fan speeds for the card (Via the Driver controls in Catalyst). I set my fan speeds to 95 percent, above this point the noise level was quite loud. GPU Temperature remain below 67 Degrees C (Shader Core all other sections are lower), which is acceptable. The constant noise level of the fan are much better than the changing fan noise of before.
7) Message boards : Number crunching : Server Out Of Work (Message 51771)
Posted 29 Nov 2011 by bob
Post:
Does anyone know what has been going on since since 22 November? Is any one going to post any information as to what has been happening?
8) Message boards : Number crunching : Error while computing v0.82 (ati14) (Message 50600)
Posted 8 Aug 2011 by bob
Post:
First thanks to everyone who has put forth a response to this issue. At this time my plan of action is to reduce the number of process that stay resident in memory instead of 80 percent of memory be used the system now sits between 55 and 62 percent of memory being used. All Boinc task are configured to only be in memory while they are running. Additionally the GPU card are configured to their factory defaults, with one exception the fans are locked at 95 percent of full speed. Where the system is noise is not an issue. I plan to run the system and see what if any issue pop up. Kashi comment about video buffer memory issue at this time makes the most sense.

I will keep this thread informed of the results.
9) Message boards : Number crunching : Error while computing v0.82 (ati14) (Message 50557)
Posted 6 Aug 2011 by bob
Post:
Each of the Card has 1GB of memory. At most I am using 75 percent of the Card memory. Each of the Cards is only running one GPU task. The ambient temp is approx 20 C, and the cards are between 60 and 70 C.

It seems to get worse if an when I suspend GPU operations from inside Boinc.

I have started to just shut down BOINC, when I need to perform some other task that are graphic intensive, and the problem does not happen.

It is looking more and more like a software issue.
10) Message boards : Number crunching : Error while computing v0.82 (ati14) (Message 50542)
Posted 5 Aug 2011 by bob
Post:
One of the first things I have done is a complete shut down and a cold restart. The problem still was present. So I deleted the ATI drivers and download new drivers from ATI and did a fresh install. I deleted then downloaded a new copy of ATI SDK and installed ATI SDK. I then did another shut down and cold restart. The system will process both Milkyway and Prime tasks. But every now and then it will just stop processing the Milkyway task, the Prime Tasks continue to function. When this happens I will shut down BOINC. I sometimes will see an orphaned Milkyway task in memory. I will have to manually kill this task. I will then restart BOINC and everything starts to work.
11) Message boards : Number crunching : Error while computing v0.82 (ati14) (Message 50539)
Posted 5 Aug 2011 by bob
Post:
I am getting a many GPU task starting to error out with the following

<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE2 path
Found 2 CAL devices
Chose device 1

Device target: CAL_TARGET_770
Revision: 2
CAL Version: 1.4.1457
Engine clock: 625 Mhz
Memory clock: 993 Mhz
GPU RAM: 1024
Wavefront size: 64
Double precision: CAL_TRUE
Compute shader: CAL_TRUE
Number SIMD: 10
Number shader engines: 1
Pitch alignment: 256
Surface alignment: 256
Max size 2D: { 8192, 8192 }

Failed to map resource: Operational error (CAL_RESULT_ERROR)
Failed to zero output buffer: No error (CAL_RESULT_ERROR)
Failed to create output buffer: No error (CAL_RESULT_ERROR)
Failed to map resource: Operational error (CAL_RESULT_ERROR)
Failed to zero output buffer: No error (CAL_RESULT_ERROR)
Failed to create out streams buffer: No error (CAL_RESULT_ERROR)
Failed to create buffers: No error (CAL_RESULT_ERROR)
Failed to release CAL resource: A handle parameter is invalid (CAL_RESULT_BAD_HANDLE)
Failed to release buffers: No error (CAL_RESULT_BAD_HANDLE)
Integral 0 time = 1.906686 s
Failed to calculate integral 0
03:32:58 (27728): called boinc_finish

</stderr_txt>
]]>


Any Idea of what the fix is. Just started.
12) Message boards : Number crunching : All WUs crunched & errored out (Message 50527)
Posted 4 Aug 2011 by bob
Post:
I have not seen this before.

Task error out

<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE2 path
Found 2 CAL devices
Chose device 0

Device target: CAL_TARGET_770
Revision: 2
CAL Version: 1.4.1457
Engine clock: 625 Mhz
Memory clock: 993 Mhz
GPU RAM: 1024
Wavefront size: 64
Double precision: CAL_TRUE
Compute shader: CAL_TRUE
Number SIMD: 10
Number shader engines: 1
Pitch alignment: 256
Surface alignment: 256
Max size 2D: { 8192, 8192 }

Estimated iteration time 396.578000 ms
Target frequency 30.000000 Hz, polling mode 1
Dividing into 11 chunks, initially sleeping for 0 ms
Integration range: { nu_steps = 640, mu_steps = 1600, r_steps = 1400 }
Using 11 chunk(s) with sizes: 144 144 144 144 144 144 144 144 144 144 160
Integration time = 565.331039 s, average per iteration = 883.329749 ms
Failed to map resource: Operational error (CAL_RESULT_ERROR)
Failed to release CAL resource
Failed to map resource: Operational error (CAL_RESULT_ERROR)
Failed to release CAL resource
Failed to map resource: Operational error (CAL_RESULT_ERROR)
Failed to release CAL resource
Failed to map resource: Operational error (CAL_RESULT_ERROR)
Failed to release CAL resource
Integral 0 time = 568.351234 s
Failed to calculate integral 0
02:17:41 (26300): called boinc_finish

</stderr_txt>
]]>

Has any one else seen this. Just started after the most recent recovery.
13) Message boards : Number crunching : Credit Discrepancies? (Message 50476)
Posted 1 Aug 2011 by bob
Post:
Are you sure about your dates, Since we just started August. But other than that. With the project servers being up and down due to HVAC system failure earlier in the month and then a shut down due to high ambient temperatures, I would suspect that their database is in a very confused state at this time. Given them some time and I am sure that it will all get fixed, the sys admin and database admin have been very good on this project.
14) Message boards : News : database problem fixed (Message 38560)
Posted 11 Apr 2010 by bob
Post:
Thank you, Sorry it messed up your weekend.
15) Message boards : Number crunching : Down for maintenance? (Message 37517)
Posted 18 Mar 2010 by bob
Post:
It's Spring Break! Maybe no one is at Castle Greyskull, all in FLA.




©2024 Astroinformatics Group