Task 325776

Task 325776

Name	de_modfit_84_bundle4_4s_south4s_bgset_4_1603804501_70906409_4
Workunit	2141556262
Created	24 Jan 2021, 10:21:46 UTC
Sent	24 Jan 2021, 10:30:22 UTC
Report deadline	5 Feb 2021, 10:30:22 UTC
Received	1 Feb 2021, 2:47:58 UTC
Server state	Over
Outcome	Validate error
Client state	Done
Exit status	0 (0x00000000)
Computer ID	880712
Run time	5 min 13 sec
CPU time	14 sec
Validate state	Invalid
Credit	0.00
Device peak FLOPS	353.26 GFLOPS
Application version	Milkyway@home Separation v1.46 (opencl_ati_101) windows_x86_64
Peak working set size	103.68 MB
Peak swap size	81.26 MB
Peak disk usage	0.01 MB
Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       AMD Accelerated Parallel Processing
  Version:    OpenCL 2.1 AMD-APP (3075.12)
  Vendor:     Advanced Micro Devices, Inc.
  Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'gfx902' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU)
Board: AMD Radeon(TM) RX Vega 11 Graphics
Driver version:      3075.12 (PAL,HSAIL)
Version:             OpenCL 1.2 AMD-APP (3075.12)
Compute capability:  0.0
Max compute units:   11
Clock frequency:     1251 Mhz
Global mem size:     3221225472
Local mem size:      32768
Max const buf size:  3221225472
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T1.cl:183:72: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* _ap_consts __attribute__((max_constant_size(18 * sizeof(real)))),
                                                                       ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T1.cl:185:62: warning: unknown attribute 'max_constant_size' ignored
                            __constant SC* sc __attribute__((max_constant_size(NSTREAM * sizeof(SC)))),
                                                             ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T1.cl:186:67: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* sg_dx __attribute__((max_constant_size(256 * sizeof(real)))),
                                                                  ^
3 warnings generated.

--------------------------------------------------------------------------------
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 14
Num chunks:     15
Chunk size:     39424
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Integration time: 70.613000 s. Average time per iteration = 220.665624 ms
Integral 0 time = 70.999180 s
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39424
Added area:     38248
Effective area: 78848
Initial wait:   13 ms
Integration time: 5.503242 s. Average time per iteration = 17.197630 ms
Integral 1 time = 5.549568 s
Running likelihood with 31815 stars
Likelihood time = 0.830759 s
<background_integral> 0.000047226682784 </background_integral>
<stream_integral>  2.539317486675064  30.754602391083296  75.494320236805720  0.000000000000000 </stream_integral>
<background_likelihood> -3.295755852554647 </background_likelihood>
<stream_only_likelihood>  -107.248845638318260  -3.304068963397155  -4.058820203056652  -229.400251087390220 </stream_only_likelihood>
<search_likelihood> -2.705069460765776 </search_likelihood>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       AMD Accelerated Parallel Processing
  Version:    OpenCL 2.1 AMD-APP (3075.12)
  Vendor:     Advanced Micro Devices, Inc.
  Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'gfx902' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU)
Board: AMD Radeon(TM) RX Vega 11 Graphics
Driver version:      3075.12 (PAL,HSAIL)
Version:             OpenCL 1.2 AMD-APP (3075.12)
Compute capability:  0.0
Max compute units:   11
Clock frequency:     1251 Mhz
Global mem size:     3221225472
Local mem size:      32768
Max const buf size:  3221225472
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T3.cl:183:72: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* _ap_consts __attribute__((max_constant_size(18 * sizeof(real)))),
                                                                       ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T3.cl:185:62: warning: unknown attribute 'max_constant_size' ignored
                            __constant SC* sc __attribute__((max_constant_size(NSTREAM * sizeof(SC)))),
                                                             ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T3.cl:186:67: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* sg_dx __attribute__((max_constant_size(256 * sizeof(real)))),
                                                                  ^
3 warnings generated.

--------------------------------------------------------------------------------
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 14
Num chunks:     15
Chunk size:     39424
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Integration time: 70.707428 s. Average time per iteration = 220.960712 ms
Integral 0 time = 71.074050 s
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39424
Added area:     38248
Effective area: 78848
Initial wait:   13 ms
Integration time: 5.491819 s. Average time per iteration = 17.161934 ms
Integral 1 time = 5.548479 s
Running likelihood with 31815 stars
Likelihood time = 0.851356 s
<background_integral1> 0.000047203548370 </background_integral1>
<stream_integral1>  2.899723586337735  30.116809827049643  104.832140180530830  0.000000000000000 </stream_integral1>
<background_likelihood1> -3.278479350740332 </background_likelihood1>
<stream_only_likelihood1>  -57.012507645594901  -3.421922136341429  -3.746750656220379  -230.835491475148300 </stream_only_likelihood1>
<search_likelihood1> -2.702937947212558 </search_likelihood1>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       AMD Accelerated Parallel Processing
  Version:    OpenCL 2.1 AMD-APP (3075.12)
  Vendor:     Advanced Micro Devices, Inc.
  Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'gfx902' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU)
Board: AMD Radeon(TM) RX Vega 11 Graphics
Driver version:      3075.12 (PAL,HSAIL)
Version:             OpenCL 1.2 AMD-APP (3075.12)
Compute capability:  0.0
Max compute units:   11
Clock frequency:     1251 Mhz
Global mem size:     3221225472
Local mem size:      32768
Max const buf size:  3221225472
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T5.cl:183:72: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* _ap_consts __attribute__((max_constant_size(18 * sizeof(real)))),
                                                                       ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T5.cl:185:62: warning: unknown attribute 'max_constant_size' ignored
                            __constant SC* sc __attribute__((max_constant_size(NSTREAM * sizeof(SC)))),
                                                             ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T5.cl:186:67: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* sg_dx __attribute__((max_constant_size(256 * sizeof(real)))),
                                                                  ^
3 warnings generated.

--------------------------------------------------------------------------------
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 14
Num chunks:     15
Chunk size:     39424
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Integration time: 70.551165 s. Average time per iteration = 220.472390 ms
Integral 0 time = 70.907756 s
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39424
Added area:     38248
Effective area: 78848
Initial wait:   13 ms
Integration time: 5.487461 s. Average time per iteration = 17.148315 ms
Integral 1 time = 5.532070 s
Running likelihood with 31815 stars
Likelihood time = 0.776328 s
<background_integral2> 0.000046460540788 </background_integral2>
<stream_integral2>  8.179644043842496  28.259344575282483  49.470551274295644  0.000000000000032 </stream_integral2>
<background_likelihood2> -3.281601133528056 </background_likelihood2>
<stream_only_likelihood2>  -25.063365030639698  -3.446447863288009  -4.234515402530359  -230.371652768094260 </stream_only_likelihood2>
<search_likelihood2> -2.711626813572428 </search_likelihood2>
Using SSE4.1 path
Found 2 platforms
Platform 0 information:
  Name:       AMD Accelerated Parallel Processing
  Version:    OpenCL 2.1 AMD-APP (3075.12)
  Vendor:     Advanced Micro Devices, Inc.
  Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices 
  Profile:    FULL_PROFILE
Platform 1 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 11.2.109
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_device_uuid
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 1 CL device
Device 'gfx902' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU)
Board: AMD Radeon(TM) RX Vega 11 Graphics
Driver version:      3075.12 (PAL,HSAIL)
Version:             OpenCL 1.2 AMD-APP (3075.12)
Compute capability:  0.0
Max compute units:   11
Clock frequency:     1251 Mhz
Global mem size:     3221225472
Local mem size:      32768
Max const buf size:  3221225472
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T7.cl:183:72: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* _ap_consts __attribute__((max_constant_size(18 * sizeof(real)))),
                                                                       ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T7.cl:185:62: warning: unknown attribute 'max_constant_size' ignored
                            __constant SC* sc __attribute__((max_constant_size(NSTREAM * sizeof(SC)))),
                                                             ^
C:\Users\hvuor\AppData\Local\Temp\\OCL4956T7.cl:186:67: warning: unknown attribute 'max_constant_size' ignored
                            __constant real* sg_dx __attribute__((max_constant_size(256 * sizeof(real)))),
                                                                  ^
3 warnings generated.

--------------------------------------------------------------------------------
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 14
Num chunks:     15
Chunk size:     39424
Added area:     31360
Effective area: 591360
Initial wait:   12 ms
Integration time: 70.489653 s. Average time per iteration = 220.280166 ms
Integral 0 time = 70.916444 s
Estimated AMD GPU GFLOP/s: 138 SP GFLOP/s, 28 DP FLOP/s
Warning: Bizarrely low flops (27). Defaulting to 100
Using a target frequency of 60.0
Using a block size of 2816 with 14 blocks/chunk
Using clWaitForEvents() for polling (mode -1)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     39424
Added area:     38248
Effective area: 78848
Initial wait:   13 ms
Integration time: 5.470321 s. Average time per iteration = 17.094752 ms
Integral 1 time = 5.517336 s
Running likelihood with 31815 stars
Likelihood time = 0.793003 s
<background_integral3> 0.000046282101911 </background_integral3>
<stream_integral3>  8.973412498699606  27.433672940205206  96.930352751279827  0.000000000000000 </stream_integral3>
<background_likelihood3> -3.284821653294561 </background_likelihood3>
<stream_only_likelihood3>  -22.121262365789111  -3.481216734959700  -3.842139474510653  -231.126853781374250 </stream_only_likelihood3>
<search_likelihood3> -2.704718729335344 </search_likelihood3>
02:22:22 (4956): called boinc_finish(0)

</stderr_txt>
]]>