Welcome to MilkyWay@home

Task 1665383

Name de_modfit_84_bundle4_4s_south4s_bgset_4_1603804501_71389584_5
Workunit 2142092558
Created 31 Jan 2021, 1:55:42 UTC
Sent 31 Jan 2021, 1:55:43 UTC
Report deadline 12 Feb 2021, 1:55:43 UTC
Received 31 Jan 2021, 10:44:06 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 741258
Run time 3 min 27 sec
CPU time 2 min 2 sec
Validate state Valid
Credit 244.01
Device peak FLOPS 1,294.21 GFLOPS
Application version Milkyway@home Separation v1.46 (opencl_nvidia_101) windows_x86_64
Peak working set size 331.99 MB
Peak swap size 336.07 MB
Peak disk usage 0.02 MB

Stderr output

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_separation 1.46 Windows x86 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' 
Switching to Parameter File 'astronomy_parameters.txt'
<number_WUs> 4 </number_WUs>
<number_params_per_WU> 26 </number_params_per_WU>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce RTX 2060' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      419.35
Version:             OpenCL 1.2 CUDA
Compute capability:  7.5
Max compute units:   30
Clock frequency:     1680 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_75'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 12 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 3
Num chunks:     4
Chunk size:     184320
Added area:     177280
Effective area: 737280
Initial wait:   15 ms
Integration time: 46.592618 s. Average time per iteration = 145.601932 ms
Integral 0 time = 46.856574 s
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     30720
Added area:     20840
Effective area: 61440
Initial wait:   0 ms
Integration time: 3.382243 s. Average time per iteration = 10.569509 ms
Integral 1 time = 3.427409 s
Running likelihood with 31815 stars
Likelihood time = 0.702839 s
<background_integral> 0.000047120962174 </background_integral>
<stream_integral>  4.823643222585946  37.528057360482137  72.456022670824538  0.000000000000000 </stream_integral>
<background_likelihood> -3.300131768063754 </background_likelihood>
<stream_only_likelihood>  -35.410789831291346  -3.356732606882309  -3.935320547201767  -233.551795548391510 </stream_only_likelihood>
<search_likelihood> -2.705510403140722 </search_likelihood>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce RTX 2060' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      419.35
Version:             OpenCL 1.2 CUDA
Compute capability:  7.5
Max compute units:   30
Clock frequency:     1680 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_75'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 12 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 3
Num chunks:     4
Chunk size:     184320
Added area:     177280
Effective area: 737280
Initial wait:   15 ms
Integration time: 46.728613 s. Average time per iteration = 146.026914 ms
Integral 0 time = 47.001394 s
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     30720
Added area:     20840
Effective area: 61440
Initial wait:   0 ms
Integration time: 3.398606 s. Average time per iteration = 10.620644 ms
Integral 1 time = 3.448593 s
Running likelihood with 31815 stars
Likelihood time = 0.719782 s
<background_integral1> 0.000047137156619 </background_integral1>
<stream_integral1>  1.502293030517428  23.577209876224487  78.649851464294827  0.000000000000000 </stream_integral1>
<background_likelihood1> -3.299555546183296 </background_likelihood1>
<stream_only_likelihood1>  -85.086530694669889  -3.464807991492788  -4.008626961796946  -230.633562522145780 </stream_only_likelihood1>
<search_likelihood1> -2.706986186141305 </search_likelihood1>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce RTX 2060' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      419.35
Version:             OpenCL 1.2 CUDA
Compute capability:  7.5
Max compute units:   30
Clock frequency:     1680 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_75'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 12 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 3
Num chunks:     4
Chunk size:     184320
Added area:     177280
Effective area: 737280
Initial wait:   15 ms
Integration time: 43.808259 s. Average time per iteration = 136.900811 ms
Integral 0 time = 44.062128 s
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     30720
Added area:     20840
Effective area: 61440
Initial wait:   0 ms
Integration time: 3.175056 s. Average time per iteration = 9.922048 ms
Integral 1 time = 3.220297 s
Running likelihood with 31815 stars
Likelihood time = 0.708886 s
<background_integral2> 0.000048391943766 </background_integral2>
<stream_integral2>  0.384172110264436  34.841014812486861  78.143567015357917  0.000000000000000 </stream_integral2>
<background_likelihood2> -3.337179294752301 </background_likelihood2>
<stream_only_likelihood2>  -116.953369151770100  -3.305643364072938  -3.862579733696698  -227.486482883892420 </stream_only_likelihood2>
<search_likelihood2> -2.713762968998046 </search_likelihood2>
Using SSE4.1 path
Found 1 platform
Platform 0 information:
  Name:       NVIDIA CUDA
  Version:    OpenCL 1.2 CUDA 10.1.120
  Vendor:     NVIDIA Corporation
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Profile:    FULL_PROFILE
Using device 0 on platform 0
Found 2 CL devices
Device 'GeForce RTX 2060' (NVIDIA Corporation:0x10de) (CL_DEVICE_TYPE_GPU)
Board: 
Driver version:      419.35
Version:             OpenCL 1.2 CUDA
Compute capability:  7.5
Max compute units:   30
Clock frequency:     1680 Mhz
Global mem size:     6442450944
Local mem size:      49152
Max const buf size:  65536
Double extension:    cl_khr_fp64
Build log:
--------------------------------------------------------------------------------

ptxas info    : 0 bytes gmem
ptxas info    : Compiling entry function 'probabilities' for 'sm_75'
ptxas info    : Function properties for probabilities
ptxas         .     0 bytes stack frame, 0 bytes spill stores, 0 bytes spill loads
ptxas info    : Used 128 registers, 420 bytes cmem[0], 144 bytes cmem[2]
--------------------------------------------------------------------------------
Build log:
--------------------------------------------------------------------------------


--------------------------------------------------------------------------------
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 12 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 15 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 800, r_steps = 700 }
Iteration area: 560000
Chunk estimate: 3
Num chunks:     4
Chunk size:     184320
Added area:     177280
Effective area: 737280
Initial wait:   15 ms
Integration time: 47.851298 s. Average time per iteration = 149.535307 ms
Integral 0 time = 48.118525 s
Estimated Nvidia GPU GFLOP/s: 3226 SP GFLOP/s, 403 DP FLOP/s
Using a target frequency of 60.0
Using a block size of 15360 with 2 blocks/chunk
Using clWaitForEvents() for polling with initial wait of 0 ms (mode 0)
Range:          { nu_steps = 320, mu_steps = 58, r_steps = 700 }
Iteration area: 40600
Chunk estimate: 1
Num chunks:     2
Chunk size:     30720
Added area:     20840
Effective area: 61440
Initial wait:   0 ms
Integration time: 3.491038 s. Average time per iteration = 10.909492 ms
Integral 1 time = 3.535286 s
Running likelihood with 31815 stars
Likelihood time = 0.734523 s
<background_integral3> 0.000047908247193 </background_integral3>
<stream_integral3>  3.239268693260050  32.098857353343007  86.536339338134042  0.000000000025260 </stream_integral3>
<background_likelihood3> -3.354679791431713 </background_likelihood3>
<stream_only_likelihood3>  -64.301175049515678  -3.279834395613916  -3.845491775402681  -226.354531679912410 </stream_only_likelihood3>
<search_likelihood3> -2.706626800225428 </search_likelihood3>
05:42:30 (3736): called boinc_finish(0)

</stderr_txt>
]]>
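
The chunk figures in the log above appear to follow from simple arithmetic on the step counts and the block size. The sketch below reproduces them for both integrals. The function name `chunk_layout` is hypothetical, and the relations are assumptions inferred from the logged numbers, not taken from the milkyway_separation source.

```python
import math


def chunk_layout(mu_steps, r_steps, block_size, blocks_per_chunk):
    """Reproduce the per-integral chunk figures printed in the log.

    Assumed relations (inferred from the logged values only):
      iteration area = mu_steps * r_steps
      chunk size     = block_size * blocks_per_chunk
      num chunks     = ceil(iteration area / chunk size)
      effective area = num chunks * chunk size
      added area     = effective area - iteration area
    """
    iteration_area = mu_steps * r_steps
    chunk_size = block_size * blocks_per_chunk
    num_chunks = math.ceil(iteration_area / chunk_size)
    effective_area = num_chunks * chunk_size
    return {
        "iteration_area": iteration_area,
        "chunk_size": chunk_size,
        "num_chunks": num_chunks,
        "effective_area": effective_area,
        "added_area": effective_area - iteration_area,
    }


# Integral 0: mu_steps = 800, r_steps = 700, 12 blocks of 15360 per chunk.
integral_0 = chunk_layout(800, 700, 15360, 12)
# Integral 1: mu_steps = 58, r_steps = 700, 2 blocks of 15360 per chunk.
integral_1 = chunk_layout(58, 700, 15360, 2)

# The logged "Average time per iteration" matches integration time / nu_steps.
avg_ms_integral_0 = 46.592618 / 320 * 1000.0

print(integral_0)
print(integral_1)
print(avg_ms_integral_0)
```

Under these assumptions, "Added area" is padding: the work is rounded up to a whole number of chunks, so the effective area exceeds the iteration area.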

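
The bundle reports one `<search_likelihood>` value per workunit, with a numeric suffix on every tag after the first. A minimal sketch for pulling those values out of the stderr text follows. The helper name `parse_search_likelihoods` is hypothetical, and treating the unsuffixed tag as index 0 is an assumption made here for convenience.

```python
import re

# Matches <search_likelihood>, <search_likelihood1>, ... and requires the
# closing tag to carry the same (possibly empty) numeric suffix.
_SEARCH_LIKELIHOOD = re.compile(
    r"<search_likelihood(\d*)>\s*(-?\d+(?:\.\d+)?)\s*</search_likelihood\1>"
)


def parse_search_likelihoods(stderr_text):
    """Return {workunit_index: likelihood} for each tag found in the text."""
    results = {}
    for suffix, value in _SEARCH_LIKELIHOOD.findall(stderr_text):
        index = int(suffix) if suffix else 0  # assumption: no suffix means 0
        results[index] = float(value)
    return results


# Two lines copied from the log above.
sample = """
<search_likelihood> -2.705510403140722 </search_likelihood>
<search_likelihood1> -2.706986186141305 </search_likelihood1>
"""

likelihoods = parse_search_likelihoods(sample)
print(likelihoods)
```

The backreference `\1` keeps the pattern from pairing an opening tag with a closing tag from a different workunit.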
