Welcome to MilkyWay@home

Problem with OpenCL tasks on NVIDIA Telsa C1060

Message boards : Number crunching : Problem with OpenCL tasks on NVIDIA Telsa C1060
Message board moderation

To post messages, you must log in.

AuthorMessage
Greg Tippitt

Send message
Joined: 8 Apr 11
Posts: 3
Credit: 46,155,607
RAC: 0
Message 62305 - Posted: 9 Sep 2014, 23:22:18 UTC

I have 5 machines that I'm trying to get to run OpenCL tasks for MilkyWay. They all have the same hardware. They each have 4 hex core AMD Opteron 8431 CPUs and NVIDIA Telsa C1060 GPUs. These GPUs have 4GB memory and support double precision. The GPUs run tasks for SETI, POEM, EINSTEIN, and GPUGRID without problems. On some other MW@H tasks, I have had errors with difficulty writing finished files, which have been due to an unstable NFS disk server, but I've gotten those cleaned up.

I don't know where to start with these GPU errors from MilkyWay, since the GPUs are working for other projects okay, and the CPUs are working on MilkyWay okay as well. Any help would be greatly appreciated.


N-Body Simulation v1.42 jobs runs fine with it using 16 of the 24 cores on the machine to runs these really fast. Separation tasks on CPU run fine as well.

But GPU tasks for NVIDIA OpenCL for both @home v1.02 and Separation (Modified Fit) v1.30 end with errors.

The links for my hardware and the examples of three tasks are below.

Thanks and good luck with the fund raiser, I can't wait for my t-shirt to arrive.

Greg


==================================
http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=583640

NVIDIA GPU 0: Tesla T10 Processor
driver version 340.32
CUDA version 6.5
compute capability 1.3
OpenCL 1.0
4096MB, 4041MB available, 933 GFLOPS peak


Processor: AMD Six-Core AMD Opteron(tm) Processor 8431 [Family 16 Model 8 Stepping 0]

Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt hw_pstate npt lbrv svm_lock nrip_save pausefilter

OS: Linux: 3.13.0-34-generic

======================================
Application version MilkyWay@Home N-Body Simulation v1.42 (mt)

http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=820222153


<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_nbody 1.42 Linux x86_64 double OpenMP, Crlibm </search_application>
Using OpenMP 16 max threads on a system with 24 processors
RHO MAX IS 7.17600
7.17600<search_likelihood>-5.740629918799274</search_likelihood>
14:49:03 (21689): called boinc_finish

</stderr_txt>
]]>


=====================================
Application version Milkyway@Home Separation (Modified Fit) v1.30 (opencl_nvidia)


http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=825033456

Stderr output
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
<search_application> milkyway_separation 1.30 Linux x86_64 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Setting process priority to 0 (13): Permission denied
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Switching to Parameter File
Using SSE3 path
Error getting number of platform (-1001): CL_PLATFORM_NOT_FOUND_KHR
Failed to get information about device
Error getting device and context (1): MW_CL_ERROR
Failed to calculate likelihood
<background_integral> nan </background_integral>
<stream_integral> nan nan nan </stream_integral>
<background_likelihood> nan </background_likelihood>
<stream_only_likelihood> nan nan nan </stream_only_likelihood>
<search_likelihood> nan </search_likelihood>
10:22:39 (3969): called boinc_finish

</stderr_txt>
]]>

---------------------------------------
Application version MilkyWay@Home v1.02 (opencl_nvidia)

http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=824774451

<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
<search_application> milkyway_separation 1.02 Linux x86_64 double OpenCL </search_application>
Unrecognized XML in project preferences: max_gfx_cpu_pct
Skipping: 0
Skipping: /max_gfx_cpu_pct
Unrecognized XML in project preferences: allow_non_preferred_apps
Skipping: 1
Skipping: /allow_non_preferred_apps
Unrecognized XML in project preferences: nbody_graphics_poll_period
Skipping: 30
Skipping: /nbody_graphics_poll_period
Unrecognized XML in project preferences: nbody_graphics_float_speed
Skipping: 5
Skipping: /nbody_graphics_float_speed
Unrecognized XML in project preferences: nbody_graphics_textured_point_size
Skipping: 250
Skipping: /nbody_graphics_textured_point_size
Unrecognized XML in project preferences: nbody_graphics_point_point_size
Skipping: 40
Skipping: /nbody_graphics_point_point_size
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Setting process priority to 0 (13): Permission denied
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Error getting number of platform (-1001): CL_PLATFORM_NOT_FOUND_KHR
Failed to get information about device
Error getting device and context (1): MW_CL_ERROR
Failed to calculate likelihood
<background_integral> nan </background_integral>
<stream_integral> nan nan nan </stream_integral>
<background_likelihood> nan </background_likelihood>
<stream_only_likelihood> nan nan nan </stream_only_likelihood>
<search_likelihood> nan </search_likelihood>
04:43:39 (4471): called boinc_finish

</stderr_txt>
]]>
---------------------------------------------
ID: 62305 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Problem with OpenCL tasks on NVIDIA Telsa C1060

©2024 Astroinformatics Group