Problem with OpenCL tasks on NVIDIA Tesla C1060
Author | Message |
---|---|
Joined: 8 Apr 11 Posts: 3 Credit: 46,155,607 RAC: 0 |
I have 5 machines that I'm trying to get to run OpenCL tasks for MilkyWay. They all have the same hardware: 4 hex-core AMD Opteron 8431 CPUs and NVIDIA Tesla C1060 GPUs. These GPUs have 4GB of memory and support double precision. The GPUs run tasks for SETI, POEM, EINSTEIN, and GPUGRID without problems. On some other MW@H tasks I have had errors writing finished files, which were due to an unstable NFS disk server, but I've gotten those cleaned up. I don't know where to start with these GPU errors from MilkyWay, since the GPUs are working okay for other projects and the CPUs are working okay on MilkyWay as well. Any help would be greatly appreciated. N-Body Simulation v1.42 jobs run fine, using 16 of the 24 cores on the machine to finish really fast. Separation tasks on CPU run fine as well. But GPU tasks for NVIDIA OpenCL, for both MilkyWay@Home v1.02 and Separation (Modified Fit) v1.30, end with errors. The links for my hardware and examples of three tasks are below. Thanks, and good luck with the fundraiser. I can't wait for my t-shirt to arrive.
Greg

==================================

http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=583640
NVIDIA GPU 0: Tesla T10 Processor driver version 340.32 CUDA version 6.5 compute capability 1.3 OpenCL 1.0 4096MB, 4041MB available, 933 GFLOPS peak
Processor: AMD Six-Core AMD Opteron(tm) Processor 8431 [Family 16 Model 8 Stepping 0]
Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt hw_pstate npt lbrv svm_lock nrip_save pausefilter
OS: Linux: 3.13.0-34-generic

======================================

Application version MilkyWay@Home N-Body Simulation v1.42 (mt)
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=820222153
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
<search_application> milkyway_nbody 1.42 Linux x86_64 double OpenMP, Crlibm </search_application>
Using OpenMP 16 max threads on a system with 24 processors
RHO MAX IS 7.17600 7.17600<search_likelihood>-5.740629918799274</search_likelihood>
14:49:03 (21689): called boinc_finish
</stderr_txt>
]]>

=====================================

Application version Milkyway@Home Separation (Modified Fit) v1.30 (opencl_nvidia)
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=825033456
Stderr output
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
<search_application> milkyway_separation 1.30 Linux x86_64 double OpenCL </search_application>
Reading preferences ended prematurely
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Setting process priority to 0 (13): Permission denied
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Switching to Parameter File
Using SSE3 path
Error getting number of platform (-1001): CL_PLATFORM_NOT_FOUND_KHR
Failed to get information about device
Error getting device and context (1): MW_CL_ERROR
Failed to calculate likelihood
<background_integral> nan </background_integral>
<stream_integral> nan nan nan </stream_integral>
<background_likelihood> nan </background_likelihood>
<stream_only_likelihood> nan nan nan </stream_only_likelihood>
<search_likelihood> nan </search_likelihood>
10:22:39 (3969): called boinc_finish
</stderr_txt>
]]>

---------------------------------------

Application version MilkyWay@Home v1.02 (opencl_nvidia)
http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=824774451
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)
</message>
<stderr_txt>
<search_application> milkyway_separation 1.02 Linux x86_64 double OpenCL </search_application>
Unrecognized XML in project preferences: max_gfx_cpu_pct
Skipping: 0
Skipping: /max_gfx_cpu_pct
Unrecognized XML in project preferences: allow_non_preferred_apps
Skipping: 1
Skipping: /allow_non_preferred_apps
Unrecognized XML in project preferences: nbody_graphics_poll_period
Skipping: 30
Skipping: /nbody_graphics_poll_period
Unrecognized XML in project preferences: nbody_graphics_float_speed
Skipping: 5
Skipping: /nbody_graphics_float_speed
Unrecognized XML in project preferences: nbody_graphics_textured_point_size
Skipping: 250
Skipping: /nbody_graphics_textured_point_size
Unrecognized XML in project preferences: nbody_graphics_point_point_size
Skipping: 40
Skipping: /nbody_graphics_point_point_size
BOINC GPU type suggests using OpenCL vendor 'NVIDIA Corporation'
Setting process priority to 0 (13): Permission denied
Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4'
Error reading astronomy parameters from file 'astronomy_parameters.txt'
Trying old parameters file
Using SSE3 path
Error getting number of platform (-1001): CL_PLATFORM_NOT_FOUND_KHR
Failed to get information about device
Error getting device and context (1): MW_CL_ERROR
Failed to calculate likelihood
<background_integral> nan </background_integral>
<stream_integral> nan nan nan </stream_integral>
<background_likelihood> nan </background_likelihood>
<stream_only_likelihood> nan nan nan </stream_only_likelihood>
<search_likelihood> nan </search_likelihood>
04:43:39 (4471): called boinc_finish
</stderr_txt>
]]>

--------------------------------------------- |
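
Both failing GPU tasks stop at `CL_PLATFORM_NOT_FOUND_KHR` (-1001), which is the code the OpenCL ICD loader returns when it finds no vendor platforms. One place to start might be checking what is registered with the loader on each machine. The sketch below is only a rough diagnostic, not anything from the MilkyWay application: it assumes the common Linux default registry directory `/etc/OpenCL/vendors`, and the helper names `list_icd_entries` and `can_load` are made up for illustration.

```python
import ctypes
import glob
import os


def list_icd_entries(vendors_dir="/etc/OpenCL/vendors"):
    """Return (icd_file, library_name) pairs found in the ICD registry directory.

    The default path is an assumption (the usual Linux location); pass another
    directory if the system is set up differently.
    """
    entries = []
    for path in sorted(glob.glob(os.path.join(vendors_dir, "*.icd"))):
        with open(path) as fh:
            lib = fh.read().strip()  # each .icd file names one vendor library
        if lib:
            entries.append((path, lib))
    return entries


def can_load(lib):
    """Return True if the dynamic linker can load the named library."""
    try:
        ctypes.CDLL(lib)
        return True
    except OSError:
        return False


if __name__ == "__main__":
    entries = list_icd_entries()
    if not entries:
        print("No .icd files found, so the loader has no platforms to report.")
    for path, lib in entries:
        status = "loadable" if can_load(lib) else "NOT loadable"
        print(f"{path}: {lib} ({status})")
```

Running this as the same user account the BOINC client runs under may be more telling than running it as root, since a library or directory that is unreadable for that account would look the same to the application as a missing platform.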