1)
Message boards :
Number crunching :
Questions on GPU config settings
(Message 57683)
Posted 27 Mar 2013 by jay_e Post: My warehouses, having burned down, I have a clear vision of the moon. Mikey, Thanks again! After days of trials and false starts, your explanation gives a clear vision of the moon. (and BOINC) Jay |
2)
Message boards :
Number crunching :
Too Many WU downloaded when additional buffers =0
(Message 57682)
Posted 27 Mar 2013 by jay_e Post: Mikey, Richard, Thanks to you both! I really appreciate the info. I have been trying to read other posts before asking the same question again. I am truly amazed how much you both contribute to answering the questions of others! Thanks again, Jay ps will have to get a nvidia card so I can crunch for seti too. |
3)
Message boards :
Number crunching :
Too Many WU downloaded when additional buffers =0
(Message 57671)
Posted 27 Mar 2013 by jay_e Post: It would be simplest to just allow the other tasks to finish and then download MW units. Yes! I was alarmed that BOINC did not follow the Project Resource Share. I wanted to have all (3) projects enabled. But BOINC looks at past performance as well: http://boinc.berkeley.edu/trac/wiki/ClientSched says: Project scheduling priority Both scheduling policies involve a notion of project scheduling priority, a dynamic quantity that reflects how much processing has been done recently by the project's tasks relative to its resource share. It didn't say how far back it looks for "recently". By experience I have found that if I manually enable one project at a time - then allow all projects - the project that had the lease run-time comes back with a vengeance and overshadows the current project-share ratios. Just not intuitively obvious. Thanks for your response!!! Jay |
4)
Message boards :
Number crunching :
Too Many WU downloaded when additional buffers =0
(Message 57670)
Posted 27 Mar 2013 by jay_e Post: I would think it has to do with your time to run one project before switching to the next project. Add in the the fact that MW units take very little time to run and therefore if you let a few units come thru your very fast 8 core cpu is just responding to your settings. Hi, I pondered over the BOINC scheduling Wiki http://boinc.berkeley.edu/trac/wiki/ClientSched This problem went away as WU were processed. I assume that it just took time for BOINC to set the correlation factor. Now, I'm focusing on the other problem where I have set the preference for 50% of CPU and MW uses 100% every other time!. I posted in this thread because I thought it had to do with the N-Body 1.08 release: Thanks again! Jay http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=3171&sort=6 starting at http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=3171&nowrap=true#57666 |
5)
Message boards :
News :
N-Body 1.08
(Message 57669)
Posted 27 Mar 2013 by jay_e Post: 2nd 'continued' posting to CPU using 100% when 50% specified. OK. It looks like every the overload happens on every other GPU WU. I checked to see if the problem was in thesystem monitor. I went back to the sysstat package and ran sar -P ALL here is what it reported during the overload: 08:45:35 PM CPU %user %nice %system %iowait %steal %idle 08:45:45 PM all 1.64 92.98 0.82 0.00 0.00 4.56 08:45:45 PM 0 0.70 96.72 0.70 0.00 0.00 1.89 08:45:45 PM 1 0.00 94.31 0.50 0.00 0.00 5.19 08:45:45 PM 2 1.00 94.41 0.70 0.00 0.00 3.90 08:45:45 PM 3 0.70 94.31 0.70 0.00 0.00 4.29 08:45:45 PM 4 5.32 87.56 1.71 0.00 0.00 5.42 08:45:45 PM 5 3.09 89.82 1.20 0.00 0.00 5.89 08:45:45 PM 6 1.71 92.38 0.50 0.00 0.00 5.42 08:45:45 PM 7 0.70 94.49 0.40 0.00 0.00 4.40 this shows that all 8, indeed, are used. The BOINC status only shows 4 CPU plus one CPU-GPU task. A ps -ef only shows ( but there are 3 more running at 100% somwhere) $ ps -ef | grep boinc boinc 2473 1 0 19:03 ? 00:01:01 /usr/bin/boinc --check_all_logins --redirectio --dir /var/lib/boinc-client jay 2514 1 1 19:04 ? 00:01:59 /usr/bin/boincmgr boinc 2551 2473 69 19:06 ? 01:15:17 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.08_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h histogram.txt --seed 230516087 -np 6 -p 2.2613531307244 2.30756208857862 0.280775306084978 0.307090154017686 13.7441723154459 0.146058620134541 boinc 2552 2473 68 19:06 ? 01:15:02 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_separation_1.01_x86_64-pc-linux-gnu -np 20 -p 0.401795116392895 11.0570420466829 20 120 9.23227259465482 6.02408145224687 -4.65891254542505 13.49007896143 20 122.412517562509 2.3 0.569129901562 -6.28318530717959 2.69928641156317 20 244 2.4 4.0146428615553 6.28318530717959 0.984356404794499 boinc 2554 2473 68 19:06 ? 01:14:56 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_separation_1.01_x86_64-pc-linux-gnu -np 20 -p 0.917377772089099 1 20 218.173156674949 9.36815519919617 6.28318530717959 5.3483147091067 16.745039480715 4.81361567974091 151.560387347829 2.3 6.28318530717959 -6.28318530717959 3.30858427949722 20 244 2.4 5.39780165528236 -4.43232117875659 4.42432041794522 boinc 3038 2473 66 20:37 ? 00:12:06 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.08_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h histogram.txt --seed 25278614 -np 6 -p 1.5 1.5 0.5 0.5 15 0.128067006109071 boinc 3043 2473 99 20:44 ? 01:01:30 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.08_x86_64-pc-linux-gnu_mt__opencl_amd_ati -f nbody_parameters.lua -h histogram.txt --seed 130617792 -np 6 -p 2.5 1.66034131823107 0.5 0.330835734494028 15 0.131445339787751 --device 0 I tried running the ps-ef as root - same thing ah-hah "htop" shows 9 task - different PIDs running opencl amd ati tasks. Hmmmm Anyone else see this? Should I change to Beta fglrx drivers? Thanks, Jay |
6)
Message boards :
News :
N-Body 1.08
(Message 57667)
Posted 27 Mar 2013 by jay_e Post: -additional data to previous post -- Data when GPU task changes. The overload stopped when the GPU task finished. System Moinitor shows 5 of 8 cores at 100%. But then, after I was writing this post, all 8 cores went back to 100% Tue 26 Mar 2013 07:50:18 PM EDT | Milkyway@Home | Computation for task de_nbody_100K_EMD_32013_2_1358941502_373121_1 finished Hmmm. Not sure how BOINC and MW list the CPU task that handles the GPU loading/unloading. I did check when I suspended the GPU task and the overload stopped. It *was* the GPU task - not the CPU task that I suspended on the BOINC Manager screen - when the overload previously stopped. The overload lasted for the time that the GPU task ran - approx. 20 minutes. The GPU is a Radeon HD 7750 with 2GB memory - slower - but less heat and watts. Here is link to the 1st completed GPU task - no errors in stderr. http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=428297681 After the load went back to 100%. I did a ps -ef to get a task list - but all of the parameters did not fit on the display. boinc 2551 2473 60 19:06 ? 00:42:17 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.08_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h histogram.txt --seed 230516087 boinc 2552 2473 59 19:06 ? 00:41:49 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_separation_1.01_x86_64-pc-linux-gnu -np 20 -p 0.401795116392895 11.0570420466829 20 120 9.23 boinc 2553 2473 60 19:06 ? 00:42:02 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.08_x86_64-pc-linux-gnu__mt -f nbody_parameters.lua -h histogram.txt --seed 244208335 boinc 2554 2473 60 19:06 ? 00:41:55 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_separation_1.01_x86_64-pc-linux-gnu -np 20 -p 0.917377772089099 1 20 218.173156674949 9.3681 root 2733 2 0 19:44 ? 00:00:00 [flush-8:0] root 2797 2 0 19:52 ? 00:00:00 [kworker/4:2] boinc 2830 2473 99 20:04 ? 01:04:40 ../../projects/milkyway.cs.rpi.edu_milkyway/milkyway_nbody_1.08_x86_64-pc-linux-gnu_mt__opencl_amd_ati -f nbody_parameters.lua -h histogram.txt - Enjoy! Jay |
7)
Message boards :
News :
N-Body 1.08
(Message 57666)
Posted 26 Mar 2013 by jay_e Post: Greetings. I have an 8 core CPU and set the CPU preferences to 50%. 4 CPU tasks are loaded and 1 GPU task. Yet the CPU monitor shows all 8 cores are running at 100%. I set no new tasks. Let all tasks finish. stopped work from all other projects and rebooted. Problem repeats. Summary: Ubuntu-Linux and de_nbody_100K_EMD_32013_2_1358941502_444444_0 . details follow. Tue 26 Mar 2013 07:03:52 PM EDT | | Starting BOINC client version 7.0.27 for x86_64-pc-linux-gnu Tue 26 Mar 2013 07:03:52 PM EDT | | log flags: file_xfer, sched_ops, task Tue 26 Mar 2013 07:03:52 PM EDT | | Libraries: libcurl/7.29.0 OpenSSL/1.0.1c zlib/1.2.7 libidn/1.25 librtmp/2.3 Tue 26 Mar 2013 07:03:52 PM EDT | | Data directory: /var/lib/boinc-client Tue 26 Mar 2013 07:03:52 PM EDT | | Processor: 8 AuthenticAMD AMD FX(tm)-8150 Eight-Core Processor [Family 21 Model 1 Stepping 2] Tue 26 Mar 2013 07:03:52 PM EDT | | Processor: 2.00 MB cache Tue 26 Mar 2013 07:03:52 PM EDT | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni pclmulqdq monitor ssse3 cx16 sse4_1 sse4_2 popcnt aes xsave avx lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 nodeid_msr topoext perfctr_core arat cpb hw_pstate npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold Tue 26 Mar 2013 07:03:52 PM EDT | | OS: Linux: 3.8.0-13-generic Tue 26 Mar 2013 07:03:52 PM EDT | | Memory: 7.70 GB physical, 8.04 GB virtual Tue 26 Mar 2013 07:03:52 PM EDT | | Disk: 18.33 GB total, 16.27 GB free Tue 26 Mar 2013 07:03:52 PM EDT | | Local time is UTC -4 hours Tue 26 Mar 2013 07:03:52 PM EDT | | ATI GPU 0: Capeverde (CAL version 1.4.1741, 2048MB, 1710MB available, 2048 GFLOPS peak) Tue 26 Mar 2013 07:03:52 PM EDT | | OpenCL: ATI GPU 0: Capeverde (driver version 1084.4 (VM), device version OpenCL 1.2 AMD-APP (1084.4), 2048MB, 1710MB available) Tue 26 Mar 2013 07:03:52 PM EDT | | Config: use all coprocessors Tue 26 Mar 2013 07:03:52 PM EDT | | Config: GUI RPC allowed from: Tue 26 Mar 2013 07:03:52 PM EDT | | A new version of BOINC is available. <a href=http://boinc.berkeley.edu/download.php>Download it.</a> Tue 26 Mar 2013 07:03:52 PM EDT | malariacontrol.net | URL http://www.malariacontrol.net/; Computer ID 621946; resource share 20 Tue 26 Mar 2013 07:03:52 PM EDT | LHC@home 1.0 | URL http://lhcathomeclassic.cern.ch/sixtrack/; Computer ID 10282414; resource share 20 Tue 26 Mar 2013 07:03:52 PM EDT | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 2325898; resource share 40 Tue 26 Mar 2013 07:03:52 PM EDT | Milkyway@Home | URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 508164; resource share 20 Tue 26 Mar 2013 07:03:52 PM EDT | | General prefs: from http://setiathome.berkeley.edu/ (last modified 24-Mar-2013 04:56:00) Tue 26 Mar 2013 07:03:52 PM EDT | | Host location: none Tue 26 Mar 2013 07:03:52 PM EDT | | General prefs: using your defaults Tue 26 Mar 2013 07:03:52 PM EDT | | Preferences: Tue 26 Mar 2013 07:03:52 PM EDT | | max memory usage when active: 7494.78MB Tue 26 Mar 2013 07:03:52 PM EDT | | max memory usage when idle: 7494.78MB Tue 26 Mar 2013 07:03:52 PM EDT | | max disk usage: 10.00GB Tue 26 Mar 2013 07:03:52 PM EDT | | max CPUs used: 7 Tue 26 Mar 2013 07:03:52 PM EDT | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager) Tue 26 Mar 2013 07:03:52 PM EDT | | Not using a proxy Tue 26 Mar 2013 07:04:17 PM EDT | LHC@home 1.0 | update requested by user Tue 26 Mar 2013 07:04:19 PM EDT | LHC@home 1.0 | Sending scheduler request: Requested by user. Tue 26 Mar 2013 07:04:19 PM EDT | LHC@home 1.0 | Not reporting or requesting tasks Tue 26 Mar 2013 07:04:20 PM EDT | LHC@home 1.0 | work fetch resumed by user Tue 26 Mar 2013 07:04:21 PM EDT | LHC@home 1.0 | Scheduler request completed Tue 26 Mar 2013 07:04:31 PM EDT | LHC@home 1.0 | Sending scheduler request: To fetch work. Tue 26 Mar 2013 07:04:31 PM EDT | LHC@home 1.0 | Requesting new tasks for CPU Tue 26 Mar 2013 07:04:33 PM EDT | LHC@home 1.0 | Scheduler request completed: got 0 new tasks Tue 26 Mar 2013 07:04:33 PM EDT | LHC@home 1.0 | Project has no tasks available Tue 26 Mar 2013 07:05:21 PM EDT | LHC@home 1.0 | work fetch suspended by user Tue 26 Mar 2013 07:05:53 PM EDT | | General prefs: from http://setiathome.berkeley.edu/ (last modified 24-Mar-2013 04:56:00) Tue 26 Mar 2013 07:05:53 PM EDT | | Host location: none Tue 26 Mar 2013 07:05:53 PM EDT | | General prefs: using your defaults Tue 26 Mar 2013 07:05:53 PM EDT | | Reading preferences override file Tue 26 Mar 2013 07:05:53 PM EDT | | Preferences: Tue 26 Mar 2013 07:05:53 PM EDT | | max memory usage when active: 7494.78MB Tue 26 Mar 2013 07:05:53 PM EDT | | max memory usage when idle: 7494.78MB Tue 26 Mar 2013 07:05:53 PM EDT | | max disk usage: 10.00GB Tue 26 Mar 2013 07:05:53 PM EDT | | Number of usable CPUs has changed from 7 to 4. [color=darkred]This is where I set preferences to 50% before allowing ANY work.[/color] Tue 26 Mar 2013 07:05:53 PM EDT | | max CPUs used: 4 Tue 26 Mar 2013 07:05:53 PM EDT | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager) Tue 26 Mar 2013 07:05:59 PM EDT | Milkyway@Home | work fetch resumed by user Tue 26 Mar 2013 07:06:53 PM EDT | Milkyway@Home | Sending scheduler request: To fetch work. Tue 26 Mar 2013 07:06:53 PM EDT | Milkyway@Home | Requesting new tasks for CPU and ATI Tue 26 Mar 2013 07:06:55 PM EDT | Milkyway@Home | Scheduler request completed: got 5 new tasks Tue 26 Mar 2013 07:06:57 PM EDT | Milkyway@Home | Starting task de_nbody_100K_EMD_32013_2_1358941502_444444_0 using milkyway_nbody version 108 (opencl_amd_ati) in slot 0 Tue 26 Mar 2013 07:06:57 PM EDT | Milkyway@Home | Starting task ps_nbody_100K_EMD_32013_2_1358941502_274697_2 using milkyway_nbody version 108 in slot 1 Tue 26 Mar 2013 07:06:57 PM EDT | Milkyway@Home | Starting task de_separation_23_3s_sSgr_1_1358941502_28794660_0 using milkyway version 101 in slot 2 Tue 26 Mar 2013 07:06:57 PM EDT | Milkyway@Home | Starting task ps_nbody_100K_EMD_32013_2_1358941502_444422_0 using milkyway_nbody version 108 in slot 3 Tue 26 Mar 2013 07:06:57 PM EDT | Milkyway@Home | Starting task de_separation_23_3s_sSgr_1_1358941502_28794661_0 using milkyway version 101 in slot 4 App_config.xml and cc_config.xml <app_config> <app> <name>hcc1</name> <max_concurrent>2</max_concurrent>` <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.5</cpu_usage> </gpu_version> </app> <app> <name>milkyway</name> <max_concurrent>2</max_concurrent> <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.5</cpu_usage> </gpu_versions> </app> </app_config> ================================================= <cc_config> <options> <use_all_gpus>1</use_all_gpus> </options> </cc_config> Need more data? In comparison, when using WCG Help Conquer Cancer GPU WU, the total CPU utilization is ~50% Is there a way to see if something is spinning in a loop? Its not a reliable measurement, but the fans sound like they are running for a 100% load. One weirdness. The BOINC Task page, on the line describing the GPU task says: "Running (0.05 CPUs + 1 ATI GPU) The xml file was not set to 0.05 for CPU. And, it looks like FOUR CPUs are attached/linked/associated with the GPU task. If I suspend the GPU task, the utilization goes to using 4 of the 8 CPUs at 100% and the other 4 are idle. Resuming the single GPU task sets all 8 cores to 100%. This is not temporary, but lasts all the time the GPU is running. I have not observed, yet, what happens when the WU finishes and uploads finished data and gets new data into the GPU. I'll try to observe this and report later. T H A N K S, Jay --edit - add fglrx versions -- Package fglrx: i 2:9.010-0ubuntu2 raring 500 Package fglrx-amdcccle: i A 2:9.010-0ubuntu2 raring 500 Ubuntu 13.04. |
8)
Message boards :
Number crunching :
Questions on GPU config settings
(Message 57664)
Posted 26 Mar 2013 by jay_e Post: Hi Mikey!! Thanks for the reply!! a) Had already replaced cc_config.xml with much shorter version that had ncpus defined. Replaced file with exact copy of yours. Will wait for current WU in WCG to finish and will restart, reboot, turn it upside-down and shake it over my head. b) The 0.05 value caught my eye since I had set the app-config.xml value to 1.0 and, yet, Boinc reports it as 0.05 in the expert-view task list. c) Thanks for the explanation of running multiple projects concurrently on a GPU. That makes perfect sense since - considering the complexity of loading and unloading data to/from the GPU. Other forum-posts have led me to believe that a single GPU card could hold or process more than one gpu WU at a time. The posts mentioned that, at least, the card could be crunching on one WU while uploading/downloading another WU. Other forum posts talk about setting up multiple GPU cards, each one fed by a (possible different) BOINC project. I have a niggling thought to draw up some pictures and submit them to the boinc-wiki for clarification on the subject. I'm trying small tests to see what works with different projects. When I get stumped (often), I ask in the forums. Thanks again!! Jay --Edit: PS Try Linux some time. a dual boot gives an easy way to try it - or a live version that will boot and run from a usb-stick. If I get frustrated, I go watch you-tubes of Gallagher from 1981 Enjoy! |
9)
Message boards :
Number crunching :
Questions on GPU config settings
(Message 57644)
Posted 25 Mar 2013 by jay_e Post: Greetings. I still don't understand how to set up app_config.xml for the GPU. What I set and what I get don't make sense. Are these "suggestions" rather than fixed settings? I have one Radeon HD 7750 that is crunching and running my video. It has 2 GB of ram in it. My CPU has 8 cores and I run it at 87.5% - 7 of the 8 - just to let other programs run. Here is the cc_config.xml <cc_config> <log_flags> <file_xfer>1</file_xfer> <sched_ops>1</sched_ops> <task>1</task> </log_flags> <options> <ncpus>-1</ncpus> <save_stats_days>14</save_stats_days> <use_all_gpus>1</use_all_gpus> </options> </cc_config> Here is the app_config.xml <app_config> <app> <name>hcc1</name> <max_concurrent>2</max_concurrent>` <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.5</cpu_usage> </gpu_version> </app> <app> <name>milkyway</name> <max_concurrent>1</max_concurrent> <gpu_versions> <gpu_usage>1.0</gpu_usage> <cpu_usage>1.0</cpu_usage> </gpu_versions> </app> </app_config> When the WCG GPU task runs, the BOINC task page shows: World Community Grid ---- Running (1 CPUs + 1 ATI GPU ) (( and the normal, elapsed times and Application and Name)) When the MilkyWay GPU task runs, the BOINC task page shows: Milkyway@Home ---------- Running (0.05 CPUs +1 ATI GPU) (( and the normal, elapsed times and Application and Name)) ONLY one runs at a time. Question 1) On the Milkyway status, where did the 0.05 CPU come from? Question 2) I'm trying to get the card to do more than one wu at a time on the WCG settings - but BOINC runs only one and says there is only one. Question 3) Does the CPUtask (that is GPU related) task feed only tasks of ONE project? Or if I put one CPU for each project, will BOINC end up using 2 cpu cores to feed the one GPU card? Question 4) What is the scope of the "max_concurrent" setting? Does it refer to total WU an All GPU cards? Or does it mean a partial sum of WU running the defined task - In my case 1 Milkyway WU and 2 WCG WU on the one GPU card? Every time I read the Wiki of a post about this, I get confused at a higher level. :-) I hope this can also save someone else some headaches. THANKS in advance!!! Jay |
10)
Message boards :
Number crunching :
Too Many WU downloaded when additional buffers =0
(Message 57640)
Posted 25 Mar 2013 by jay_e Post: Greetings.. Thank you for your responses. I tried to recreate the problem this morning. The problem did not repeat. I made a small difference. The WU from all other projects completed first. This time, when allowing MW work, the correct number of WU were started (7). I waited for 2 GPU WU to complete and looked at the stderr that was reported. No errors. Previously, I had neglected to report that the additional 27 WU were downloaded within 5 minutes of starting the project. So, I agree with your comments and assume that this probably has to do with BOINC scheduling and nothing to do with MilkyWay. BOINC has a lot of fixes coded into recent releases effecting priority of scheduling. There is one more item - but it is not related to this topic. Thanks again!! Jay |
11)
Message boards :
Number crunching :
Too Many WU downloaded when additional buffers =0
(Message 57636)
Posted 25 Mar 2013 by jay_e Post: Greetings, I am new to WU, but have been with BOINC for years. I chose MilkyWay@home to use my GPU to process. I have an 8-core CPU and have set the preferences to run 87.5% ( 7/8) of them. I attached to the MW project and set No allowed tasks until I could set up the 'default' venue preferences. I then waited until all WCG GPU finished only 4 CPU WUs of WCG were running. At first every thing was OK. MW downloaded 4 GPU tasks and 3 or 4 CPU tasks. (There was room for 1 GPU-CPU and 2 CPU tasks to run. Then it got weird. MW downloaded 23 CPU tasks. They knocked the WCG tasks into waiting. Some had around 30 minutes to go until complete. I had cleared the local prefs before this - and I caleed them back up to see what the preferences had for - Minimum work buffer - Max additional work buffer. The values for both were 0.0 I aborted the 23 so all WU could finish up. I'll check the forum for answers before I fire up MW again. I wanted to post in the MW forum first- before going to the boinc forum - to see if this is something related to MW. Thanks in advance!! Oh Yes. Environment stuff Sun 24 Mar 2013 04:04:11 PM EDT | | Starting BOINC client version 7.0.27 for x86_64-pc-linux-gnu Sun 24 Mar 2013 04:04:11 PM EDT | | Libraries: libcurl/7.29.0 OpenSSL/1.0.1c zlib/1.2.7 libidn/1.25 librtmp/2.3 Sun 24 Mar 2013 04:04:11 PM EDT | | Data directory: /var/lib/boinc-client Sun 24 Mar 2013 04:04:11 PM EDT | | Processor: 8 AuthenticAMD AMD FX(tm)-8150 Eight-Core Processor [Family 21 Model 1 Stepping 2] Sun 24 Mar 2013 04:04:11 PM EDT | | Processor: 2.00 MB cache Sun 24 Mar 2013 04:04:11 PM EDT | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni pclmulqdq monitor ssse3 cx16 sse4_1 sse4_2 popcnt aes xsave avx lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 nodeid_msr topoext perfctr_core arat cpb hw_pstate npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold Sun 24 Mar 2013 04:04:11 PM EDT | | OS: Linux: 3.8.0-13-generic Sun 24 Mar 2013 04:04:11 PM EDT | | Memory: 7.70 GB physical, 8.04 GB virtual Sun 24 Mar 2013 04:04:11 PM EDT | | Disk: 18.33 GB total, 16.28 GB free Sun 24 Mar 2013 04:04:11 PM EDT | | Local time is UTC -4 hours Sun 24 Mar 2013 04:04:11 PM EDT | | ATI GPU 0: Capeverde (CAL version 1.4.1741, 2048MB, 1708MB available, 2048 GFLOPS peak) Sun 24 Mar 2013 04:04:11 PM EDT | | OpenCL: ATI GPU 0: Capeverde (driver version 1084.4 (VM), device version OpenCL 1.2 AMD-APP (1084.4), 2048MB, 1708MB available) Sun 24 Mar 2013 04:04:11 PM EDT | | Config: use all coprocessors Sun 24 Mar 2013 04:04:11 PM EDT | | Config: don't compute while brasero is running Sun 24 Mar 2013 04:04:11 PM EDT | | Config: GUI RPC allowed from: I looked up the difference between BOINC 7.0.27 and 7.0.28 the changelog said the difference was a Windows patch, so I should have the latest, stable, Linux release. The OS is the Raring release from Ubuntu that I downloaded on 3/22/2013 it may be overkill, but here is the clinfo... Number of platforms: 1 Platform Profile: FULL_PROFILE Platform Version: OpenCL 1.2 AMD-APP (1084.4) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices Platform Name: AMD Accelerated Parallel Processing Number of devices: 2 Device Type: CL_DEVICE_TYPE_GPU Device ID: 4098 Board name: AMD Radeon HD 7700 Series Device Topology: PCI[ B#1, D#0, F#0 ] Max compute units: 8 Max work items dimensions: 3 Max work items[0]: 256 Max work items[1]: 256 Max work items[2]: 256 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 800Mhz Address bits: 32 Max memory allocation: 536870912 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 8 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 1508900864 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue properties: Out-of-Order: No Profiling : Yes Platform ID: 0x00007feadac62e40 Name: Capeverde Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 1.2 Driver version: 1084.4 (VM) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1084.4) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_amd_c1x_atomics Device Type: CL_DEVICE_TYPE_CPU Device ID: 4098 Board name: Max compute units: 8 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 1024 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 8 Preferred vector width double: 4 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 8 Native vector width double: 4 Max clock frequency: 3600Mhz Address bits: 64 Max memory allocation: 2147483648 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 8 Max image 2D width: 8192 Max image 2D height: 8192 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 4096 Alignment (bits) of base address: 1024 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: Yes Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 8272465920 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Global Local memory size: 32768 Kernel Preferred work group size multiple: 1 Error correction support: 0 Unified memory for Host and Device: 1 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: Yes Queue properties: Out-of-Order: No Profiling : Yes Platform ID: 0x00007feadac62e40 Name: AMD FX(tm)-8150 Eight-Core Processor Vendor: AuthenticAMD Device OpenCL C version: OpenCL C 1.2 Driver version: 1084.4 (sse2,avx,fma4) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1084.4) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cc_config.xml: <cc_config> <log_flags> <file_xfer>0</file_xfer> <sched_ops>0</sched_ops> <task>0</task> <app_msg_receive>0</app_msg_receive> <app_msg_send>0</app_msg_send> <async_file_debug>0</async_file_debug> <benchmark_debug>0</benchmark_debug> <checkpoint_debug>0</checkpoint_debug> <coproc_debug>0</coproc_debug> <cpu_sched>0</cpu_sched> <cpu_sched_debug>0</cpu_sched_debug> <cpu_sched_status>0</cpu_sched_status> <dcf_debug>0</dcf_debug> <disk_usage_debug>0</disk_usage_debug> <priority_debug>0</priority_debug> <file_xfer_debug>0</file_xfer_debug> <gui_rpc_debug>0</gui_rpc_debug> <heartbeat_debug>0</heartbeat_debug> <http_debug>0</http_debug> <http_xfer_debug>0</http_xfer_debug> <mem_usage_debug>0</mem_usage_debug> <network_status_debug>0</network_status_debug> <poll_debug>0</poll_debug> <proxy_debug>0</proxy_debug> <rr_simulation>0</rr_simulation> <rrsim_detail>0</rrsim_detail> <sched_op_debug>0</sched_op_debug> <scrsave_debug>0</scrsave_debug> <slot_debug>0</slot_debug> <state_debug>0</state_debug> <statefile_debug>0</statefile_debug> <suspend_debug>0</suspend_debug> <task_debug>0</task_debug> <time_debug>0</time_debug> <trickle_debug>0</trickle_debug> <unparsed_xml>0</unparsed_xml> <work_fetch_debug>0</work_fetch_debug> <notice_debug>0</notice_debug> </log_flags> <options> <abort_jobs_on_exit>0</abort_jobs_on_exit> <allow_multiple_clients>0</allow_multiple_clients> <allow_remote_gui_rpc>0</allow_remote_gui_rpc> <client_version_check_url>http://boinc.berkeley.edu/download.php?xml=1</client_version_check_url> <client_download_url>http://boinc.berkeley.edu/download.php</client_download_url> <disallow_attach>0</disallow_attach> <dont_check_file_sizes>0</dont_check_file_sizes> <dont_contact_ref_site>0</dont_contact_ref_site> <exclusive_app>brasero</exclusive_app> <exit_after_finish>0</exit_after_finish> <exit_before_start>0</exit_before_start> <exit_when_idle>0</exit_when_idle> <fetch_minimal_work>0</fetch_minimal_work> <force_auth>default</force_auth> <http_1_0>0</http_1_0> <http_transfer_timeout>300</http_transfer_timeout> <http_transfer_timeout_bps>10</http_transfer_timeout_bps> <max_file_xfers>4</max_file_xfers> <max_file_xfers_per_project>2</max_file_xfers_per_project> <max_stderr_file_size>0</max_stderr_file_size> <max_stdout_file_size>0</max_stdout_file_size> <max_tasks_reported>0</max_tasks_reported> <ncpus>-1</ncpus> <network_test_url>http://www.google.com/</network_test_url> <no_alt_platform>0</no_alt_platform> <no_gpus>0</no_gpus> <no_info_fetch>0</no_info_fetch> <no_priority_change>0</no_priority_change> <os_random_only>0</os_random_only> <proxy_info> <socks_server_name></socks_server_name> <socks_server_port>80</socks_server_port> <http_server_name></http_server_name> <http_server_port>80</http_server_port> <socks5_user_name></socks5_user_name> <socks5_user_passwd></socks5_user_passwd> <http_user_name></http_user_name> <http_user_passwd></http_user_passwd> <no_proxy></no_proxy> </proxy_info> <rec_half_life_days>10.0</rec_half_life_days> <report_results_immediately>0</report_results_immediately> <run_apps_manually>0</run_apps_manually> <save_stats_days>14</save_stats_days> <skip_cpu_benchmarks>0</skip_cpu_benchmarks> <simple_gui_only>0</simple_gui_only> <start_delay>10</start_delay> <stderr_head>0</stderr_head> <suppress_net_info>0</suppress_net_info> <unsigned_apps_ok>0</unsigned_apps_ok> <use_all_gpus>1</use_all_gpus> <use_certs>0</use_certs> <use_certs_only>0</use_certs_only> </options> </cc_config> and app_config.xml. <app_config> <app> <name>hcc1</name> <max_concurrent>2</max_concurrent>` <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.5</cpu_usage> </gpu_version> </app> <app> <name>milkyway</name> <max_concurrent>1</max_concurrent> <gpu_versions> <gpu_usage>1.0</gpu_usage> <cpu_usage>1.0</cpu_usage> </gpu_versions> </app> </app_config> Note: I wanted to start slow and easy with the MW GPU - doing just 1 WU on thebefore trying to start 2 wu on one GPU card. The Radeon HD 7750 has 2GB of ram on it (it says.) The projects directories looked OK to me. The slots directory was weird. There were 19 slots - zero though 18. Here is the link to my task list. http://milkyway.cs.rpi.edu/milkyway/results.php?userid=834251 There were *many* results with inconclusive results. I do not know if this is in any way related, but here it is... ((I'll be glad to meake them a separate post...)) http://milkyway.cs.rpi.edu/milkyway/result.php?resultid=427073904 says Stderr output <core_client_version>7.0.27</core_client_version> <![CDATA[ <stderr_txt> <search_application> milkyway_separation 1.02 Linux x86_64 double OpenCL </search_application> Unrecognized XML in project preferences: max_gfx_cpu_pct Skipping: 20 Skipping: /max_gfx_cpu_pct Unrecognized XML in project preferences: nbody_graphics_poll_period Skipping: 30 Skipping: /nbody_graphics_poll_period Unrecognized XML in project preferences: nbody_graphics_float_speed Skipping: 10 Skipping: /nbody_graphics_float_speed Unrecognized XML in project preferences: nbody_graphics_textured_point_size Skipping: 250 Skipping: /nbody_graphics_textured_point_size Unrecognized XML in project preferences: nbody_graphics_point_point_size Skipping: 40 Skipping: /nbody_graphics_point_point_size BOINC GPU type suggests using OpenCL vendor 'Advanced Micro Devices, Inc.' Setting process priority to 0 (13): Permission denied Error loading Lua script 'astronomy_parameters.txt': [string "number_parameters: 4..."]:1: '<name>' expected near '4' Error reading astronomy parameters from file 'astronomy_parameters.txt' Trying old parameters file Using AVX path Found 1 platform Platform 0 information: Name: AMD Accelerated Parallel Processing Version: OpenCL 1.2 AMD-APP (1084.4) Vendor: Advanced Micro Devices, Inc. Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices Profile: FULL_PROFILE Using device 0 on platform 0 Found 1 CL device Device 'Capeverde' (Advanced Micro Devices, Inc.:0x1002) (CL_DEVICE_TYPE_GPU) Driver version: 1084.4 (VM) Version: OpenCL 1.2 AMD-APP (1084.4) Compute capability: 0.0 Max compute units: 8 Clock frequency: 800 Mhz Global mem size: 1696595968 Local mem size: 32768 Max const buf size: 65536 Double extension: cl_khr_fp64 Build log: -------------------------------------------------------------------------------- "/tmp/OCLMvuER2.cl", line 30: warning: OpenCL extension is now part of core #pragma OPENCL EXTENSION cl_khr_fp64 : enable ^ LOOP UNROLL: pragma unroll (line 288) Unrolled as requested! LOOP UNROLL: pragma unroll (line 280) Unrolled as requested! LOOP UNROLL: pragma unroll (line 273) Unrolled as requested! LOOP UNROLL: pragma unroll (line 244) Unrolled as requested! LOOP UNROLL: pragma unroll (line 202) Unrolled as requested! -------------------------------------------------------------------------------- Build log: -------------------------------------------------------------------------------- "/tmp/OCLv06wLL.cl", line 27: warning: OpenCL extension is now part of core #pragma OPENCL EXTENSION cl_khr_fp64 : enable ^ -------------------------------------------------------------------------------- Estimated AMD GPU GFLOP/s: 64 SP GFLOP/s, 13 DP FLOP/s Warning: Bizarrely low flops (12). Defaulting to 100 Using a target frequency of 30.0 Using a block size of 2048 with 47 blocks/chunk Using clWaitForEvents() for polling (mode -1) Range: { nu_steps = 640, mu_steps = 1600, r_steps = 1400 } Iteration area: 2240000 Chunk estimate: 23 Num chunks: 24 Chunk size: 96256 Added area: 70144 Effective area: 2310144 Initial wait: 25 ms Integration time: 787.871171 s. Average time per iteration = 1231.048704 ms Integral 0 time = 790.630396 s Running likelihood with 107122 stars Likelihood time = 2.478142 s <background_integral> 0.000035500276475 </background_integral> <stream_integral> 0.594973184597082 1555.204911632331687 211.955616287150974 </stream_integral> <background_likelihood> -3.857876557072216 </background_likelihood> <stream_only_likelihood> -67.838435191593831 -3.537271642180458 -6.646429787745577 </stream_only_likelihood> <search_likelihood> -2.633785974768273 </search_likelihood> 23:09:13 (5035): called boinc_finish </stderr_txt> ]]> THANKS in ADVANCE!!! Need anything else? Please let me know. throwing tomatoes is allowed. Thanks, Jay PS the WCG GPU tasks worked OK. {edit: fix typos.} |
©2024 Astroinformatics Group