Message boards :
News :
Server updated
Message board moderation
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Send message Joined: 15 Jul 08 Posts: 383 Credit: 729,293,740 RAC: 0 |
Judging by all the replies it looks like I broke everything more than I thought. Actually things are running better here than they ever have. It appears that the standard, non app_info.xml app is having problems though. |
Send message Joined: 11 Jun 10 Posts: 329 Credit: 1,166,222,661 RAC: 0 |
Yes by all means fix it so you can get work units running the stock app if we wish but PLEASE don't pinch off our new wu download bonanza!! I might even make it to 1 MIL RAC, LOL! |
Send message Joined: 8 May 10 Posts: 576 Credit: 15,979,383 RAC: 0 |
I think I've fixed the % of CPU issue. |
Send message Joined: 30 Dec 07 Posts: 311 Credit: 149,490,184 RAC: 0 |
Not sure if this is relevant to the problem but I believe the sending of 32-bit application instead of 64-bit application to 64-bit hosts can be disabled by using <primary_platform_only> option. http://boinc.berkeley.edu/trac/changeset/22183 |
Send message Joined: 28 Feb 10 Posts: 120 Credit: 109,840,492 RAC: 0 |
[quote]... Thats it. Works really fine. thx. Franz |
Send message Joined: 11 Jun 10 Posts: 329 Credit: 1,166,222,661 RAC: 0 |
So now all my guys have plenty of work units and they are all crunching away as they always have yet EVERY ONE OF THEM IS LOSING RAC! And while they keep crunching at their regular paces while things are stopped server side for whatever reason those credits are lost! OH we get them on our totals sure but the RAC is wacked again and again. And it is taken away at a very much faster pace than it can be recovered so the issue I would like to see fixed more than anything is STOP HOSING OUR RAC'S This has nothing to do with the current update but I believe I am not alone in my feelings here. (and yes I understand the dynamic nature of averages, but this system seems weighted against us as numbers we crunch in the "background" are lost credits when looking at our rac's) |
Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0 |
In the OptApp for 64 bit AMD (with V0.23 app) .... the brook64 file .... mine has following attribs: brook64.dll, compressed size 188kb, uncompressed size 447kb, original file date prior to unpacking 07/04/2010 01:01hrs Correct file and still current for AMD 0.23 OptApp 64bit and WUs now being issued ?? Just double checking the obvious ... been caught before Regards Zy |
Send message Joined: 11 Jun 10 Posts: 329 Credit: 1,166,222,661 RAC: 0 |
All systems GO for mine right now and the rac's are jumping back faster than usual too! |
Send message Joined: 1 Sep 08 Posts: 520 Credit: 302,525,188 RAC: 0 |
That would be a yes. I've seen two problems 1) Periodically, the site is flat out inaccessible. Other times it is just very slow. 2) Lack of availability of GPU work units. Workunits have been doled out in driblets as well as larger quantities (as many as 50 on a pass) -- pretty inconsistent there. Some times the site has been up, but the message boards are not useable (threads listed, no messages). But the main thing to me is the elevator effect (server up, server down), (workunits available, workunits not available). Judging by all the replies it looks like I broke everything more than I thought. |
Send message Joined: 1 Feb 11 Posts: 17 Credit: 16,245,184 RAC: 0 |
|
Send message Joined: 24 Dec 07 Posts: 1947 Credit: 240,884,648 RAC: 0 |
What are the specs on the new server? Must be a beast if it can handle the stress it's currently under. |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
Are you seeing credit on the MW site? My pending-validation queue is going down but I'm still not seeing points... The separation workunits should be awarding credit. There's a problem with the nbody workunits in that for some reason hosts are either not claiming any credit for their results or it's not being stored in the database, which is making the assimilator/validator award 0 credit because it doesn't know what to award. Working on fixing that right now. |
Send message Joined: 1 Sep 08 Posts: 520 Credit: 302,525,188 RAC: 0 |
I've seen credits increase -- can't be sure if they are yielding properly or not since my completed credits are down due to the server not working right generally. Perhaps it might be best instead of ping-ponging the server multiple times an hours (or at least 3 or 4 times every 6 hours), to actually troubleshoot the issues and resolve them. Bouncing the server every 40 to 80 minutes with a 20 minute restart cycle without fixing things (hoping that a reboot clears the air so to speak) is not getting it done. Or so it seems to me. Are you seeing credit on the MW site? My pending-validation queue is going down but I'm still not seeing points... |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I've seen credits increase -- can't be sure if they are yielding properly or not since my completed credits are down due to the server not working right generally. The server is "bouncing" because I'm updating the assimilator/validator code to try and figure out what all went wrong. I'm not just blindly restarting things :P |
Send message Joined: 24 Feb 09 Posts: 620 Credit: 100,587,625 RAC: 0 |
Lets not do this .... credits are starting to flow now for GPU WUs. The waters must not be muddied by intermixing N Body issues (likely as not software not server related) with a massive increase in I/O throughput and all the related stresses and strains the latter puts on the various parts of the server, and communications bandwidth both internal to the server and external links - parts which have not yet had a chance to be fine tuned properly. Its going to take two or three days to fine tune the Server properly to cope with the new environment that is totally different to a limit of 6 GPUs per core, its certainly far too early yet to jump on individual aspects, its only been on less than 24 hrs. Give the guys space for a couple of days, it will take that long to get the server initially tuned up. Then its needs a few more days of real world use before further actions are taken based on objective observation, not anecdotal speculation. Considering whats just hit that server, its doing fine, needs to be better, and given the appropriate time it will be, it just doesnt happen all by yesterday. Regards Zy |
Send message Joined: 30 Aug 07 Posts: 2046 Credit: 26,480 RAC: 0 |
I found the problem with the nbody assimilator not granting credit -- I should have it fixed tonight but until then it's not going to be running. I think the separation assimilator is wokring fine (and granting credit). I'll probably turn on work generation for it later (right now it has a huge queue of workunits waiting to be sent anyways). |
Send message Joined: 11 Jun 10 Posts: 329 Credit: 1,166,222,661 RAC: 0 |
well @ the rate of 49-82 sec/wu my main guy is already out of work and there is none to be had! Can he steal wu's from my other guys, lol??? |
Send message Joined: 24 Dec 07 Posts: 1947 Credit: 240,884,648 RAC: 0 |
well @ the rate of 49-82 sec/wu my main guy is already out of work and there is none to be had! Glad I had the app_info already installed in my main cruncher with a cache of 1 day when the change occured. I've got wu's coming out of my ears! So much so that I've dropped the cache to 0.75 days to limit the number of wu's cached as client_state.xml gets a little too big and wu processing is reduced as such a large file has to get written too often. |
Send message Joined: 11 Jun 10 Posts: 329 Credit: 1,166,222,661 RAC: 0 |
Thats all good but having a stockpile of wu's does nothing to keep your rac up where it should be. When the server is being worked on and we have cached wu's to keep crunching without interruption how is it that they then take away (big time) from our rac??? We didnt slow down, the system just isn't smart enough to keep up and so we get screwed again and again! Kind of makes all this worry about cache size irrelevent if you ask me. Yeah, I know, you didn't ask me, LOL and you know what one of my guys just made a liar out of me because he in fact is increasing his rac right now while they are working on it, but he's my slowest guy! I've talked too much.....gonna shut up now, LOL |
Send message Joined: 24 Dec 07 Posts: 1947 Credit: 240,884,648 RAC: 0 |
But if you have a stockpile of wu's which you keep crunching while the server is down and once the server is back up and they are reported your RAC is going to go up. Whereas before without much of a cache your RAC would go down. Shame my main cruncher looks like it had a fit not too long after I left home for w#$^ this morning, so my RAC has been dropping. btw, I resigned from my job today. My boss/owner of the company and I didn't get along. Glad other people think I'm good at what I do. Start work at my new job as soon as my notice period is up. |
©2024 Astroinformatics Group