Welcome to MilkyWay@home

Server Outages


Advanced search

Message boards : Number crunching : Server Outages
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profilebanditwolf
Avatar

Send message
Joined: 12 Nov 07
Posts: 2425
Credit: 524,164
RAC: 0
500 thousand credit badge15 year member badge
Message 2147 - Posted: 10 Mar 2008, 19:12:37 UTC

this really needs looked at, and fixed
ID: 2147 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileCrunch3r
Volunteer developer
Avatar

Send message
Joined: 17 Feb 08
Posts: 363
Credit: 258,227,990
RAC: 0
200 million credit badge15 year member badge
Message 2148 - Posted: 10 Mar 2008, 19:42:34 UTC - in response to Message 2147.  
Last modified: 10 Mar 2008, 19:48:57 UTC

this really needs looked at, and fixed


Yeah, we need to get rid of the FreeBSD as server OS ASAP (blame the labstaff for that one).. hopefully after spring break we'll ge that all sovled...

BTW... I'm getting mad without my daily dosis of Milkyway ;)



Join Support science! Joinc Team BOINC United now!
ID: 2148 · Rating: -1 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileChertseyAl
Avatar

Send message
Joined: 31 Aug 07
Posts: 66
Credit: 1,002,668
RAC: 0
1 million credit badge15 year member badge
Message 2150 - Posted: 10 Mar 2008, 20:14:14 UTC - in response to Message 2148.  

this really needs looked at, and fixed


Yeah, we need to get rid of the FreeBSD as server OS ASAP (blame the labstaff for that one).. hopefully after spring break we'll ge that all sovled...

BTW... I'm getting mad without my daily dosis of Milkyway ;)



Alternative: Get Dave to live in the lab, do reboots etc.

I have no idea who Dave is, but we need this guy within easy reach of the reboot button ;)

I have to say, this is the most exciting project I've every taken part in. Random server access, Krazy Kredit, good science, and a real community spirit. Long may it last :)

Al.
ID: 2150 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileCori
Avatar

Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
20 million credit badge15 year member badge
Message 2151 - Posted: 10 Mar 2008, 20:24:42 UTC - in response to Message 2150.  

this really needs looked at, and fixed


Yeah, we need to get rid of the FreeBSD as server OS ASAP (blame the labstaff for that one).. hopefully after spring break we'll ge that all sovled...

BTW... I'm getting mad without my daily dosis of Milkyway ;)



Alternative: Get Dave to live in the lab, do reboots etc.

I have no idea who Dave is, but we need this guy within easy reach of the reboot button ;)

I have to say, this is the most exciting project I've every taken part in. Random server access, Krazy Kredit, good science, and a real community spirit. Long may it last :)

Al.

100% agreed. :-)

Now how can we bribe Dave to babysit the server all day?


PS. Hopefully the server will survive my post better than last time...
Lovely greetings, Cori
ID: 2151 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileCrunch3r
Volunteer developer
Avatar

Send message
Joined: 17 Feb 08
Posts: 363
Credit: 258,227,990
RAC: 0
200 million credit badge15 year member badge
Message 2152 - Posted: 10 Mar 2008, 20:31:35 UTC - in response to Message 2150.  
Last modified: 10 Mar 2008, 20:38:48 UTC



Alternative: Get Dave to live in the lab, do reboots etc.

I have no idea who Dave is, but we need this guy within easy reach of the reboot button ;)

I have to say, this is the most exciting project I've every taken part in. Random server access, Krazy Kredit, good science, and a real community spirit. Long may it last :)

Al.


Mhh... well sound good to me !! Having Dave in the lab carying only about the servers is quite a good idea :D But now that it's spring break and he has to visit some of his relatives which some are in hospital... well just give him a break for a couple of days or so... But after that well... Dave make yourself comfortable at your new home :) aka the Lab !!

LOOOOL
;)

P.S. Hope you don't mind buddy ;)
we're all just a bit to excited about the progress that we made in the past weeks ;) Cheers my friend :)




Join Support science! Joinc Team BOINC United now!
ID: 2152 · Rating: -1 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileDave Przybylo
Avatar

Send message
Joined: 5 Feb 08
Posts: 236
Credit: 49,648
RAC: 0
10 thousand credit badge15 year member badge
Message 2153 - Posted: 10 Mar 2008, 21:19:59 UTC - in response to Message 2152.  



Alternative: Get Dave to live in the lab, do reboots etc.

I have no idea who Dave is, but we need this guy within easy reach of the reboot button ;)

I have to say, this is the most exciting project I've every taken part in. Random server access, Krazy Kredit, good science, and a real community spirit. Long may it last :)

Al.


Mhh... well sound good to me !! Having Dave in the lab carying only about the servers is quite a good idea :D But now that it's spring break and he has to visit some of his relatives which some are in hospital... well just give him a break for a couple of days or so... But after that well... Dave make yourself comfortable at your new home :) aka the Lab !!

LOOOOL
;)

P.S. Hope you don't mind buddy ;)
we're all just a bit to excited about the progress that we made in the past weeks ;) Cheers my friend :)




It's making me quite angry that the server is going down so frequently as well. I've got this response from out labstaff who currently work over the break.

"I've restarted milkyway for now. Last time the milkyway server stopped
responding, it was because the BOINC cgi binary was stuck trying to acquire
a semaphore. I don't know what caused the semaphore not to be released in
the first place, I'll keep a close eye on it."

I'll check into the semaphore bug and see if we can get it resolved. However, i believe that with the new version of BOINC on the server this will be fixed. I'll also be keeping as close an eye on this as I can. And if need be, go back up to school and restart the server myself if it's not done in a timely manner next time this happen. My apologies for the outage again!

Dave Przybylo
MilkyWay@home Developer
Department of Computer Science
Rensselaer Polytechnic Institute
ID: 2153 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileCrunch3r
Volunteer developer
Avatar

Send message
Joined: 17 Feb 08
Posts: 363
Credit: 258,227,990
RAC: 0
200 million credit badge15 year member badge
Message 2156 - Posted: 10 Mar 2008, 21:38:54 UTC - in response to Message 2153.  
Last modified: 10 Mar 2008, 21:43:45 UTC

[ And if need be, go back up to school and restart the server myself if it's not done in a timely manner next time this happen.


So your moving into the lab ? LOL ;)
Wrtie some mails, get the staff working on getting rid of the FreeBS(E) OS... it's not meant to be running a boinc server after all ;)



Join Support science! Joinc Team BOINC United now!
ID: 2156 · Rating: -1 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileDoctorNow
Avatar

Send message
Joined: 28 Aug 07
Posts: 146
Credit: 10,280,584
RAC: 0
10 million credit badge15 year member badge
Message 2159 - Posted: 11 Mar 2008, 19:18:34 UTC
Last modified: 11 Mar 2008, 19:19:20 UTC

Man, these outages are getting really annoying... >:(
Is there nothing you can do about it? ;-)
Member of BOINC@Heidelberg and ATA!

My BOINCstats
ID: 2159 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Martin P.

Send message
Joined: 21 Nov 07
Posts: 52
Credit: 1,756,052
RAC: 0
1 million credit badge15 year member badge
Message 2164 - Posted: 11 Mar 2008, 20:02:08 UTC - in response to Message 2159.  

Man, these outages are getting really annoying... >:(
Is there nothing you can do about it? ;-)


Yes, there is: tell us that they work on it...


ID: 2164 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileDave Przybylo
Avatar

Send message
Joined: 5 Feb 08
Posts: 236
Credit: 49,648
RAC: 0
10 thousand credit badge15 year member badge
Message 2169 - Posted: 11 Mar 2008, 20:21:38 UTC - in response to Message 2164.  

Man, these outages are getting really annoying... >:(
Is there nothing you can do about it? ;-)


Yes, there is: tell us that they work on it...




Yes, and I'm actaully pretty powerless in this situation since I'm away from school on break. When the server goes down it disables my access to it via SSH. So I rely entirely on labstaff who can sometimes be very slow in going. If it's down for more than 12 hours at a time I'll take a drive up to the school and reboot the server manually. It makes me mad that there are so many outages. I reported this one at 4am this morning and it didnt get fixed until what, like an hour ago? That's a 12 hour lag time. Unacceptable
Dave Przybylo
MilkyWay@home Developer
Department of Computer Science
Rensselaer Polytechnic Institute
ID: 2169 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileCori
Avatar

Send message
Joined: 27 Aug 07
Posts: 647
Credit: 27,592,547
RAC: 0
20 million credit badge15 year member badge
Message 2170 - Posted: 11 Mar 2008, 20:27:31 UTC - in response to Message 2169.  
Last modified: 11 Mar 2008, 20:27:58 UTC

... If it's down for more than 12 hours at a time I'll take a drive up to the school and reboot the server manually. It makes me mad that there are so many outages...

*Ouch* I'm feeling with you. ;-)

I hope you (meaning the whole MW/labstaff team) will find a better solution soon.
Lovely greetings, Cori
ID: 2170 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
seti@elrcastor.com

Send message
Joined: 22 Dec 07
Posts: 11
Credit: 5,943,029
RAC: 0
5 million credit badge15 year member badge
Message 2173 - Posted: 11 Mar 2008, 21:18:46 UTC

ID: 2173 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfilePhiladelphia
Avatar

Send message
Joined: 9 Nov 07
Posts: 131
Credit: 180,454
RAC: 0
100 thousand credit badge15 year member badge
Message 2175 - Posted: 12 Mar 2008, 0:16:21 UTC
Last modified: 12 Mar 2008, 0:16:50 UTC

I'm giving these folks the benefit of the doubt here.

I'm sure they're doing the best they can to get WU's to sent out.

I wouldn't compare the issues they're having anywhere near the ongoing, regular, consistant, never ending problems at SETI.
CLICK TO HELP BUILD
ID: 2175 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
ProfileTravis
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist

Send message
Joined: 30 Aug 07
Posts: 2046
Credit: 26,480
RAC: 0
10 thousand credit badge15 year member badge
Message 2215 - Posted: 13 Mar 2008, 22:24:35 UTC - in response to Message 2175.  

I'm giving these folks the benefit of the doubt here.

I'm sure they're doing the best they can to get WU's to sent out.

I wouldn't compare the issues they're having anywhere near the ongoing, regular, consistant, never ending problems at SETI.


i'm really not quite sure what's up with the server. it hadn't crashed -- but for some reason all the directories where we were storing results (and generating new ones from) had lost all read privileges which really screwed up our assimilator. i'm going to send this off to labstaff to see what they think the problem could be.
ID: 2215 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2

Message boards : Number crunching : Server Outages

©2023 Astroinformatics Group