Message boards : Projects : Anything and Everything to do with (WCG) World Community Grid
Message board moderation
Previous · 1 . . . 36 · 37 · 38 · 39 · 40 · 41 · 42 . . . 52 · Next
| Author | Message |
|---|---|
|
Send message Joined: 1 Jul 16 Posts: 217
|
Not to be surprised, a project rarely gets mt right the first time. |
DaveSend message Joined: 28 Jun 10 Posts: 3259
|
In reply to mmonnin's message of 7 Jan 2026: Not to be surprised, a project rarely gets mt right the first time. True. CPDN took quite a few goes before getting it right, and they are adapting models that were originally designed to run on European Centre for Medium range Weather Forecasting supercomputers so would originally have been designed for MT. |
|
Send message Joined: 30 Mar 20 Posts: 707
|
A new big update posted on https://www.cs.toronto.edu/~juris/jlab/wcg.html (Operational Status Tab) |
|
Send message Joined: 7 Apr 13 Posts: 72
|
Our fix adds fallback/fallthrough logic to the validator_assimilator daemon to facilitate remote file retrieval and process the tens of millions of backlog events we published to the queue it consumes from. Wowzer! Appears it's gonna be quite a while... |
DaveSend message Joined: 28 Jun 10 Posts: 3259
|
Uploads now failing and no new tasks available. Edit: I couldn't find anything on WCG fora so have posted there. |
|
Send message Joined: 5 Nov 11 Posts: 46
|
Same here uploads failing ….log says an HTTP transient error. |
|
Send message Joined: 24 Dec 10 Posts: 102
|
In reply to Sir LanDroid's message of 9 Jan 2026: Our fix adds fallback/fallthrough logic to the validator_assimilator daemon to facilitate remote file retrieval and process the tens of millions of backlog events we published to the queue it consumes from. Over 16 million were validated in 1 day recently so not too long, once ready to go. Paul. |
DaveSend message Joined: 28 Jun 10 Posts: 3259
|
In reply to Hadrian's message of 9 Jan 2026: Same here uploads failing ….log says an HTTP transient error. At some point overnight (UK) uploads completed and work started flowing again. |
|
Send message Joined: 11 Sep 15 Posts: 36
|
Just a FYI. I just installed a new OS, Leap 16, on a computer and it's communicating fine with this place. The important thing was that I didn't need to fiddle with the hosts file. A giant BRAVO to the admin folks, they seem to have fixed that glitch. Now, about that problem with the... :-) |
|
Send message Joined: 31 Dec 18 Posts: 343
|
In reply to bill's message of 10 Jan 2026: Just a FYI. I just installed a new OS, Leap 16, on a computer and it's communicating fine with this place. The important thing was that I didn't need to fiddle with the hosts file. A giant BRAVO to the admin folks, they seem to have fixed that glitch. Now, about that problem with the... :-) Thanks for pointing that out, I should have realised it myself when I loaded my new laptop a few weeks ago. |
DaveSend message Joined: 28 Jun 10 Posts: 3259
|
Now getting, Mon 12 Jan 2026 09:09:25 AM GMT | World Community Grid | Reporting 1 completed tasks Mon 12 Jan 2026 09:09:25 AM GMT | World Community Grid | Requesting new tasks for CPU Mon 12 Jan 2026 09:09:26 AM GMT | World Community Grid | Scheduler request completed: got 0 new tasks Mon 12 Jan 2026 09:09:26 AM GMT | World Community Grid | Server error: feeder not runningCompleted task not reporting. |
|
Send message Joined: 30 Mar 20 Posts: 707
|
In reply to Dave's message of 12 Jan 2026: Now getting,Feeder issue reported to Igor Jurisica, and the support mail. I also reminded them about that after the feeder problems has been fixed, the Device profile changes, usually doesn't propagate to the server and the BOINC client. But apart from that, life as we know it, still exists on planet Earth. |
DaveSend message Joined: 28 Jun 10 Posts: 3259
|
And now crunching again. I haven't checked out device profile problems. |
|
Send message Joined: 30 Mar 20 Posts: 707
|
In reply to Dave's message of 12 Jan 2026: And now crunching again. I haven't checked out device profile problems.Device profile settings does propagate as they should. |
DaveSend message Joined: 28 Jun 10 Posts: 3259
|
Now not getting any tasks on either Linux or Windows. (No tasks available message.) |
|
Send message Joined: 5 Nov 11 Posts: 46
|
Far be it from me to complain but I notice that my recorded contributions to every WCG project have doubled. |
|
Send message Joined: 1 Mar 23 Posts: 15
|
WCG update: January 15, 2026 We have lost access to the data center - trying to contact them - hopefully we will have some update soon. |
|
Send message Joined: 30 Mar 20 Posts: 707
|
In reply to Hadrian's message of 15 Jan 2026: Far be it from me to complain but I notice that my recorded contributions to every WCG project have doubled.That is a known issue. When the WCG system comes back, you can read about it, in this WCG forum thread: My Contributions totals have DOUBLED |
|
Send message Joined: 24 Dec 10 Posts: 102
|
Now up, below from Dylan. Interim update - We have regained access to our project at Nibicloud. We have ssh access to our servers again, and I am in the process of damage control now before restarting the feeder. Most of our servers/VMs remained online during the outage, but some appear to have been soft rebooted, losing in-memory caches that I need to repopulate from Kafka/Redpanda. Should have everything back up and running "soon", somewhere in the hours to tomorrow morning range as my current best estimate. Validation should improve when I am done, as I have the opportunity to push some changes and separate the validation streams for old result pair upload events, vs. new result pair upload events, and launch additional validators with code changes to stripe them on workunit ID within the node-local partition, and do a second tier of batching to keep load on the BOINC db from spiking from multiple validator_assimilator daemons trying to batch update state and credit at once. I will update here in the forums once I get through everything, and hopefully can address some of the concerns raised in the forums. If all goes well, plan is to start MAM1 beta for Windows as soon as MCM1 is flowing and validating again, along with a new build for Linux, both will run through rounds of beta30 before we run some smoke tests in the MAM1_9999903+ range. Paul. |
|
Send message Joined: 30 Mar 20 Posts: 707
|
Yeah, but he didn't mention the TZ , of this "back up and running "soon", somewhere in the hours to tomorrow morning range." It could be his TZ, or UTC. Big difference..... |
Copyright © 2026 University of California.
Permission is granted to copy, distribute and/or modify this document
under the terms of the GNU Free Documentation License,
Version 1.2 or any later version published by the Free Software Foundation.