Thread 'Anything and Everything to do with (WCG) World Community Grid'

Message boards : Projects : Anything and Everything to do with (WCG) World Community Grid
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 36 · 37 · 38 · 39 · 40 · 41 · 42 . . . 52 · Next

AuthorMessage
mmonnin

Send message
Joined: 1 Jul 16
Posts: 217
United States
Message 117999 - Posted: 7 Jan 2026, 22:26:01 UTC

Not to be surprised, a project rarely gets mt right the first time.
ID: 117999 · Report as offensive     Reply Quote
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 3259
United Kingdom
Message 118001 - Posted: 8 Jan 2026, 9:15:40 UTC - in response to Message 117999.  

In reply to mmonnin's message of 7 Jan 2026:
Not to be surprised, a project rarely gets mt right the first time.

True. CPDN took quite a few goes before getting it right, and they are adapting models that were originally designed to run on European Centre for Medium range Weather Forecasting supercomputers so would originally have been designed for MT.
ID: 118001 · Report as offensive     Reply Quote
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 707
Sweden
Message 118011 - Posted: 8 Jan 2026, 21:07:42 UTC

A new big update posted on https://www.cs.toronto.edu/~juris/jlab/wcg.html (Operational Status Tab)
ID: 118011 · Report as offensive     Reply Quote
Sir LanDroid

Send message
Joined: 7 Apr 13
Posts: 72
United States
Message 118014 - Posted: 9 Jan 2026, 14:38:40 UTC
Last modified: 9 Jan 2026, 14:41:54 UTC

Our fix adds fallback/fallthrough logic to the validator_assimilator daemon to facilitate remote file retrieval and process the tens of millions of backlog events we published to the queue it consumes from.

Wowzer! Appears it's gonna be quite a while...
ID: 118014 · Report as offensive     Reply Quote
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 3259
United Kingdom
Message 118016 - Posted: 9 Jan 2026, 14:40:59 UTC
Last modified: 9 Jan 2026, 14:45:51 UTC

Uploads now failing and no new tasks available.
Edit: I couldn't find anything on WCG fora so have posted there.
ID: 118016 · Report as offensive     Reply Quote
Hadrian

Send message
Joined: 5 Nov 11
Posts: 46
United Kingdom
Message 118017 - Posted: 9 Jan 2026, 15:36:06 UTC - in response to Message 118016.  

Same here uploads failing ….log says an HTTP transient error.
ID: 118017 · Report as offensive     Reply Quote
PMH_UK

Send message
Joined: 24 Dec 10
Posts: 102
United Kingdom
Message 118018 - Posted: 9 Jan 2026, 15:38:45 UTC - in response to Message 118014.  
Last modified: 9 Jan 2026, 15:39:09 UTC

In reply to Sir LanDroid's message of 9 Jan 2026:
Our fix adds fallback/fallthrough logic to the validator_assimilator daemon to facilitate remote file retrieval and process the tens of millions of backlog events we published to the queue it consumes from.

Wowzer! Appears it's gonna be quite a while...

Over 16 million were validated in 1 day recently so not too long, once ready to go.
Paul.
ID: 118018 · Report as offensive     Reply Quote
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 3259
United Kingdom
Message 118026 - Posted: 10 Jan 2026, 8:54:39 UTC - in response to Message 118017.  

In reply to Hadrian's message of 9 Jan 2026:
Same here uploads failing ….log says an HTTP transient error.

At some point overnight (UK) uploads completed and work started flowing again.
ID: 118026 · Report as offensive     Reply Quote
bill
Avatar

Send message
Joined: 11 Sep 15
Posts: 36
United States
Message 118036 - Posted: 10 Jan 2026, 20:24:02 UTC

Just a FYI. I just installed a new OS, Leap 16, on a computer and it's communicating fine with this place. The important thing was that I didn't need to fiddle with the hosts file. A giant BRAVO to the admin folks, they seem to have fixed that glitch. Now, about that problem with the... :-)
ID: 118036 · Report as offensive     Reply Quote
Bryn Mawr
Help desk expert

Send message
Joined: 31 Dec 18
Posts: 343
United Kingdom
Message 118044 - Posted: 11 Jan 2026, 9:40:57 UTC - in response to Message 118036.  

In reply to bill's message of 10 Jan 2026:
Just a FYI. I just installed a new OS, Leap 16, on a computer and it's communicating fine with this place. The important thing was that I didn't need to fiddle with the hosts file. A giant BRAVO to the admin folks, they seem to have fixed that glitch. Now, about that problem with the... :-)


Thanks for pointing that out, I should have realised it myself when I loaded my new laptop a few weeks ago.
ID: 118044 · Report as offensive     Reply Quote
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 3259
United Kingdom
Message 118052 - Posted: 12 Jan 2026, 9:17:34 UTC

Now getting,
Mon 12 Jan 2026 09:09:25 AM GMT | World Community Grid | Reporting 1 completed tasks
Mon 12 Jan 2026 09:09:25 AM GMT | World Community Grid | Requesting new tasks for CPU
Mon 12 Jan 2026 09:09:26 AM GMT | World Community Grid | Scheduler request completed: got 0 new tasks
Mon 12 Jan 2026 09:09:26 AM GMT | World Community Grid | Server error: feeder not running
Completed task not reporting.
ID: 118052 · Report as offensive     Reply Quote
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 707
Sweden
Message 118054 - Posted: 12 Jan 2026, 12:03:00 UTC - in response to Message 118052.  

In reply to Dave's message of 12 Jan 2026:
Now getting,
Mon 12 Jan 2026 09:09:25 AM GMT | World Community Grid | Reporting 1 completed tasks
Mon 12 Jan 2026 09:09:25 AM GMT | World Community Grid | Requesting new tasks for CPU
Mon 12 Jan 2026 09:09:26 AM GMT | World Community Grid | Scheduler request completed: got 0 new tasks
Mon 12 Jan 2026 09:09:26 AM GMT | World Community Grid | Server error: feeder not running
Completed task not reporting.
Feeder issue reported to Igor Jurisica, and the support mail. I also reminded them about that after the feeder problems has been fixed, the Device profile changes, usually doesn't propagate to the server and the BOINC client.

But apart from that, life as we know it, still exists on planet Earth.
ID: 118054 · Report as offensive     Reply Quote
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 3259
United Kingdom
Message 118055 - Posted: 12 Jan 2026, 16:33:23 UTC

And now crunching again. I haven't checked out device profile problems.
ID: 118055 · Report as offensive     Reply Quote
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 707
Sweden
Message 118061 - Posted: 12 Jan 2026, 20:29:23 UTC - in response to Message 118055.  

In reply to Dave's message of 12 Jan 2026:
And now crunching again. I haven't checked out device profile problems.
Device profile settings does propagate as they should.
ID: 118061 · Report as offensive     Reply Quote
ProfileDave
Help desk expert

Send message
Joined: 28 Jun 10
Posts: 3259
United Kingdom
Message 118091 - Posted: 15 Jan 2026, 10:29:44 UTC - in response to Message 118061.  

Now not getting any tasks on either Linux or Windows. (No tasks available message.)
ID: 118091 · Report as offensive     Reply Quote
Hadrian

Send message
Joined: 5 Nov 11
Posts: 46
United Kingdom
Message 118092 - Posted: 15 Jan 2026, 11:46:48 UTC

Far be it from me to complain but I notice that my recorded contributions to every WCG project have doubled.
ID: 118092 · Report as offensive     Reply Quote
MJH333

Send message
Joined: 1 Mar 23
Posts: 15
United Kingdom
Message 118094 - Posted: 15 Jan 2026, 17:40:47 UTC - in response to Message 118092.  
Last modified: 15 Jan 2026, 17:41:18 UTC

WCG update:
January 15, 2026
We have lost access to the data center - trying to contact them - hopefully we will have some update soon.
ID: 118094 · Report as offensive     Reply Quote
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 707
Sweden
Message 118095 - Posted: 15 Jan 2026, 18:22:05 UTC - in response to Message 118092.  
Last modified: 15 Jan 2026, 18:22:45 UTC

In reply to Hadrian's message of 15 Jan 2026:
Far be it from me to complain but I notice that my recorded contributions to every WCG project have doubled.
That is a known issue. When the WCG system comes back, you can read about it, in this WCG forum thread:
My Contributions totals have DOUBLED
ID: 118095 · Report as offensive     Reply Quote
PMH_UK

Send message
Joined: 24 Dec 10
Posts: 102
United Kingdom
Message 118097 - Posted: 15 Jan 2026, 21:53:50 UTC

Now up, below from Dylan.
Interim update -

We have regained access to our project at Nibicloud. We have ssh access to our servers again, and I am in the process of damage control now before restarting the feeder.

Most of our servers/VMs remained online during the outage, but some appear to have been soft rebooted, losing in-memory caches that I need to repopulate from Kafka/Redpanda. Should have everything back up and running "soon", somewhere in the hours to tomorrow morning range as my current best estimate.

Validation should improve when I am done, as I have the opportunity to push some changes and separate the validation streams for old result pair upload events, vs. new result pair upload events, and launch additional validators with code changes to stripe them on workunit ID within the node-local partition, and do a second tier of batching to keep load on the BOINC db from spiking from multiple validator_assimilator daemons trying to batch update state and credit at once.

I will update here in the forums once I get through everything, and hopefully can address some of the concerns raised in the forums. If all goes well, plan is to start MAM1 beta for Windows as soon as MCM1 is flowing and validating again, along with a new build for Linux, both will run through rounds of beta30 before we run some smoke tests in the MAM1_9999903+ range.
Paul.
ID: 118097 · Report as offensive     Reply Quote
Grumpy Swede
Avatar

Send message
Joined: 30 Mar 20
Posts: 707
Sweden
Message 118098 - Posted: 15 Jan 2026, 23:19:27 UTC

Yeah, but he didn't mention the TZ , of this "back up and running "soon", somewhere in the hours to tomorrow morning range."
It could be his TZ, or UTC. Big difference.....
ID: 118098 · Report as offensive     Reply Quote
Previous · 1 . . . 36 · 37 · 38 · 39 · 40 · 41 · 42 . . . 52 · Next

Message boards : Projects : Anything and Everything to do with (WCG) World Community Grid

Copyright © 2026 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.