Thread 'Anything and Everything to do with (WCG) World Community Grid'

MyrCu
Joined: 27 Aug 22
Posts: 38
Message 117134 - Posted: 21 Oct 2025, 13:24:00 UTC - in response to Message 117132.  
Last modified: 21 Oct 2025, 13:29:19 UTC

And now there are some MCM WUs on my Windows machine, too.

The first one was ready in 2 minutes 16 seconds.
ID: 117134
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117135 - Posted: 21 Oct 2025, 13:38:34 UTC

So far I've received 75 of the short test tasks.
ID: 117135
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117136 - Posted: 21 Oct 2025, 13:42:45 UTC - in response to Message 117133.  

In reply to bill's message of 21 Oct 2025:
It's a miracle! I got tasks!

Whoops...they seem to only run 3:30 minutes each.
Yes, they are special short test tasks, as previously mentioned by the WCG team on the Operational Status page, on October 18.

Quote from the relevant part: "We are sending small batches of workunits out starting tonight with batch IDs in the range 9999900+ for MCM1 to test the new distributed partition-aware batch upserting app-specific create_work daemons. The few volunteers who get these workunits before we start releasing larger batches as we gain confidence that the new system is working as expected may notice these workunits have a much smaller number of signatures and run much faster than normal. These are still meaningful workunits, but key parameters such as number of signatures to test per workunit were reduced so we could get feedback quickly."
ID: 117136
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117137 - Posted: 21 Oct 2025, 13:47:34 UTC

And a new update from the team was just posted to the Operational Status page:

October 21, 2025
Finally stress testing rather than correctness testing.
Sent a batch of 100,000 workunits (fast running, not full size in case something crashed).
Thank you for your patience and continued support.
ID: 117137
bill
Joined: 11 Sep 15
Posts: 8
United States
Message 117138 - Posted: 21 Oct 2025, 14:10:13 UTC - in response to Message 117136.  

Thanks for the info. I've been out of the country so I'm pretty much in the dark about things (as if this isn't my normal state).
ID: 117138
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117139 - Posted: 21 Oct 2025, 14:49:11 UTC

Another batch, with somewhat longer-running tasks, is being sent out now.
ID: 117139
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117142 - Posted: 21 Oct 2025, 19:51:19 UTC

New test batch coming in now. The tasks seem to have more normal crunch times.
ID: 117142
jay_e
Joined: 8 Mar 07
Posts: 123
United States
Message 117147 - Posted: 22 Oct 2025, 2:13:58 UTC - in response to Message 117142.  
Last modified: 22 Oct 2025, 2:15:47 UTC

Greetings,
Sharing good news.

Earlier today, I received some MCM WUs.
Just now, I requested enough to fill 0.5 day of work.
The new MCM WUs have an estimated time of 1 day and 18 minutes.
I am running Linux on an older laptop (1.9 GHz Intel i7), and the actual times have been running an hour or two less.

Cheers!
Jay
ID: 117147
robsmith
Volunteer tester
Help desk expert
Joined: 25 May 09
Posts: 1383
United Kingdom
Message 117148 - Posted: 22 Oct 2025, 7:20:10 UTC - in response to Message 117147.  

The most recent batch of MCM tasks has very large guesstimated run times to start with, but in reality the tasks are taking about an hour on my computer (which is about twice as fast as yours). This is mainly because they are a "new" type of task, and by design BOINC's initial runtime guesses are far too long. So don't worry: eventually BOINC will catch up with reality and give better guesses from the start.
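
As a rough illustration of why the first guesses start far off and then improve, here is a minimal sketch. It assumes a very simple model in which the displayed estimate is the raw guess multiplied by a correction factor, and that factor moves a little toward the observed ratio of actual to estimated runtime after each completed task. This is not BOINC's actual estimation code; the function names, the smoothing weight, and the numbers are made up for illustration.

```python
from typing import List


def updated_correction(correction: float, raw_estimate_s: float,
                       actual_s: float, weight: float = 0.1) -> float:
    """Move the correction factor a fraction `weight` toward the observed ratio.

    Hypothetical smoothing rule for illustration only, not BOINC's algorithm.
    """
    observed_ratio = actual_s / raw_estimate_s
    return correction + weight * (observed_ratio - correction)


def simulate(raw_estimate_s: float, actual_s: float, tasks: int) -> List[float]:
    """Return the corrected estimate shown before each of `tasks` tasks."""
    correction = 1.0  # no history yet, so the raw guess is shown unchanged
    shown = []
    for _ in range(tasks):
        shown.append(raw_estimate_s * correction)
        correction = updated_correction(correction, raw_estimate_s, actual_s)
    return shown


if __name__ == "__main__":
    # Raw guess of about one day, tasks that really take about one hour.
    for i, est in enumerate(simulate(86_400.0, 3_600.0, 30), start=1):
        if i in (1, 10, 30):
            print(f"task {i:2d}: estimate {est / 3600:.1f} h")
```

In this toy model the estimate only ever shrinks toward the real runtime, which matches the general experience that the guesses get better once a host has returned a number of tasks of the new type.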
ID: 117148
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117155 - Posted: 23 Oct 2025, 1:19:14 UTC
Last modified: 23 Oct 2025, 1:21:48 UTC

More test batches (MCM1_9999XXX) are being sent out now. Don't expect finished tasks to validate, though. The validators still seem to be a work in progress.
Out of many hundreds of test tasks that I have crunched, only 3 have been validated.

The tasks crunched before or during the migration, and those uploaded and reported when WCG came back, are all still "pending validation".

But, who's in a hurry really? Life goes on, with or without WCG validating :-)
ID: 117155
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117158 - Posted: 23 Oct 2025, 2:44:38 UTC

I think the WCG scheduler is a bit busy now. Most of the time there is no problem, but from time to time the response to a request is:

Scheduler request failed: Error 408
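
HTTP 408 is "Request Timeout", which fits a busy server. The BOINC client defers and retries scheduler requests by itself, so nothing needs to be done by hand. Purely to illustrate the general idea of retrying with a growing delay, here is a minimal sketch; `send_request`, the delay values, and the attempt limit are hypothetical and are not taken from the BOINC client.

```python
import random
import time
from typing import Callable


def retry_with_backoff(send_request: Callable[[], int],
                       max_attempts: int = 5,
                       base_delay_s: float = 1.0,
                       max_delay_s: float = 60.0,
                       sleep: Callable[[float], None] = time.sleep) -> int:
    """Call `send_request` until it returns a status other than 408.

    Waits base_delay_s * 2**attempt (capped, with random jitter) between
    tries and returns the last status code seen.
    """
    status = send_request()
    for attempt in range(max_attempts - 1):
        if status != 408:
            break
        delay = min(max_delay_s, base_delay_s * 2 ** attempt)
        sleep(delay * random.uniform(0.5, 1.0))  # jitter spreads out retries
        status = send_request()
    return status


if __name__ == "__main__":
    # Fake scheduler that times out twice and then answers normally.
    replies = iter([408, 408, 200])
    print(retry_with_backoff(lambda: next(replies), sleep=lambda s: None))
```

The jitter is there so that many hosts hitting the same timeout do not all retry at the same moment.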
ID: 117158
Dave
Help desk expert
Joined: 28 Jun 10
Posts: 3029
United Kingdom
Message 117170 - Posted: 23 Oct 2025, 16:12:10 UTC

The validators still seem to be a work in progress.
Out of many hundreds of test tasks that I have crunched, only 3 have been validated.
Yes. I'm nowhere near one hundred here, never mind hundreds, but only one task has validated since the resumption, across desktop and Android.
ID: 117170
bill
Joined: 11 Sep 15
Posts: 8
United States
Message 117172 - Posted: 23 Oct 2025, 17:04:13 UTC - in response to Message 117155.  
Last modified: 23 Oct 2025, 17:05:38 UTC

In reply to Grumpy Swede's message of 23 Oct 2025:
Out of many hundreds of test tasks that I have crunched, only 3 have been validated.

Hehehe...at one point I had something like 2000 waiting for them to come back on-line. They've since transitioned to never-never-land, hopefully not the great-bit-bucket-in-the-sky.
ID: 117172
jay_e
Joined: 8 Mar 07
Posts: 123
United States
Message 117194 - Posted: 24 Oct 2025, 14:59:47 UTC - in response to Message 117172.  
Last modified: 24 Oct 2025, 15:01:53 UTC

In reply to Grumpy Bill's message of 23 Oct 2025:
The Jurisica status said:

October 21, 2025
Finally stress testing rather than correctness testing.
Sent a batch of 100,000 workunits (fast running, not full size in case something crashed).
Thank you for your patience and continued support.


My assumption is/was that a stress test is run on a fully operational system.

Doing a stress test without processing of completed results to both WU partners seems like an incomplete test - to me.

----
Also, I had several WUs with pending results that were sent to me on 8/29 and returned on 9/24.
Indeed, if you were my partner on any of these WUs, and if they went to the Great Bit Bucket in The Sky,
I would think that Jurisica/Krembil should tell us.

With that in mind, looking at the Operational Status, I see that it is server-centric.
There is no status focused on the users/volunteers. For example:
1) Were WUs sent to users lost (whether or not they were completed)?
2) If not lost, is the process to credit returned WUs in place? Repaired? In progress?
3) If that process is restored, when will users/volunteers see their results?
4) What is the ETA for BOINC Manager's graph of host/user statistics to work again?
(What happened to our work?)

A user-centric explanation of events is overdue.

Have our expectations been dulled by a lack of communication?

-- begin metaphor --
I understand the value of giving praise to the efforts of those working to save the ship.
I am not pleased by the lack of lifeboat drills and the explanation to passengers about the size of the hole in the hull.
Yes, thank you for telling us that the ship has bilge pumps.
You have not explicitly told us the ship will be made sea-worthy again, only that a speed trial is in progress.
-- end metaphor --

Thanks to Bill, Bryn, and Grumpy for their comments and questions,
Jay
ID: 117194
jay_e
Joined: 8 Mar 07
Posts: 123
United States
Message 117197 - Posted: 24 Oct 2025, 16:02:02 UTC - in response to Message 117194.  

Oops, I left something out.

https://www.worldcommunitygrid.org/ms/device/viewDevices.do?installedSince=0&lastResult=0
does not show any work done since 8/15/2025.

Jay
ID: 117197
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117202 - Posted: 24 Oct 2025, 20:36:03 UTC

A new short update from Dylan, the WCG tech on the WCG forum:

"blocked uploads to freeze Kafka topics while I try to fix the validation throughput"
https://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,47520_offset,260#706749
ID: 117202
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117206 - Posted: 25 Oct 2025, 0:46:17 UTC

And a new update on the WCG Operational Status page

October 24, 2025
We have paused uploads and release of test batches while we work on the validation throughput issue.

We think we have identified the root cause of the low validation rate for the MCM1 test batches, and we will send a few more test batches to confirm the fix works.

If this is the final fix and we see the expected validation rate for pairs of MCM1 uploads for new batches, we will replay the Kafka consumer on the upload events fired for test batches received earlier in the week, and this should idempotently allow the new batch assimilator to process those validations and assign credit.

If the above goes well, we will schedule regular MCM1 batches to resume instead of the test batches.

As volunteers have noted, we have not yet reconciled uploads of regular MCM1 results submitted before we began sending test batches, and before the migration, but we have those files and will be able to do this in a batch update once the path for new workunits is working as described above.

Naturally, we will resume ARP and MAM only after these issues are fully resolved.
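
The "replay ... idempotently" part is what should keep credit from being granted twice when the earlier upload events are processed again. Here is a minimal sketch of that idea, assuming each upload event carries a unique result ID and that the assimilator remembers which IDs it has already credited. The event fields, the class names, and the credit values are hypothetical; nothing here reflects WCG's actual code or its Kafka setup.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Set


@dataclass(frozen=True)
class UploadEvent:
    result_id: str   # hypothetical unique key for one uploaded result
    user: str
    credit: float


@dataclass
class Assimilator:
    credited_ids: Set[str] = field(default_factory=set)
    totals: Dict[str, float] = field(default_factory=dict)

    def process(self, event: UploadEvent) -> bool:
        """Grant credit once per result_id; return False for a repeat."""
        if event.result_id in self.credited_ids:
            return False
        self.credited_ids.add(event.result_id)
        self.totals[event.user] = self.totals.get(event.user, 0.0) + event.credit
        return True

    def replay(self, events: Iterable[UploadEvent]) -> int:
        """Process a stream of events and return how many were newly credited."""
        return sum(self.process(e) for e in events)


if __name__ == "__main__":
    log = [UploadEvent("r1", "alice", 10.0), UploadEvent("r2", "bob", 12.5)]
    assimilator = Assimilator()
    # Replaying the same log a second time should credit nothing new.
    print(assimilator.replay(log), assimilator.replay(log), assimilator.totals)
```

Because repeats are skipped by ID, running the same event log through the assimilator any number of times leaves the totals unchanged after the first pass.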
ID: 117206
robsmith
Volunteer tester
Help desk expert
Joined: 25 May 09
Posts: 1383
United Kingdom
Message 117225 - Posted: 26 Oct 2025, 8:12:11 UTC - in response to Message 117194.  

My assumption is/was that a stress test is run on a fully operational system.

Doing a stress test without processing of completed results to both WU partners seems like an incomplete test - to me.


That entirely depends on what is being stress tested.
If the test is of the ability to deliver and receive lots of tasks and results quickly then using small (incomplete) tasks is far better than complete tasks. Small tasks can also be used to make sure that other bits of BOINC's server-side activities are working correctly. Indeed this approach appears to have provided some data that's of use in resolving the current validation process bottleneck - getting back loads of tasks (albeit small ones) allows some human validation to take place and compare this with the computed validation (which is much harder with large results?).


On your other questions about the lack of user-related data: I agree. I think this gap is part of the system they inherited from IBM, whereas that data is available in "BOINC-BOINC" as used by so many projects.
ID: 117225
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117231 - Posted: 26 Oct 2025, 14:43:01 UTC
Last modified: 26 Oct 2025, 15:10:51 UTC

Something is happening now. Previous test batches are beginning to validate, and they have sent out at least one new test batch. I did not catch any of those, though, because I had my computer turned off. As of now, no new tasks are available.

Edit, added: I think the validation of previous test tasks stopped well before all test tasks had been validated.
ID: 117231
Grumpy Swede
Joined: 30 Mar 20
Posts: 584
Sweden
Message 117235 - Posted: 26 Oct 2025, 23:39:17 UTC

ID: 117235

Copyright © 2025 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.