2022-08-05 Update (workunit shortage explanation) |
Hi everyone, many volunteers have been asking about the continued drought of workunits. We have been working hard to resolve related problems, so we can restart the full scale of WCG, but, here are some challenges that we are still facing.
We have been working on several issues, and finally resolved some of them:
- fixed the issue whereby OPNG workunits were transitioning into an incorrect state due to repeated pre-emption by higher priority processes when no CPU resources were available to the scheduler.
- provisioned additional servers to resolve multiple issues stemming from insufficient vCPU and oversubscribed services pre-empting processes that would otherwise have completed normally.
- fixed all OPNG batches that had transitioned to a frozen/idle state due to the above going unnoticed for too long.
While technically, we can now push out new workunits, we ran into some network problems at our data center. Unfortunately, vacation schedules of network engineers in the data center dictate that the earliest we might expect a resolution is August 8. Due to recent personnel changes the center does not have anyone on call to assist any earlier. As a result of these circumstances and the current status of the WCG infrastructure,
we should be able to start sending an appreciable number of new WUs for you to crunch on or shortly after August 8th, 2022.
We will update you early next week on the status of these issues, as we also hope that once we complete this work we will be able to fully restart.
Thank you for your support, patience and understanding.
Have a great weekend!
- WCG Team