ServiceStatusRemarks
Snellius

UP AND RUNNING

On June 17th, due to redeploying a new image to fix kernel issue, some nodes are in draining status. This is temporary and the availability of worker nodes will improve. The queued jobs will be allocated once the nodes become available again.

For non-critical issues, please visit Snellius known issues. For recent changes, check the Snellius maintenance changelog.

On June 1st at 20:30 the archive became unresponsive and was unmounted on the interactive nodes and the staging partition to ensure the stability of the system. It went back into production at 13:45 on June 2nd.

OSSC

UP AND RUNNING



Data Archive

UP AND RUNNING

Previous Downtimes:

  • June 12th 1830- June 18th 1700: unplanned outage due to software failure
  • Tuesday, June 2nd 2026: unplanned outage
  • Tuesday, May 19th 2026: planned maintenance
  • May 15th - 09:30 May 18th 2026: Service nodes temporarily unavailable due to a recently disclosed local privilege escalation vulnerability
  • Tuesday, Mar 17 2026: planned maintenance
  • Monday Feb 09 2026: planned maintenance
  • Tuesday 06 Jan 2026, 9:00-13:00 planned maintenance.
  • Monday 15 Dec 2025, 9:30-9:54 Staging stopped to update and tune performance parameters. 

Data Repository

https://repository.surfsara.nl/







repository.surf.nl (New Pilot Environment)

UP AND RUNNING



Users of the Repository may experience delays in response times following a recent migration. The underlying issues have been resolved. We apologise for the inconvenience and appreciate your patience. Let us know via the service desk if you are still experiencing issues or need support.

UP AND RUNNING

repository.surf.nl (New Pilot Environment and REST API - New deposits)

Previous Issues:

  • June 12th 1830- June 18th 1700: partial outage due to software failure on the storage backend
  • June 15th 2026: unexpected outage of file storage system
  • May 19th 2026: planned maintenance
  • April 23: 15:00 - 15:30: S3 uploads may experience a slight bottleneck as we improve service performance and stability.
  • April 21: We’re currently experiencing issues with the web UI in the new deposit environment. API uploads are working as expected .
Persistent Identifiers (PIDs)

UP AND RUNNING

 

Research Drive

UP AND RUNNING


B2SAFE

UP AND RUNNING


Data Processing - Grid

UP AND RUNNING

May 18 at 09:00, in order to keep the service up and running safely, some maintenance will be done during the day, to address a CVE. No downtime is expected.

Grid Storage (dCache)

UP AND RUNNING

Object Store

UP AND RUNNING


Data Processing - Spider

UP AND RUNNING

See Spider Service Notices


We are experiencing ongoing CephFS stability issues for our project and home directories

From June 1 onward we will roll out changes to the storage system to improve system stability: 

  • Upgrade to CephFS has been implemented successfully
  • Multi MDS configured
  • Mulit mount points added to multiple nodes, will continue to roll out to more in the coming weeks 

We expect these changes to increase filesystem stability and reduce the impact of users on each other

For now we are waiting to see how these changes effect stability before continue to adjust the system

Research Cloud

UP AND RUNNING

See Service Notices

HPC Cloud

UP AND RUNNING

See Service Notices
SURFdrive

UP AND RUNNING

25 September: There was a brief disruption for users logging into the web-interface of SURFdrive. This has now been resolved.
SOIL Cluster

UP AND RUNNING


RDM Scale-out

UP AND RUNNING


Yoda Hosting

UP AND RUNNING


SURF Research Access Management

UP AND RUNNING

Regular maintenance on every third Thursday of the month between 5:00 and 7:00 am.

  • No labels