ASPIRE 2A+: Expect Longer Wait Times for Large Jobs from 14 Oct 2025, 6:00 PM (SGT)​

Dear ASPIRE 2A+ Users,

Please note that starting from 14 October 2025 (Tuesday), 6:00 PM (SGT), a project reservation will temporarily reduce the general resource pool in the ASPIRE 2A+ system.

Impact:

  • Large jobs (More than 8 GPUs): Expect longer queue wait times.
  • Small jobs (1–8 GPUs): No impact.

Tips:

  • To reduce delays, please enable checkpointing or pre-emption where possible.
  • Users are encouraged to schedule job runs between 6:00 PM to 10:00 AM to leverage off-peak hours, which may result in shorter queue times.

Thanks for your patience and cooperation. Kindly look out for updates via the MOTD or email announcement if there are any changes to the arrangement.

Should you have any questions or need assistance, please contact our Helpdesk via the Service Desk Portal or email us at [email protected].

Thank you.

Warm regards,
The NSCC Team

[Resolved] Service Disruption for Users Accessing ASPIRE 2A​

Dear ASPIRE 2A Users,

 

We are pleased to inform you that the issue with the Lustre storage system issue has been resolved as of 1:15 AM, 4 October 2025. You may proceed to login to the ASPIRE 2A system as per normal.

 

We apologise for any inconvenience this may cause and thank you for your understanding. Should you have any questions or need assistance, please contact our Helpdesk via the Service Desk Portal or email us at [email protected].

 
Thank you.

 

Warm regards,

The NSCC Team

[Update 2] Service Disruption for Users Accessing ASPIRE 2A​

Dear ASPIRE 2A Users,

This is to update you on the service disruption on the ASPIRE 2A system.

Actions Taken:
  • Conducted checks on the hardware and filesystem.
Next Steps:
  • HPE engineers and the NSCC team will continue the filesystem recovery to ensure the integrity of data.

Next Update: 9:00 AM, 4 October 2025
 

Thank you.

Warm regards,
The NSCC Team

[Update] Service Disruption for Users Accessing ASPIRE 2A​

Dear ASPIRE 2A Users,

This is to update you on the service disruption on the ASPIRE 2A system.

Actions Taken:
  • The replacement parts have arrived, and the hardware replacement was successfully completed at 1:15 PM.
Next Steps:
  • Conduct checks on the hardware and filesystems.
  • HPE engineers, supported by the NSCC team, will proceed with recovery.

Next Update: 8:00 PM, 3 October 2025
 

Thank you.

Warm regards,
The NSCC Team

Service Disruption for Users Accessing ASPIRE 2A

Dear ASPIRE 2A Users,

We wish to inform you that there is a service disruption on the ASPIRE 2A system. Our team is diligently investigating the issue and working towards a swift resolution.

  • Start Time: 7:15 AM

  • Type: Priority 1 outage

Cause of Disruption:

  • Issues with the Lustre storage system.

  • A storage volume consisting of 53 HDDs was lost from both HA pair nodes. Automatic failover was unsuccessful.

Impact of the Disruption:

  • All users are currently unable to access the ASPIRE 2A system.

Actions Taken:

  • Initial troubleshooting began after both nodes were restarted at 8:00 AM.

  • Job dispatch has been temporarily disabled.

  • Hardware issue confirmed and replacement parts are arranged.

Next Steps:

  • Replacement parts are scheduled to arrive by 2:00 PM.

  • HPE engineers, supported by the NSCC team, will proceed with recovery immediately afterwards.

 

Next Update: 2:00 PM, 3 October 2025

 

We apologise for the disruption and are treating this with the highest priority.

Should you have any questions or need assistance, please contact our Helpdesk via the Service Desk Portal or email us at [email protected] if you have any questions.

 

Thank you.

 

Warm regards,

The NSCC Team

 

Urgent Scheduled Maintenance for ASPIRE 2A on 24 Sep 2025, 9AM to 6PM

Dear ASPIRE 2A Users,

We wish to inform you of an upcoming urgent scheduled system maintenance for ASPIRE 2A to enhance its long-term reliability, uptime, and stability.

Maintenance Details:

  • Start: 24 September 2025 (Wednesday), 9.00 AM SGT
  • End: 24 September 2025 (Wednesday), 6.00 PM SGT
  • Duration: 9 hours

Purpose:

  • Replacement of the existing Checkpoint Smart-1-5050 Appliance, which is approaching End of Life (EOL), with the Checkpoint 6000 1A Gen6 Security Management Appliance.

Impact During the Maintenance Period:

  • Users may experience intermittent connectivity during the maintenance period.

We apologise for any inconvenience this may cause and thank you for your understanding. Should you have any questions or need assistance, please contact our Helpdesk via the Service Desk Portal or email us at [email protected].

Thank you.

Warm regards,
The NSCC Team

Security Notice: Upgrade NVIDIA Nemo and Framework to the Updated Version

Dear Users,

 

Multiple high-severity code-injection vulnerabilities have been identified in NVIDIA NeMo Framework and NeMo Curator. If you are using the affected products listed below, please upgrade to the recommended versions by 8 October 2025.

Affected Products

Platform or OS

Affected Versions

Updated Version

NVIDIA NeMo Framework

Windows, Linux, macOS

All versions prior to 2.4.0

2.4.0

NVIDIA NeMo Curator

Windows, Linux, macOS

All versions prior to Curator 25.07

Curator 25.07

For more information on NVIDIA’s security updates, please refer to their official advisories:

 

Please contact our Helpdesk via the Service Desk Portal or email us at [email protected] if you have any questions.

 

Thank you.

Warm regards,

The NSCC Team

 

[Resolved] Service Disruption for A*STAR Users Accessing ASPIRE2A and ASPIRE2A+

Dear A*STAR Users,

We are pleased to inform you that the network issue has been resolved. You may proceed to login to the ASPIRE 2A and 2A+ as per normal.

We apologise for any inconvenience this may cause and thank you for your understanding. Should you have any questions or need assistance, please contact our Helpdesk via the Service Desk Portal or email us at [email protected] if you have any questions.

Thank you.

Warm regards,

The NSCC Team

 

Service Disruption for A*STAR Users Accessing ASPIRE2A and ASPIRE2A+

Dear A*STAR Users,

We wish to inform you that there is a service disruption on the network access to ASPIRE 2A and 2A+ system. Our team is diligently investigating the issue and working towards a swift resolution.

Cause of Disruption:

  • Issue with network connectivity between FP1 to NUS

Impact of the Disruption:

  • A*STAR users will not be able to access the ASPIRE 2A and ASPIRE 2A+ systems from the A*STAR network.

We apologise for any inconvenience this may cause and thank you for your understanding. Should you have any questions or need assistance, please contact our Helpdesk via the Service Desk Portal or email us at [email protected] if you have any questions.

Thank you.

Warm regards,

The NSCC Team

 

[Completed] A*STAR ExaNet Network Maintenance Affecting ASPIRE 2A and ASPIRE 2A+ on 14 August 2025, from 6.30 PM to 11.30 PM​

Dear A*STAR Users,
 
We are pleased to announce that the A*STAR ExaNet network maintenance has been completed. You may proceed to use the systems as per usual.
 
Should you have any questions or need assistance, please contact our Helpdesk via the Service Desk Portal or email us at [email protected].
 
Thank you for your understanding and patience.
 
Warm regards,
The NSCC Team

Dear A*STAR Users,

We wish to inform you of an upcoming A*STAR ExaNet network maintenance on 14 August 2025 (Thursday), from 6.30 PM to 11.30 PM. All access to the ASPIRE 2A and ASPIRE 2A+ systems from the A*STAR network will be affected.
 
Maintenance Details:
  • Start: 14 August 2025 (Thursday), 6.30 PM SGT
  • End: 14 August 2025 (Thursday), 11.30 PM SGT

Purpose:
  • Maintenance of the A*STAR ExaNet network.
Impact During the Maintenance Period:
  • During this period, users will not be able to access the ASPIRE 2A and ASPIRE 2A+ systems from the A*STAR network.

Please contact our Helpdesk via the Service Desk Portal or email us at [email protected] if you have any questions.

Thank you.

Warm regards,
The NSCC Team