NSCC Service Status

ASPIRE 1 Scheduled System Maintenance from 18 Dec 2020, 3pm to 21 Dec 2020, 12pm

Dear Users,
 
Please be informed that there will be a scheduled ASPIRE 1 system maintenance from 18 Dec 2020 (Friday), 3pm to 21 Dec 2020 (Monday), 12 noon.

Please take note of the important timelines that will impact your work below: 

  • 13 Dec 2020 (Sunday), 3pm:
    • All long queues will stop dispatching jobs from the above date and time till the end of the shutdown. This is to ensure jobs do not get terminated during the shutdown period.
    • Jobs which can be completed before the shutdown may still be dispatched manually, subject to the availability of resources.        
  • 17 Dec 2020 (Thursday), 3pm:
    • Normal queues will stop dispatching jobs.
    • Jobs which can complete before the shutdown may still be dispatched manually, subject to the availability of resources.        
  • 18 Dec 2020 (Friday), 3pm:
    • All users will be logged out of the system.
    • All remaining running jobs will be allowed to terminate gracefully before the system shuts down. 
  • 21 Dec 2020 (Monday), 12pm:
    • ASPIRE 1 system will be brought back online and made available to users.
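
The timeline above can be sketched as a small check of whether a queued job could still finish before the shutdown. This is only an illustrative sketch: the dates come from the notice, but the function names, the "long"/"normal" queue labels, and the assumption that all times are Singapore time (UTC+8) are hypothetical and are not part of any NSCC or PBS Pro tooling.

```python
from datetime import datetime, timedelta, timezone

# Assumption: all times in the notice are Singapore time (UTC+8).
SGT = timezone(timedelta(hours=8))

# Dates taken from the notice above.
LONG_QUEUE_CUTOFF = datetime(2020, 12, 13, 15, 0, tzinfo=SGT)
NORMAL_QUEUE_CUTOFF = datetime(2020, 12, 17, 15, 0, tzinfo=SGT)
SHUTDOWN_START = datetime(2020, 12, 18, 15, 0, tzinfo=SGT)


def finishes_before_shutdown(start: datetime, walltime: timedelta) -> bool:
    """Return True if a job starting at `start` ends no later than the shutdown."""
    return start + walltime <= SHUTDOWN_START


def dispatch_status(queue: str, now: datetime, walltime: timedelta) -> str:
    """Classify a queued job (hypothetical helper; queue labels are illustrative)."""
    cutoff = LONG_QUEUE_CUTOFF if queue == "long" else NORMAL_QUEUE_CUTOFF
    if now < cutoff:
        return "automatic dispatch still active"
    if finishes_before_shutdown(now, walltime):
        return "may be dispatched manually, subject to resources"
    return "held until after maintenance"
```

For example, a 48-hour job in a long queue checked on 14 Dec 2020 at 9am would end on 16 Dec, before the shutdown, so it falls into the manual-dispatch case. Whether it actually runs still depends on available resources, as the notice states.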

Please contact us at [email protected] if you have any questions.

 

[Reminder] SingAREN Scheduled Maintenance on 1 Nov 2020, Sunday from 0900 hrs to 1800 hrs

Dear Users,

The SingAREN SLIX dark fibre infrastructure will be undergoing scheduled maintenance on 1 November 2020, Sunday from 0900 hrs to 1800 hrs.

During this period, access to ASPIRE 1 may be intermittently disrupted. Login sessions on the login nodes may also be terminated without notice. However, ASPIRE 1 will continue to operate normally, and jobs that are in the queue or already running will not be affected.

We apologise for the inconvenience caused.

Please contact us at [email protected] if you have any questions. 

NSCC Team 

[Notification] Network Disruption for A*STAR Users Accessing ASPIRE 1

Dear A*STAR Users,

Please be informed that the A*STAR network is currently facing some issues. Users in the corporate network may experience difficulties accessing ASPIRE 1. Those who are on Exanet (the A*STAR scientific network) are not affected.

ITSS is currently working to resolve the issue. In the meantime, you can still access ASPIRE 1 via Exanet or via the NSCC VPN (https://vpn.nscc.sg). For more information on how to access the VPN, please refer to https://help.nscc.sg.

We apologise for the inconvenience caused.

Please contact us at [email protected] if you have any questions.

SingAREN Scheduled Maintenance on 24 Oct 2020, Saturday from 0900 hrs to 1800 hrs  

Dear NUS Users,

The SingAREN SLIX dark fibre infrastructure will be undergoing scheduled maintenance on 24 October 2020, Saturday from 0900 hrs to 1800 hrs.

During this period, access to ASPIRE 1 may be intermittently disrupted. Login sessions on the NUS login nodes may also be terminated without notice. However, ASPIRE 1 will continue to operate normally, and jobs that are in the queue or already running will not be affected.

The below services will not be affected: 

  •     NSCC VPN access
  •     Internet access from NSCC

We apologise for the inconvenience caused.

Please contact us at [email protected] if you have any questions. 

Urgent Koppen Service Disruption Notification on 7 Oct [Resolved]

2020-10-08 19:25 – We are pleased to announce that the earlier issue with the compute nodes has been rectified.

We have released the job queues in the system. Please check your jobs and resubmit them if you encounter any issues.

2020-10-08 12:45 – We have conducted an investigation and discovered a hardware device failure in the compute nodes on Koppen System. We are working with the vendor to get the part replaced.

We will update you when we have any new developments. Our apologies for any inconvenience caused.

2020-10-07 18:00 – We are experiencing some issues with the compute nodes on Koppen System. We are currently working to resolve it.

We will inform users again once we have any updates on the situation. We apologise for any inconvenience caused.

Please contact us at [email protected] if you have any questions.

 

Urgent ASPIRE 1 Service Disruption Notification on 7 Sep [Resolved] 

2020-09-07 16:15 – We are pleased to announce that the earlier issue with the scratch disk has been rectified. We have released the job queues in the system. Please check your jobs and resubmit them if you encounter any issues. 

2020-09-07 14:45 – Please be informed that we are experiencing some issues with the scratch disk on ASPIRE 1. We are currently working to resolve it.  As a precaution we have stopped dispatching all jobs in queue, and all logins have been disabled.

We will inform users again once we have any updates on the situation. We apologise for any inconvenience caused.
 
Please contact us at [email protected] if you have any questions.
 

Urgent ASPIRE 1 Service Disruption Notification [Resolved]

2020-08-29 22:58 – We are pleased to announce that the earlier issue with the scratch disk has been rectified. We have released the job queues in the system. Please check your jobs and resubmit them if you encounter any issues. 

2020-08-29 21:59 – Please be informed that we are experiencing some issues with the scratch disk on ASPIRE 1. We are currently working to resolve it.  As a precaution we have stopped dispatching all jobs in queue, and all logins have been disabled.

We will inform users again once we have any updates on the situation. We apologise for any inconvenience caused.
 
Please contact us at [email protected] if you have any questions.
 

Urgent ASPIRE 1 Service Disruption Notification [Resolved]

Dear Users,

Please be informed that some users may be experiencing issues with the “rm” command for ASPIRE 1’s /home and /project volumes. We are currently working to resolve it. The /scratch and /data volumes are unaffected.

As a precaution we have temporarily stopped dispatching new jobs to the system.

We will inform users again once we have any updates on the situation. We apologise for any inconvenience caused.

Please contact us at [email protected] if you have any questions.

NSCC Team

ASPIRE 1 Scheduled Maintenance [Completed]

2020-07-20 13:38 – ASPIRE 1 maintenance has completed and users can log in now.

2020-07-20 09:00 – Commencing final system and services verification in preparation for system release.

2020-07-19 09:00 – Maintenance work is still in progress. The next update will be on 20 July 09:00 unless there is an extenuating issue.

2020-07-18 08:55 – PBS Pro and File System version upgrades in progress. The next update will be on 19 July 09:00 unless there is an extenuating issue.

2020-07-17 15:32 – Sophos software upgrades in progress.

2020-07-17 15:00 – All users logged out, commencing systems shutdown.

ASPIRE 1 Scheduled System Maintenance from 17 July 2020, 3pm to 20 July 2020, 12pm [Completed]

Dear Users,

Please be informed that there will be a scheduled ASPIRE 1 system maintenance from 17 July 2020 (Friday), 3pm to 20 July 2020 (Monday), 12 noon.

Please take note of the important timelines that will impact your work below: 

  • 12 July 2020 (Sunday), 3pm:
    • All long queues will stop dispatching jobs from the above date and time till the end of the shutdown. This is to ensure jobs do not get terminated during the shutdown period.
    • Jobs which can be completed before the shutdown may still be dispatched manually, subject to the availability of resources.        
  • 16 July 2020 (Thursday), 3pm:
    • Normal queues will stop dispatching jobs.
    • Jobs which can complete before the shutdown may still be dispatched manually, subject to the availability of resources.        
  • 17 July 2020 (Friday), 3pm:
    • All users will be logged out of the system.
    • All remaining running jobs will be allowed to terminate gracefully before the system shuts down. 
  • 20 July 2020 (Monday), 12pm:
    • ASPIRE 1 system will be brought back online and made available to users.

 Please contact us at [email protected] if you have any questions.

NSCC Team

Urgent Köppen Scheduled Maintenance on 9 July 2020 from 0900 hrs to 2100 hrs [Completed]

Dear Köppen Users,

We will be performing urgent system maintenance on 9 July 2020, Thursday from 0900 hrs to 2100 hrs.  This is to apply important security patches to fix critical system vulnerabilities. 

During the maintenance period, you will not be able to access Köppen or run any jobs. However, jobs waiting in the queue will not be affected.

We apologise for any inconvenience caused.

Please contact us at [email protected] if you have any questions. 

ASPIRE 1 Network Connection Disruption at NTU site [Resolved]

2020-06-23 20:45 – We are pleased to announce that the issue has been resolved. You may proceed to access the system as per usual.

2020-06-23 18:00 – Please be informed that we are experiencing some technical network issues at NTU. We are currently working to resolve them.

During this period, you will not be able to access ASPIRE 1 via ntu.nscc.sg. However, you may still access ASPIRE 1 via the NSCC VPN connection at https://vpn.nscc.sg.

We apologise for any inconvenience caused.

Please contact us at [email protected] if you have any questions. 

NSCC Team

ASPIRE 1 Network Connection Disruption at NTU site [Resolved]

2020-06-13 14:15 – We are pleased to announce that the issue has been resolved. You may proceed to access the system as per usual.

2020-06-13 10:30 – Please be informed that we are experiencing some technical network issues at NTU. We are currently working to resolve them.

During this period, you will not be able to access ASPIRE 1 via ntu.nscc.sg. However, you may still access ASPIRE 1 via the NSCC VPN connection at https://vpn.nscc.sg.

We apologise for any inconvenience caused.

Please contact us at [email protected] if you have any questions. 

 

NSCC’s Update for COVID-19

Dear NSCC Users,

 

With the closure of most workplaces from 7 April to 4 May 2020 that was mandated by the government, most of NSCC’s staff will be working remotely, leaving only essential personnel to oversee operations onsite.

 
We would like to assure our users that the supercomputer systems remain operational and that we have put in place measures to keep NSCC’s systems functioning as normal even with minimal staff onsite. We will endeavour to minimise any disruptions to our HPC services and to your current jobs.
 
Our online helpdesk continues to function as usual. Do feel free to contact us at [email protected] if you have any queries.
 
NSCC Team 
 

NSCC VPN Service Disruption on 23 March 2020 [Resolved]

2020-03-23 12:15 – We are pleased to announce that the issue with the VPN connection at https://vpn.nscc.sg has been rectified. You may proceed to access the system as per usual.

2020-03-23 11:30 – Please be informed that we are experiencing a technical issue with our NSCC VPN connection at https://vpn.nscc.sg. We are currently working to resolve it.

Users from A*STAR, NUS, NTU and SUTD who are telecommuting can still access ASPIRE 1 via their own institution’s VPN.
We apologise for any inconvenience caused and will inform all users once we have further updates.
Please contact us at [email protected] if you have any questions.

Service Disruption [Resolved]

2020-03-12 18:30 – We are pleased to announce that the issue with the cooling system has been rectified and our systems have been restored to normal. You may proceed to access the system as per usual.

2020-03-12 12:30 – We are facing some technical issues with the cooling systems in our data centre. As a precaution and to avoid any potential damage to our systems we have shut down all our compute nodes for the time being while the issue is being rectified.

We are continuing to monitor the situation. We apologise for any inconvenience caused and will inform you once we have further updates.

Scheduled Maintenance [Completed]

2020-01-28 20:20 – ASPIRE 1 maintenance completed and users can log in now.

2020-01-28 19:00 – We are facing some issues mounting GPFS back onto ASPIRE 1. We are currently working on it. We will keep you updated on the progress.

2020-01-28 17:00 – Maintenance work is still in progress. We are performing the final verification and aim to release the ASPIRE 1 system by 6pm. We apologise for any inconvenience caused.

Scheduled Maintenance [Completed]

To facilitate the implementation of our business contingency plan (BCP), there will be a scheduled system maintenance for ASPIRE 1 on 28 January 2020 (Tuesday) from 10am to 5pm.

During the maintenance period, please take note of the below:

1. Users will not be able to access the ASPIRE 1 system.
2. Jobs that are already running or in the queue will not be affected.

Please contact us at [email protected] if you have any questions.

Service Disruption [Resolved]

2019-12-28 17:30 – The /data file volume has been repaired and remounted.

2019-12-28 10:00 –  We are facing some issues with the /data file volume. We have taken it offline and are working on it now.

Most users will not be affected as this volume is used by a small group of projects. Nevertheless we have separately informed those who are directly affected.

We will update this page as we make progress in our recovery efforts. 

Scheduled Maintenance [Completed]

2019-12-03 17:25 – Issues with the NTU login nodes have been resolved. You may now access the NTU login nodes at ntu.nscc.sg.

2019-12-03 12:30 – ASPIRE 1 maintenance completed and users can log in now.
Note to NTU users: We are facing some issues with the NTU login nodes at ntu.nscc.sg and are still working to resolve them. However, jobs that are already in the queue will not be affected and will be dispatched as normal. In the meantime, you may still log in to ASPIRE 1 via https://vpn.nscc.sg if you need to access your account. We will update you once the issue is resolved.

2019-12-03 09:00 –  Commencing final system and services verification in preparation for system release.

2019-12-02 09:00 – MetroX switch firmware upgrade is in progress. The next update will be on 3 December 09:00 unless there is an extenuating issue.

2019-11-29 15:35 – PBS Pro and File System version upgrades are in progress. The next update will be on 2 December 09:00 unless there is an extenuating issue.

 

2019-11-29 15:00 – All users logged out, commencing systems shutdown.

 

ASPIRE 1 Scheduled System Maintenance from 29 Nov 2019, 3pm to 3 Dec 2019, 12pm

Please be informed that there will be a scheduled ASPIRE 1 system maintenance from 29 November 2019 (Friday), 3pm to 3 December 2019 (Tuesday), 12 noon. Please take note of the important timelines that will impact your work below:

  1. 24 November 2019 (Sunday), 3pm:
    • All long queues will stop dispatching jobs from the above date and time till the end of the shutdown. This is to ensure jobs do not get terminated during the shutdown period.
    • Jobs which can be completed before the shutdown may still be dispatched manually, subject to the availability of resources.
  2. 28 November 2019 (Thursday), 3pm:
    • Normal queues will stop dispatching jobs.
    • Jobs which can complete before the shutdown may still be dispatched manually, subject to the availability of resources.
  3. 29 November 2019 (Friday), 3pm:
    • All users will be logged out of the system.
    • All remaining running jobs will be allowed to terminate gracefully before the system shuts down.
  4. 3 December 2019 (Tuesday), 12pm:
    • ASPIRE 1 system will be brought back online and made available to users.

 Please contact us at [email protected] if you have any questions.

Scheduled Maintenance [Completed]

2019-06-12 12:45 – ASPIRE 1 maintenance completed and users can log in now.

2019-06-11 12:00 – Maintenance works in progress, including update of PBS and Lustre. The next update will be on 12 June 12:00 unless there are any extenuating issues.

2019-06-10 12:00 – Maintenance works in progress, including update of PBS and Lustre. The next update will be on 11 June 12:00 unless there are any extenuating issues.

2019-06-09 13:00 – Commencing maintenance works. The next update will be on 10 June 12:00 unless there are any extenuating issues.

2019-06-08 16:00 – Power restored to the data centre.

2019-06-07 20:20 – System shutdown completed. The next update is expected to be on 9 June 13:00 when power is restored to the data centre.

2019-06-07 15:00 – All users logged out, commencing shutdown operations.

 

ASPIRE 1 Scheduled Maintenance from 7 June to 12 June 2019

The building owners, JTC, have planned the annual power shutdown of the Fusionopolis building on 7 June 2019. As such the ASPIRE 1 system will be affected and will be shut down from 7 June 2019 (Friday), 3:00pm  to 12 June 2019 (Wednesday), 12:00pm.

We are also taking this opportunity to extend the shutdown period to carry out critical works to the data centre and to ASPIRE 1 itself. Please note that this shutdown exercise is longer than usual. Details of the shutdown maintenance can be found below.

IMPORTANT 

In preparation for the power shutdown, please take note of the following dates and times:

1.     2 June 2019 (Sunday), 3pm:

  • All long queues will stop dispatching jobs from the above date and time till the end of the shutdown. This is to ensure jobs do not get terminated during the shutdown period.
  • Jobs which can be completed before the shutdown may still be dispatched manually, subject to the availability of resources.

2.     6 June 2019 (Thursday), 3pm: 

  • Normal queues will stop dispatching jobs.
  • Jobs which can be completed before the shutdown may still be dispatched manually, subject to the availability of resources.

3.     7 June 2019 (Friday), 3pm: 

  • All users will be logged out of the system.
  • All remaining running jobs will be allowed to terminate gracefully before the system shuts down.

4.     10 June 2019 (Monday), 9am:

  • We will also be upgrading AMS (i.e. the Allocation Management System). However, with this upgrade, all jobs that have not been dispatched may be lost. Please check your jobs and resubmit them if required.

5.     12 June 2019 (Wednesday), 12pm: 

  • ASPIRE 1 system will be brought back online and made available to users.

Shutdown and Maintenance Details

The following activities will be carried out during the shutdown period from 7 June (Friday), 3pm to 12 June (Wednesday), 12pm:
1. JTC will be carrying out annual maintenance works to the electrical supply of the entire Fusionopolis building as mandated by law.
2. NSCC will be replacing all the circuit breakers in the data centre. This is an added safety measure to reduce the likelihood of short circuits in the data centre.
3. NSCC will be upgrading Lustre (i.e. the scratch file system) from version 2.07 to version 2.10. The new version of Lustre will address any potential compatibility issues.
4. NSCC will be upgrading the allocation management system (AMS) component of PBS. With this upgrade the occasional qstat or qsub timeout issues encountered should be mitigated. However, all jobs which have not been dispatched may be lost.

NSCC VPN Maintenance Scheduled on 11 May 2019 [Completed]

Please be informed that urgent maintenance on the NSCC VPN is scheduled for 11 May 2019 (Saturday), from 10am to 4pm.

During this period, users logging in via NSCC VPN at https://vpn.nscc.sg may experience intermittent service disruption. Users logging in from A*STAR, NUS, NTU and SUTD will not be affected.

Scheduled Maintenance [Completed]

2018-12-28 12:00 – Inspection of hardware has been completed.

2018-12-26 14:00 – 95% of hardware has been inspected and released for use

2018-12-21 12:05 – 75% of hardware has been inspected and released for use

2018-12-20 12:05 – 60% of hardware has been inspected and released for use

2018-12-19 12:05 – 50% of hardware has been inspected and released for use

2018-12-18 12:05 – 45% of hardware has been inspected and released for use

2018-12-17 13:00 – ASPIRE 1 maintenance completed and users can log in now. 40% of hardware has been inspected and released for use

2018-12-16 23:39 – Maintenance of system completed. Hardware inspection is still in progress

2018-12-16 09:00 – Hardware inspection started

2018-12-14 16:56 – Maintenance of system initiated

2018-12-14 16:23 – System shutdown completed

2018-12-14 15:27 – System shutdown initiated

2018-12-14 15:05 – Log out all users

Maintenance Window: 2018-12-14 (Fri) 15:00 to 2018-12-17 (Mon) 12:00
Services Affected : ASPIRE 1 System maintenance [Completed]

 

Please take note of the important timelines below that will impact your work:

  • 9 December 2018 (Sunday), 15:00 – All long queues will stop dispatching jobs. This is to ensure jobs do not get terminated during the shutdown period. Jobs which can complete before the shutdown may still be dispatched manually, subject to the availability of resources.
  • 13 December 2018 (Thursday), 15:00 – Normal queues will stop dispatching jobs. Jobs which can complete before the shutdown may still be dispatched manually, subject to the availability of resources.
  • 14 December 2018 (Friday), 15:00 – All users will be logged out of the system. All remaining running jobs will be allowed to terminate gracefully before the system shuts down.
  • 17 December 2018 (Monday), 12:00 – The system will be brought online and released to users.

Service Restoration [Completed]

2018-09-26 20:10 – We are pleased to inform you that the ASPIRE 1 system has been brought back online. You may log in to the system. We thank you for your patience and kind understanding.

2018-09-26 16:15 – We are still working to bring up the ASPIRE 1 system.

2018-09-26 12:05 – We are still working to bring up the ASPIRE 1 system.

2018-09-25 12:20 – The chilled water supply has been restored. We are working to bring up ASPIRE 1 system now.

2018-09-24 21:00 – The building management team is still working to restore the chilled water supply. Once it is properly restored, we will begin to power up ASPIRE 1 system.

2018-09-24 10:00 – The building management team is still working to restore the chilled water supply floor by floor. We are expecting further delays as we will only be able to power up the system after our chilled water supply is restored.

2018-09-23 12:35 – The building management team has advised us that the burst water pipe has also affected our power supply. We expect to have our power restored only tomorrow at the earliest.

2018-09-23 06:35 – System shutdown completed

2018-09-23 05:31 – System shutdown initiated

2018-09-23 05:15 – Login nodes not available

Service Notification [Completed]

Dear ASPIRE 1 Users, the chilled water pipe in Fusionopolis has burst, and our data centre is affected. We have therefore shut down the supercomputer and all associated services to protect them from damage. Unfortunately, all running jobs will be lost. Jobs that are still in the queue should not be affected.

We will update you when we have any developments. In the meantime you can refer to https://status.nscc.sg for the latest information.

Please accept our apologies for any inconvenience caused.

Scheduled Maintenance [Completed]

2018-06-08 15:00 – Log out all users

2018-06-08 16:00 – Start: upgrade HCA, OFED

2018-06-08 17:30 – Start: upgrade File System Servers OS

2018-06-08 21:00 – Complete: upgrade HCA, OFED

2018-06-09 09:00 – Start: upgrade NTU/NUS DWDM PacketLight

2018-06-09 09:00 – Start: upgrade Compute Nodes BIOS

2018-06-09 13:50 – Complete: upgrade NTU/NUS DWDM PacketLight

2018-06-10 22:00 – Complete: upgrade Compute Nodes BIOS

2018-06-10 22:00 – Complete: upgrade File System Servers OS

2018-06-10 22:00 – Services Verification

2018-06-11 12:30 – Complete: ASPIRE 1 maintenance and users can log in now

Scheduled Maintenance [Completed]

Maintenance Window: 2018-06-08 (Fri) 15:00 to 2018-06-11 12:00
Services Affected : ASPIRE 1 System maintenance

Please take note of the important timelines below that may impact your work:

  • 2018-06-03 (Sunday), 15:00 – All long queues will stop dispatching jobs. This is to ensure jobs do not get terminated during the maintenance period. Jobs which can complete before the maintenance may still be dispatched manually, subject to the availability of resources.
  • 2018-06-07 (Thursday), 15:00 – Normal queues will stop dispatching jobs.
  • 2018-06-08 (Friday), 15:00 – All users will be logged out of the system. All remaining running jobs will be allowed to terminate gracefully before the system maintenance begins.
  • 2018-06-11 (Monday), 12:00 – System will be brought online and released to users.

Please contact us at [email protected] if you have any questions.
