System News
-
Blue Waters System Reboot in Progress
Created by: jenos 2021-12-20 10:57:57 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an issue with its high speed network that was causing most jobs to stall or fail. Login node, filesystem, and data mover components are not expected to be interrupted by this operation. Thank you for your patience while the issue...
Read More -
Blue Waters network maintenance notice
Created by: jenos 2021-12-15 11:53:52 (Channels: systemnotices)
Blue Waters partners- A network component connecting Blue Waters services will have a brief maintenance operation performed today between 1-2pm CT. No major service impact is expected and we will be monitoring for connection interrupts, but if you experience connection issues, please report them to help+bw@ncsa.illinois.edu so we can ensure...
Read More -
Blue Waters returned to service
Created by: jenos 2021-09-28 09:05:33 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 9:00AM CT. Running jobs may have experienced I/O suspension if dependent upon the part of the scratch filesystem that was unavailable between 7AM-9AM this morning, or potentially no interruption at all if not. The job scheduler has been resumed. Please report any...
Read More -
Blue Waters System Outage
Created by: kingda 2021-09-28 07:58:43 (Channels: unscheduledoutages)
The Blue Waters Scratch filesystem is partially unavailable due to a power issue. Login nodes are still accessible, but the scheduler is paused. Please check the Blue Waters system blog for updates as recovery progresses.
Thank you for your patience as this is resolved.
Blue Waters Admins
-
Blue Waters Notice: Login Maintenance Update
Created by: tbouvet 2021-09-25 17:36:50 (Channels: scheduledoutages|systemnotices)
Update: The login nodes h2ologin1 and h2ologin3 were returned to service September 24th, 4PM CT. The login nodes h2ologin1 and h2ologin3 will be removed from service September 22, 2021 for maintenance. They will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes...
Read More -
Blue Waters Notice : Login Maintenance
Created by: tbouvet 2021-09-25 09:15:05 (Channels: scheduledoutages|systemnotices)
The login nodes h2ologin1 and h2ologin3 will be removed from service September 22, 2021 for maintenance. They will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes by using the round-robin alias bw.ncsa.illinois.edu or h2ologin.ncsa.illinois.edu. A followup email will be sent when...
Read More -
Blue Waters returned to service
Created by: jenos 2021-09-21 11:55:53 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 11:15AM CT. Most running jobs survived the event, but may have experienced extreme communication speed reduction or even failure in some cases during the event. Existing login sessions were not interrupted, but new login session attempts may have seen intermittent failure...
Read More -
Blue Waters System Outage
Created by: jenos 2021-09-21 10:25:10 (Channels: unscheduledoutages)
The Blue Waters compute resource is currently unavailable due to a power issue. Login nodes and filesystems are still accessible. Please check the Blue Waters system blog for updates as recovery progresses.
Thank you for your patience as this is resolved.
Blue Waters Admins
-
Blue Waters scheduler and login issues
Created by: kingda 2021-09-21 08:07:18 (Channels: unscheduledoutages)
The Blue Waters Scheduler and h2ologin3 are currently offline due to a power issue. While this issue persists, jobs cannot be submitted and will not start. Scheduling commands will not be available. No jobs should be lost. Administrators are investigating and will post updates to the system blog.
Blue Waters Admins
-
Blue Waters returned to service
Created by: jenos 2021-08-25 18:47:14 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service as of 6:45 pm CT after a compute subsystem reboot to correct a problem with the high speed network.
In addition, h2ologin1 has been returned to the service rotation addressed by the [bw|h2ologin].ncsa.illinois.edu hostnames.
We appreciate your patience while we addressed these matters.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: jenos 2021-08-25 15:46:14 (Channels: unscheduledoutages)
Update: Return to service is now estimated at 6pm CT. The Blue Waters compute system is being rebooted this morning due to an issue with its high speed network that was causing most jobs to fail. Login node, filesystem, and data mover components are not expected to be interrupted by this operation. ...
Read More -
Blue Waters Notice : Login Maintenance Update
Created by: tbouvet 2021-08-13 14:35:22 (Channels: scheduledoutages|systemnotices)
The login nodes h2ologin3 and h2ologin4 have been returned to service. The login node h2ologin1 will be removed from production Aug 22, 2021 for general maintenance. It will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes by using the round-robin alias bw.ncsa.illinois.edu...
Read More -
Blue Waters Notice : Login Maintenance
Created by: tbouvet 2021-07-29 16:46:08 (Channels: scheduledoutages|systemnotices)
The login nodes h2ologin3 and h2ologin4 will be removed from production Aug 9, 2021 for general maintenance. They will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes by using the round-robin alias bw.ncsa.illinois.edu or h2ologin.ncsa.illinois.edu. A followup email will be sent...
Read More -
Blue Waters filesystem issue resolved
Created by: jenos 2021-03-10 01:36:08 (Channels: unscheduledoutages)
The Blue Waters scratch filesystem has been serviced and resumed responsiveness by 01:30 AM on 3/10/2021. System impact from this issue may have been observed as early as 05:00 PM on 3/09/2021. Thank you for being patient while this issue was resolved.
Blue Waters Admins
-
Blue Waters filesystem issue
Created by: jenos 2021-03-10 00:02:29 (Channels: unscheduledoutages)
The Blue Waters /scratch filesystem is currently encountering an issue causing it to be slow or unresponsive. While this issue persists, login sessions may hang, and jobs may stop making progress or fail. Administrators are investigating and will post updates to the system blog.
Blue Waters Admins
-
Blue Waters Notice: shifter service issue
Created by: gbauer 2021-02-12 09:06:33 (Channels: unscheduledoutages)
The issue has been resolved.
-----
The shifter container service on Blue Waters is currently experiencing issues when pulling some new images through the shifter gateway.
Jobs that use currently pulled images should run as expected.
We are looking into the issue and will update the blog periodically.
We apologize for the inconvenience.
-Blue Waters
-
H2ologin4 had a system issue that required a reboot to resolve.
Created by: tbouvet 2020-12-16 15:23:51 (Channels: unscheduledoutages|systemnotices)
H2ologin4 was rebooted to resolve a system issue that it was experiencing today. We apologize for any inconvenience this may have caused.
-
Intermittent Login Issues
Created by: kingda 2020-12-09 15:54:08 (Channels: unscheduledoutages)
The Blue Waters login nodes were having issues that were identified and resolved. The issues started at 12:15pm CST and ended at 3:30pm CST.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue Resolved
Created by: mshow 2020-12-02 20:01:29 (Channels: unscheduledoutages)
The Blue Waters file system slowness issue has been resolved without a system interruption. The period where jobs or login nodes may have experienced slow filesystem symptoms spanned from 2:44 PM CST 12/2 to 7:55 PM CST 12/2.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue
Created by: mshow 2020-12-02 17:18:56 (Channels: unscheduledoutages)
The Blue Waters file system is experiencing an issue causing extreme slowness. Admins are investigating the issue and will update the portal blog with any updates prior to resolution. There is no estimated time for return to service yet.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue Resolved
Created by: jenos 2020-11-25 10:05:56 (Channels: unscheduledoutages)
The Blue Waters file system slowness issue has been resolved without a system interruption. The period where jobs or login nodes may have experienced slow filesystem symptoms spanned from 9:49 PM CST 11/24 to 6:00 AM CST 11/25.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue
Created by: jenos 2020-11-25 02:01:49 (Channels: unscheduledoutages)
The Blue Waters file system is experiencing an issue causing extreme slowness. Admins are investigating the issue and will update the portal blog with any updates prior to resolution. There is no estimated time for return to service yet.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters returned to service
Created by: jenos 2020-11-24 17:37:19 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service as of 5:35 pm CT after a full system reboot to correct the problem with system boot infrastructure. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: bbode 2020-11-24 11:14:49 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an issue with its primary boot infrastructure. Login node, scheduler, filesystem, and data mover components are not expected to be interrupted by this operation. Thank you for your patience while the issue is resolved. Estimated return to service for...
Read More -
Blue Waters upcoming login interrupts
Created by: tbouvet 2020-11-04 14:13:35 (Channels: scheduledoutages|systemnotices)
Login Maintenance Schedule: As logins are rotated out of production, a wall message will be issued as a reminder that an interrupt is pending. h2ologin1 has already been serviced. h2ologin2: access restricted 11/02/2020 10AM; reboot and return to service 11/03/2020 10AM. h2ologin3: access restricted 11/02/2020 10AM; reboot and return to service 11/03/2020 10AM. h2ologin4: access restricted...
Read More -
Blue Waters returned to service
Created by: mshow 2020-10-15 18:35:37 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 6:30pm CT after a full system reboot to correct the problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: bbode 2020-10-15 09:49:04 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an HSN issue. Login node, scheduler, filesystem, and data mover components are not expected to be interrupted by this operation. During the reboot a set of accumulated security related software updates will be applied. Those updates are not...
Read More -
Blue Waters Notice: Software and Maintenance
Created by: jenos 2020-06-11 12:12:35 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Partners: Summary: Globus command line interface issue is resolved; Shifter software upgrade on 6/18/2020; Reminder: Portal dynamic data interruption on 6/12/2020. The Globus issue experienced when using the command line interface provided by bwpy/2.0.2 has been resolved without a need for any local updates, as changes performed by...
Read More -
Blue Waters Maintenance Notice
Created by: mshow 2020-06-09 15:08:42 (Channels: scheduledoutages|systemnotices)
Blue Waters users- On Friday June 12 starting at 2 PM central, maintenance will be performed on a backend database supporting dynamic system information presented in the Blue Waters portal. The hardware maintenance should complete in less than an hour. During this timeframe, updated dynamic data regarding Blue Waters system status may...
Read More -
Blue Waters returned to service
Created by: jenos 2020-05-22 14:07:30 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 2:00pm CT after a full system reboot to correct the problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: jenos 2020-05-22 09:05:09 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an unexpected complication with hardware and cooling maintenance. Login node, scheduler, filesystem, and data mover components are not expected to be interrupted by this operation. Thank you for your patience while the issue is resolved. Estimated return to...
Read More -
Blue Waters Update: RSA deactivation
Created by: jenos 2020-04-07 16:22:40 (Channels: systemnotices|policychange)
Blue Waters partners- RSA authentication has been deactivated as scheduled, with one exception noted below due to Globus staff availability and illness circumstance. Please note that the Globus Organizational login page will continue to accept RSA, but you may also use your Globus account, as described here. An updated schedule for...
Read More -
Blue Waters Notice: RSA deactivation on 4/7/2020
Created by: jenos 2020-03-31 10:02:31 (Channels: systemnotices|policychange)
Blue Waters partners- As noted previously, RSA authentication to Blue Waters has an upcoming deactivation scheduled. The replacement authentication method via Duo was activated on Blue Waters in December, 2019. If you have not set up Duo and tested Duo authentication yet (for ssh or Globus endpoints), please do so as...
Read More -
Blue Waters returned to service
Created by: kingda 2020-03-29 18:41:43 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service after a full system reboot to correct a problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters Unscheduled Outage
Created by: jenos 2020-03-29 13:01:11 (Channels: unscheduledoutages)
Blue Waters is experiencing an issue with the high-speed network that began at 12:02 PM CT. System support staff are evaluating and attempting to restore normal service. Job scheduling is paused until the issue is resolved. The file systems and data transfer services are operating normally. Interim updates will be posted on...
Read More -
Blue Waters returned to service
Created by: bbode 2020-03-16 16:07:00 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service after a full system reboot to correct a problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters Notice: Programming Environments changed
Created by: gbauer 2020-01-23 17:59:03 (Channels: systemnotices)
Blue Waters partners; As announced back in December, the version defaults for the 4 programming environments (cray, gnu, intel and pgi) have been changed to newer versions as of 1/23/2020. A table with the changes is available at https://bluewaters.ncsa.illinois.edu/pe-updates#pe18.12. Note that gcc moved from 4.9.3 to 6.3.0 which can impact C++ codes...
Read More -
Blue Waters Reminder: Programming Environment change testing period 12/23/2019 to 01/23/2020
Created by: gbauer 2020-01-22 20:50:24 (Channels: systemnotices)
Blue Waters partners; This is a reminder that on January 23rd 2020 the version defaults for the 4 programming environments (cray, gnu, intel and pgi) will change to newer versions. Statically linked applications should continue to run without needing to be recompiled. Execution of dynamically linked applications should be checked. You can use...
Read More -
Reminder: Blue Waters System Changes
Created by: jenos 2020-01-14 00:08:02 (Channels: systemnotices|policychange|newfunctionality)
Blue Waters Users: In December 2019, you received a notification detailing some planned system changes. Transition summary: https://bluewaters.ncsa.illinois.edu/transition January 14, 2020 is the planned date to effect two changes worth noting: SSH host key change for those using RSA-based access to login nodes Accessing logins via ssh for some may yield an alarming warning...
Read More -
BlueWaters returned to service
Created by: mshow 2019-12-22 23:32:00 (Channels: mandatory|unscheduledoutages)
After a full system reboot and checkout, the system has been returned to full service operations.
-
BlueWaters system reboot
Created by: mshow 2019-12-22 13:55:58 (Channels: unscheduledoutages|systemnotices)
While the filesystem issue has been resolved, a full system reboot will be required before returning to production status. It is our expectation that the system will return later this evening.
BW Admin Team
-
Blue Waters: UPDATE Scheduler Remains Paused
Created by: tbouvet 2019-12-22 08:21:11 (Channels: unscheduledoutages|systemnotices)
UPDATE: Scheduler Remains Paused as we continue to restore the scratch file system to service. The Blue Waters scheduler is currently paused due to a metadata server issue with the scratch file system. We are actively working the issue and new logins will likely hang without completion. Status updates will be...
Read More -
Blue Waters: Scheduler paused because of a scratch file system issue.
Created by: tbouvet 2019-12-21 20:06:21 (Channels: unscheduledoutages|systemnotices)
The Blue Waters scheduler is currently paused due to a metadata server issue with the scratch file system. We are actively working the issue and new logins will likely hang without completion.
Status updates will be posted to the blog on the Blue Waters portal.
The Blue Waters team.
-
Blue Waters Returned to Service
Created by: tbouvet 2019-12-21 19:13:07 (Channels: unscheduledoutages|systemnotices)
Blue Waters returned to service 12/21/2019 at 7:00PM following today's file system issue.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters: Scheduler paused due to file system issue
Created by: bbode 2019-12-21 13:08:13 (Channels: unscheduledoutages|systemnotices)
The Blue Waters scheduler is currently paused due to two down storage targets in the scratch file system. Staff are currently working to resolve the issue.
Status updates will be posted to the blog on the Blue Waters portal.
The Blue Waters team.
-
Blue Waters Operations Transitioning
Created by: jenos 2019-12-19 19:03:08 (Channels: mandatory|systemnotices|general|policychange)
Hello Blue Waters partners, Today marks the conclusion of regular NSF Blue Waters operations and allocations. Allocations ending today will be granted the normal 90 day grace period to transfer data off Blue Waters storage systems. After today, jobs remaining in the queue will be permitted to run but job submission...
Read More -
Blue Waters Returned to Service
Created by: bbode 2019-11-28 22:36:06 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been rebooted and returned to service at 10:35PM following an issue with the high-speed network earlier this afternoon. All running jobs were lost due to the outage.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters: HSN issues full reboot in progress
Created by: bbode 2019-11-28 17:59:37 (Channels: unscheduledoutages|systemnotices)
An issue with the high-speed network on Blue Waters has forced a full system reboot. We currently anticipate a return to service of 11PM CST.
Status updates will be posted to the blog on the Blue Waters portal.
The Blue Waters team.
-
Nearline Tape Library Has Returned To Service
Created by: briandi 2019-11-14 17:19:54 (Channels: unscheduledoutages|systemnotices)
The NCSA_Nearline storage subsystem issue on Blue Waters was resolved and the system returned to normal operations at 3:30 pm.
-
Nearline Tape Library Emergency System Maintenance
Created by: briandi 2019-11-14 12:06:32 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, Blue Waters is experiencing an issue on a subset of the HPSS storage subsystem (ncsa#Nearline) that began early this morning. System support staff are evaluating and attempting to restore it to normal service. The rest of Blue Waters subsystems remain in normal operation. Some transfers in and out of...
Read More -
Blue Waters notice: disabling scheduler support for minimum wall clock (minwclimit) jobs
Created by: gbauer 2019-10-09 17:37:28 (Channels: systemnotices)
Blue Waters partners; We have recently discovered an issue in the batch job scheduler's support for minimum wall clock (minwclimit) jobs. Such jobs express wall clock flexibility with both a minimum wall clock and a preferred wall clock to improve job throughput. The issue is not expected to be resolved in a timely...
Read More -
Blue Waters Returned to Service
Created by: kingda 2019-10-01 20:46:38 (Channels: scheduledoutages|unscheduledoutages|systemnotices)
Blue Waters returned to service at 8:45PM following today's scheduler maintenance.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Scheduled systems testing period extended to 8PM Central
Created by: jenos 2019-10-01 16:54:57 (Channels: scheduledoutages|unscheduledoutages|systemnotices)
Update: This testing period will be extended to 8PM due to unanticipated system-related delays. A scheduled systems testing period will take place on Tuesday, October 1st from 7AM to 8PM Central (extended from 5PM), necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file...
Read More -
Blue Waters outage
Created by: mshow 2019-10-01 10:15:54 (Channels: unscheduledoutages|systemnotices)
Multiple cabinets have failed within the Blue Waters system. The failed area will be bypassed and operations will continue.
-
Reminder: Scheduled systems testing period on Tuesday, October 1st from 7AM to 5PM Central
Created by: kingda 2019-09-30 16:44:24 (Channels: scheduledoutages|systemnotices)
A scheduled systems testing period will take place on Tuesday, October 1st from 7AM to 5PM Central, necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file systems will remain accessible.
-
Scheduled systems testing period on Tuesday, October 1st from 7AM to 5PM Central
Created by: kingda 2019-09-23 20:36:26 (Channels: scheduledoutages|systemnotices)
A scheduled systems testing period will take place on Tuesday, October 1st from 7AM to 5PM Central, necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file systems will remain accessible.
-
Blue Waters Returned to Service
Created by: kingda 2019-09-12 16:28:51 (Channels: scheduledoutages|systemnotices)
Blue Waters returned to service at 4:00PM following today's scheduler maintenance.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Scheduled systems testing period on Thursday, September 12th from 10AM to 5PM Central
Created by: gbauer 2019-09-12 09:51:55 (Channels: scheduledoutages|systemnotices)
A scheduled systems testing period will take place on Thursday, September 12th from 10AM to 5PM Central, necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file systems will remain accessible.
-
Blue Waters: NPCF Power Issue Update
Created by: tbouvet 2019-07-06 03:45:32 (Channels: unscheduledoutages|systemnotices)
Blue Waters: NPCF Power Issue 7/5/2019 3PM
All Blue Waters Resources are available except for the compute nodes. Blue Waters Computes are being rebooted and all running jobs were lost. No RTS eta yet.
-
Blue Waters Returned to Service
Created by: bbode 2019-07-05 17:57:32 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been rebooted and returned to service at 5:55PM following a power interruption earlier this afternoon. All running jobs were lost due to the outage.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters: NPCF Power Issue, Scheduler paused expect full reboot
Created by: tbouvet 2019-07-05 13:32:15 (Channels: unscheduledoutages|systemnotices)
A power outage at the building housing the Blue Waters system has caused a service interruption; the Login Nodes, Network, Storage, Compute Nodes, and Near-line Storage may be unavailable. It is unknown at this time when a return to service can be expected. Watch the Blue Waters portal blog for updates.
-
Blue Waters: NPCF Facility Power Maintenance Complete
Created by: tbouvet 2019-06-24 06:07:40 (Channels: scheduledoutages|systemnotices)
The NPCF facility power maintenance is complete and Blue Waters has returned to service at 06:00 hours.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters Scheduled Maintenance Extended until June 24 6AM CT
Created by: tbouvet 2019-06-24 05:18:12 (Channels: scheduledoutages|systemnotices)
The scheduled maintenance has been extended until June 24th 6AM due to technical difficulties. We apologize for the delay in Return to Service. Availability details (subsystem / services: projected interruption duration): Compute: June 23 05:00 - June 24 06:00; Login / Filesystem: June 23 05:00 - June 24 06:00; Scheduler: June 23 05:00 - June 24...
Read More -
Reminder: Blue Waters Scheduled Maintenance June 23rd 2019 5AM
Created by: tbouvet 2019-06-22 14:39:23 (Channels: scheduledoutages|systemnotices)
Reminder of Maintenance, On Sunday June 23, electrical maintenance is scheduled to take place at the NPCF Building which hosts Blue Waters. Therefore, beginning at 05:00 CT, all Blue Waters subsystems will be unavailable. The outage is expected to last until 03:00 the following morning, June 24. During this interruption, Blue Waters and...
Read More -
Blue Waters: Nearline Tape Library Has Returned To Service
Created by: tbouvet 2019-05-31 12:58:05 (Channels: unscheduledoutages|systemnotices)
The Nearline subsystem reboot completed at 11:30AM today.
All existing transfers should resume as Nearline was returned to service.
-
Nearline Tape Library System Reboot
Created by: bbode 2019-05-31 09:28:04 (Channels: unscheduledoutages|systemnotices)
The Nearline subsystem is currently undergoing an emergency reboot to clear multiple issues. It is expected to return to service by 1PM today.
All existing transfers will resume once Nearline returns to service.
-
Blue Waters Has Returned to Service
Created by: tbouvet 2019-05-16 17:15:55 (Channels: unscheduledoutages|systemnotices)
The storage issue on the Blue Waters projects file system has been resolved and the system returned to normal operations at 5:07 PM CT. Any teams who were impacted by the file system issue have been contacted individually. The scheduler has resumed normal operations. Thank you for your patience while this was...
Read More -
Blue Waters Project File System Update
Created by: tbouvet 2019-05-16 09:29:40 (Channels: unscheduledoutages|systemnotices)
We are in the process of running a file system check and repair of a small portion of the projects file system. When that is complete we will assess the results and take appropriate action. Update: File system repair continues and is expected to last until late this afternoon (5/16). If...
Read More -
Blue Waters Project File System Issue
Created by: tbouvet 2019-05-15 15:20:25 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, Blue Waters is currently experiencing a storage server issue for a portion of the projects filesystem. As a result, all I/O transactions targeting the affected storage server will block. A single storage server supplies a small fraction of the file system data and all remaining storage servers continue normal operation....
Read More -
Blue Waters OSS Return to Service
Created by: squaire3 2019-05-12 12:33:07 (Channels: unscheduledoutages|systemnotices)
The storage issue on Blue Waters was resolved and the system returned to normal operations at 12:30 PM CT. Scheduler has also been resumed.
-
Blue Waters OSS Failover
Created by: squaire3 2019-05-12 09:18:53 (Channels: unscheduledoutages|systemnotices)
Blue Waters is currently experiencing a storage server failover on the OSS file system that began at 8:08 AM CT. As a result, all I/O transactions targeting the affected storage server will block until the failover completes on all clients. A single storage server supplies a small fraction of the...
Read More -
Nearline Tape Library System Maintenance
Created by: glasgow 2019-04-29 16:55:33 (Channels: unscheduledoutages|systemnotices)
Nearline maintenance work is complete and the service has returned to full operations as of: 1600hrs, April 29th, 2019 --- Nearline is undergoing emergency service on one tape library. The work is related to fallout from last week's hardware service and is expected to take approximately 10 hours to complete. Some files may...
Read More -
Nearline Tape Library System Maintenance
Created by: glasgow 2019-04-29 09:42:49 (Channels: unscheduledoutages|systemnotices)
Nearline is undergoing emergency service on one tape library. The work is related to fallout from last week's hardware service and is expected to take approximately 10 hours to complete. Some files may not be accessible during that time.
Start time: 0945hrs to ~ 2000hrs, April 29, 2019
-
Nearline Tape Library System Maintenance
Created by: glasgow 2019-04-22 15:04:14 (Channels: scheduledoutages|systemnotices)
One of Nearline's four tape libraries will undergo maintenance to correct power control problems. This library system will be unavailable for four hours while the work is conducted. Files stored in the library will be unavailable for staging during that time. Retrieval jobs from Globus will wait for the files...
Read More -
Blue Waters Return to Service
Created by: squaire3 2019-04-12 15:41:38 (Channels: unscheduledoutages|systemnotices)
The high-speed network issue on Blue Waters was resolved and the system returned to normal operations at 3:34 PM CT. Scheduler has also been resumed.
-
Blue Waters Scheduler Paused - HSN Issue
Created by: squaire3 2019-04-12 15:02:55 (Channels: unscheduledoutages|systemnotices)
Blue Waters is experiencing an issue on the high-speed network that began at 2:10 PM CT. System support staff are evaluating and attempting to restore normal service. Job scheduling is paused until the issue is resolved. The file systems and data transfer services are operating normally. Logins have been occasionally hanging...
Read More -
Nearline Endpoint Paused for Storage Maintenance
Created by: glasgow 2019-04-08 23:14:20 (Channels: unscheduledoutages|systemnotices)
The Nearline endpoint has now been returned to normal operations. The Blue Waters Nearline endpoint will be paused beginning at 1700hrs CDT. New and current user actions/requests will be paused and will resume normal activity when the endpoint is released. No user action is necessary. This maintenance...
Read More -
Nearline Endpoint Paused for Storage Maintenance
Created by: glasgow 2019-04-08 16:27:20 (Channels: unscheduledoutages|systemnotices)
The Blue Waters Nearline endpoint will be paused beginning at 1700hrs CDT. New and current user actions/requests will be paused and will resume normal activity when the endpoint is released. No user action is necessary. This maintenance window will be used to conduct resource management operations that have been deferred...
Read More -
Blue Waters Returned to Service
Created by: tbouvet 2019-04-07 15:14:36 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users,
The storage server issue on the scratch file system is resolved. I/O transactions initiated during the outage should have resumed when the Lustre target returned. Blue Waters has resumed normal operations.
-
Blue Waters Scheduler is paused 9:30 AM
Created by: tbouvet 2019-04-07 10:30:00 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, Blue Waters is currently experiencing a storage server issue for a small portion of the scratch filesystem. As a result, all I/O transactions targeting the affected storage server will block. A single storage server supplies a small fraction of the file system data and all remaining storage servers continue normal...
Read More -
BlueWaters cabinet failure
Created by: mshow 2019-03-23 16:44:36 (Channels: unscheduledoutages|systemnotices)
The cabinet has been restored.
-
BlueWaters repair complete
Created by: mshow 2019-03-23 05:13:22 (Channels: unscheduledoutages|systemnotices)
The cabinet has been restored.
-
BlueWaters cabinet failure
Created by: mshow 2019-03-23 03:46:37 (Channels: unscheduledoutages|systemnotices)
A cabinet has shut down, resulting in job loss and an incomplete network configuration. It is unknown at this time when a return to service can be expected for that cabinet. Watch the Blue Waters portal blog for updates.
-
Blue Waters returned to service
Created by: mshow 2019-02-24 03:41:58 (Channels: mandatory|unscheduledoutages|systemnotices)
The high-speed network issue on Blue Waters was resolved and the system returned to normal operations at 3:30 AM CT.
-
Blue Waters High Speed Network issue
Created by: mshow 2019-02-23 23:43:19 (Channels: unscheduledoutages|systemnotices)
Blue Waters is experiencing an issue on the high-speed network that began at 9:48 PM CT. System support staff are evaluating and attempting to restore normal service. Job scheduling is paused until the issue is resolved. The file systems and data transfer services are operating normally. Interim updates will be posted on...
Read More -
Blue Waters Notice: System returned to service
Created by: jenos 2019-02-06 15:31:27 (Channels: unscheduledoutages)
Blue Waters Users:
The reboot is complete and the system has returned to service as of 3:14pm CT. Tomorrow's near-line storage maintenance will proceed as planned.
We apologize for any inconvenience.
-
Blue Waters Unplanned Reboot
Created by: bbode 2019-02-06 10:00:16 (Channels: unscheduledoutages)
Blue Waters Users, We experienced an issue that has the high speed network in an unrecoverable state. We have to reboot the system to recover and all running jobs will be lost. The login nodes and endpoints (ncsa#Nearline ncsa#BlueWaters) will remain available during the reboot. The current estimate for return to...
Read More -
Blue Waters Notice: System returned to service
Created by: tbouvet 2019-01-12 13:09:15 (Channels: scheduledoutages|unscheduledoutages)
Blue Waters Users:
The reboot is complete and the system has returned to service.
We apologize for any inconvenience.
-
Blue Waters Notice: Unplanned Reboot
Created by: tbouvet 2019-01-12 09:58:18 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, We experienced an issue that has the high speed network in an unrecoverable state. We have to reboot the system to recover and all running jobs will be lost. The login nodes and endpoints (ncsa#Nearline ncsa#BlueWaters) will remain available during the reboot. The current estimate for return to...
Read More -
Blue Waters Notice: Nearline System (HPSS) return to service
Created by: tbouvet 2019-01-10 15:24:20 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users:
The ncsa#Nearline endpoint is available and has returned to service at 3PM CT. We apologize for any inconvenience.
-Blue Waters
-
Blue Waters Notice: Nearline System (HPSS) remains unavailable.
Created by: tbouvet 2019-01-10 09:38:30 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users:
The ncsa#Nearline endpoint is paused pending repairs to systems damaged during the data center power failure on 1/09/19. We apologize for any inconvenience.
Please check back for an update on the situation.
-Blue Waters
-
Blue Waters Notice: System has returned to service
Created by: jenos 2019-01-10 09:37:52 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users: The power outage at the NPCF building has been resolved. All Blue Waters services are now available with the exception of the Nearline storage system, which will take a bit longer to recover. The job scheduler has been resumed and login nodes have access re-enabled. There will be...
Read More -
Blue Waters Notice: Power disruption
Created by: jenos 2019-01-09 16:21:59 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users: Update 4:20pm: All systems up except Nearline. Access remains restricted while performance tests complete. A power outage at the building housing the Blue Waters system has caused a service interruption; all running jobs were consequently terminated. All Blue Waters subsystems have been affected and are currently out of...
Read More -
Blue Waters Notice: System returned to service
Created by: jenos 2018-12-27 23:43:11 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users:
The power outage at the Blue Waters building has been resolved and the compute nodes have been restarted. The Blue Waters system has been returned to normal operations at 11:40 PM CT. We apologize for any inconvenience.
-
Blue Waters Announcement: Winter Holiday services
Created by: gbauer 2018-12-21 09:21:34 (Channels: systemnotices|general)
Blue Waters partners; NCSA staff will observe the University of Illinois Winter break schedule starting 5pm Friday December 21, 2018, and will resume normal business hours on Wednesday, January 2nd, 2019. On-call staff will monitor Blue Waters and the service request system during this period, and respond to issues in a...
Read More -
Blue Waters Return to Service
Created by: tbouvet 2018-11-14 18:35:11 (Channels: unscheduledoutages)
The metadata server failover on the Home file system completed at 6:23 PM CT. I/O transactions initiated during the failover should have resumed normal operation when failover completed. Blue Waters has resumed normal operations.
-
Blue Waters File System Issue
Created by: tbouvet 2018-11-14 17:45:36 (Channels: unscheduledoutages)
Blue Waters is currently experiencing a metadata server failover on the home file system that began at 5:30 PM CT. As a result, all I/O transactions for the home filesystem will block until the failover completes on all clients. The rest of Blue Waters including the other filesystems are operating...
Read More -
Blue Waters Return to Service
Created by: tbouvet 2018-11-14 11:32:42 (Channels: unscheduledoutages)
The metadata server failover on the Home file system completed at 11:26 AM CT. The file system issue started at 10:45 AM. I/O transactions initiated during the failover should have resumed normal operation when failover completed. Blue Waters has resumed normal operations.
-
Blue Waters File System Issue
Created by: tbouvet 2018-11-14 11:19:47 (Channels: unscheduledoutages)
Blue Waters is currently experiencing a metadata server failover on the home file system that began at 11:15 AM CT. As a result, all I/O transactions for the home filesystem will block until the failover completes on all clients. The rest of Blue Waters including the other filesystems are operating...
Read More -
Blue Waters Announcement: Ongoing Blue Waters Job Discounts
Created by: gbauer 2018-11-12 21:44:41 (Channels: systemnotices|general|policychange)
This is a reminder that the first period of job discounts ends on Thursday, November 15th. The second period starts on the 16th but with different criteria. Get your jobs submitted soon. Stop by the NCSA booth and say hi if you are at SC18. > The Blue Waters project is happy to...
Read More -
BlueWaters returned to service
Created by: mshow 2018-11-05 06:18:32 (Channels: unscheduledoutages)
The system has been returned to full service operation after a brief unscheduled outage.
-
BlueWaters system issue
Created by: mshow 2018-11-05 01:21:18 (Channels: unscheduledoutages|general)
The system has experienced a fault that will require a full system shutdown/reboot. This will take several hours before the system is returned to full operations.
-
Blue Waters Discount Update
Created by: jenos 2018-11-02 17:22:33 (Channels: systemnotices|general|policychange)
The previously announced discounts have not changed; however, they have not been activated yet to be reflected in current charges. This is expected to be activated next week, but will be retroactively applied to the original start date of Nov 1, 2018.
-
Blue Waters Announcement: Blue Friday Job Discounts start today!
Created by: jenos 2018-11-01 12:34:44 (Channels: systemnotices|general|policychange)
The Blue Waters project is happy to announce two job discount periods as our way of saying thank you for another successful year of computing and for your help creating our 2017-2018 annual report. There are two discount periods: Nov 1 - Nov 15: 50% charge discount on all queues, all jobs completing during...
Read More -
Blue Waters Notice: Temporary changes to shell stack size limit
Created by: gbauer 2018-10-05 14:40:33 (Channels: systemnotices)
Blue Waters partners, We would like to inform you of a change to the Blue Waters user environment that will be made today, Friday October 5th, at 2:30 pm Central time. A subsequent notice will be sent when the change is reverted. The change is: A. The shell stack size soft limit will be set to 2097152 kbytes and the...
Read More -
Globus SSH-based CLI End of Life
Created by: gbauer 2018-08-01 15:19:49 (Channels: systemnotices)
As of August 1, 2018 the SSH-based command-line interface (CLI) to Globus is end-of-life.
Please see https://bluewaters.ncsa.illinois.edu/data-transfer-doc (scroll to the CLI section) and/or the Globus CLI page https://docs.globus.org/cli/ for more information on the new Python-based CLI.
Send email to help+bw@ncsa.illinois.edu for assistance, questions, etc.
-
Blue Waters Scheduled Maintenance: Return to Service
Created by: gbauer 2018-07-16 19:43:38 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Users, Blue Waters has been returned to service after a security update and programming environment update. CUDA 9.1 is now available as the default cudatoolkit but with gcc/4.9.3 remaining as the default GNU compiler. The default programming environment otherwise did not change to allow for an extended transition to...
Read More -
Blue Waters Scheduled Maintenance Update
Created by: jenos 2018-07-13 11:57:54 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Users, Please note the following scheduled maintenance update: The scheduled maintenance for CUDA 9.1 deployment on Blue Waters has been expanded to a near full system service outage that will span 13 hours on Monday July 16th, from 9am to 10pm. This adjustment will eliminate the need for an additional...
Read More -
Blue Waters XK (GPU) resource and HPSS Nearline resources will be unavailable from 9am to 12pm on Monday July 16th
Created by: kingda 2018-07-09 09:08:10 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Users, Blue Waters XK compute resource and HPSS ncsa#Nearline resource will be undergoing maintenance Monday (July 16th) morning beginning at 9 AM. The XK compute nodes are anticipated to be unavailable for up to 3 hours. The login nodes and XE compute resource will remain in service. A new Programming Environment will be installed...
Read More -
Blue Waters Partial Scratch Outage
Created by: tbouvet 2018-06-23 21:31:02 (Channels: unscheduledoutages|systemnotices)
Blue Waters experienced a network switch failure that resulted in a partial outage of the scratch filesystem (ost168-179) from 7:44 PM CT to 7:59 PM CT. Jobs that ended during this time may have been impacted. I/O transactions targeting the affected storage server should block until the ost targets returned...
Read More -
Blue Waters Nearline Endpoint Busy
Created by: gbauer 2018-06-21 23:07:24 (Channels: systemnotices)
Due to very high demand for data retrieval from Nearline, a pause rule is in effect to allow manual task scheduling. You may submit tasks as normal and they will be run as quickly as possible. Tasks submitted to Globus will start in a paused state but will be released to...
Read More -
Blue Waters returned to full service
Created by: mshow 2018-06-12 10:23:37 (Channels: mandatory|unscheduledoutages|systemnotices)
Blue Waters has returned to full service after recovery from a power event.
-
Blue Waters system power interruption
Created by: mshow 2018-06-12 04:56:14 (Channels: mandatory|unscheduledoutages|systemnotices)
Thunderstorms have resulted in a power interruption of the BlueWaters System. This outage impacts both the compute nodes and all filesystems. Therefore, a full reboot will be necessary. Return to service is estimated to be approximately 10 am Central time.
BW Admin
-
Blue Waters has Returned to Service
Created by: tbouvet 2018-06-07 14:27:12 (Channels: unscheduledoutages|systemnotices)
Blue Waters has returned to full service at 2:14 PM CT. The issue encountered required a full system reboot to resolve. All running jobs were lost, so please resubmit your jobs from the latest checkpoint file if your job exited prematurely.
-
Blue Waters System Issue
Created by: tbouvet 2018-06-07 10:12:41 (Channels: unscheduledoutages)
Blue Waters is experiencing a full system issue that began at 6:30 AM CT. System support staff are evaluating and attempting to restore normal service but may require a full system reboot. Job scheduling is paused until the issue is resolved. Interim updates will be posted on the Blue Waters...
Read More -
Blue Waters notice: Container software re-enabled on Blue Waters
Created by: gbauer 2018-05-09 16:25:31 (Channels: systemnotices|softwareupdate)
We are pleased to announce that the container software package Shifter and its associated service have been re-enabled on Blue Waters. Shifter was checked for vulnerabilities and none were found. To avoid concern, a patch was applied, and Shifter has been updated and re-enabled. Please see the Shifter page on the portal for...
Read More -
Blue Waters notice: Container software disabled on Blue Waters
Created by: gbauer 2018-05-02 14:28:53 (Channels: systemnotices)
This notice affects Blue Waters users relying on the container technologies of Shifter or Singularity. Due to a recently announced security vulnerability related to container software, Shifter and Singularity container software have been disabled immediately to address the issue. We took this action to eliminate the risk of potential compromise of Blue Waters...
Read More -
Blue Waters Announcement: Winter Holiday services
Created by: gbauer 2017-12-21 15:49:34 (Channels: systemnotices|general)
Blue Waters partners; NCSA staff will observe the University of Illinois Winter break schedule starting 5pm Friday December 22, 2017, and will resume normal business hours on Tuesday, January 2nd, 2018. On-call staff will monitor Blue Waters and the service request system during this period and respond to issues in a...
Read More