Warning: count(): Parameter must be an array or an object that implements Countable in /var/www/html/new_account/api/pear/pear/DB.php on line 776
Warning: sizeof(): Parameter must be an array or an object that implements Countable in /var/www/html/new_account/api/pear/pear/DB/common.php on line 1653
System News
Filtered by: Scheduled Outages | Unscheduled Outages | (Remove Filters)
-
Blue Waters System Reboot in Progress
Created by: jenos 2021-12-20 10:57:57 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an issue with its high speed network that was causing most jobs to stall or fail. Login node, filesystem, and data mover components are not expected to be interrupted by this operation. Thank you for your patience while the issue...
Read More -
Blue Waters returned to service
Created by: jenos 2021-09-28 09:05:33 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 9:00AM CT. Running jobs may have experienced I/O suspension if dependent upon the part of the scratch filesystem that was unavailable between 7AM-9AM this morning, or potentially no interruption at all if not. The job scheduler has been resumed. Please report any...
Read More -
Blue Waters System Outage
Created by: kingda 2021-09-28 07:58:43 (Channels: unscheduledoutages)
The Blue Waters Scratch filesystem is partially unavailable due to a power issue. Login nodes are still accessible, but the scheduler is paused. Please check the Blue Waters system blog for updates as recovery progresses.
Thank you for your patience as this is resolved.
Blue Waters Admins
-
Blue Waters Notice: Login Maintenance Update
Created by: tbouvet 2021-09-25 17:36:50 (Channels: scheduledoutages|systemnotices)
Update: The login nodes h2ologin1 and h2ologin3 were returned to service September 24th, 4PM CT. The login nodes h2ologin1 and h2ologin3 will be removed from service September 22, 2021 for maintenance. They will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes...
Read More -
Blue Waters Notice : Login Maintenance
Created by: tbouvet 2021-09-25 09:15:05 (Channels: scheduledoutages|systemnotices)
The login nodes h2ologin1 and h2ologin3 will be removed from service September 22, 2021 for maintenance. They will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes by using the round-robin alias bw.ncsa.illinois.edu or h2ologin.ncsa.illinois.edu. A followup email will be sent when...
Read More -
Blue Waters returned to service
Created by: jenos 2021-09-21 11:55:53 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 11:15AM CT. Most running jobs survived the event, but may have experienced extreme communication speed reduction or even failure in some cases during the event. Existing login sessions were not interrupted, but new login session attempts may have seen intermittent failure...
Read More -
Blue Waters System Outage
Created by: jenos 2021-09-21 10:25:10 (Channels: unscheduledoutages)
The Blue Waters compute resource is currently unavailable due to a power issue. Login nodes and filesystems are still accessible. Please check the Blue Waters system blog for updates as recovery progresses.
Thank you for your patience as this is resolved.
Blue Waters Admins
-
Blue Waters scheduler and login issues
Created by: kingda 2021-09-21 08:07:18 (Channels: unscheduledoutages)
The Blue Waters Scheduler and h2ologin3 is currently offline due to a power issue. While this issue persists, jobs cannot be submitted and will not start. Scheduling commands will not be available. No jobs should be lost. Administrators are investigating and will post updates to the system blog.
Blue Waters Admins
-
Blue Waters returned to service
Created by: jenos 2021-08-25 18:47:14 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service as of 6:45 pm CT after a compute subsystem reboot to correct a problem with the high speed network.
In addition, h2ologin1 has been returned to the service rotation addressed by the [bw|h2ologin].ncsa.illinois.edu hostnames.
We appreciate your patience while we addressed these matters.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: jenos 2021-08-25 15:46:14 (Channels: unscheduledoutages)
Update: Return to service is now estimated at 6pm CT. The Blue Waters compute system is being rebooted this morning due to an issue with its high speed network that was causing most jobs to fail. Login node, filesystem, and data mover components are not expected to be interrupted by this operation. ...
Read More -
Blue Waters Notice : Login Maintenance Update
Created by: tbouvet 2021-08-13 14:35:22 (Channels: scheduledoutages|systemnotices)
The login nodes h2ologin3 and h2ologin4 have been returned to service. The login node h2ologin1 will be removed from production Aug 22, 2021 for general maintenance. It will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes by using the round-robin alias bw.ncsa.illinois.,edu...
Read More -
Blue Waters Notice : Login Maintenance
Created by: tbouvet 2021-07-29 16:46:08 (Channels: scheduledoutages|systemnotices)
The login nodes h2ologin3 and h2ologin4 will be removed from production Aug 9, 2021 for general maintenance. They will have access restricted and current login sessions will drain for 24hrs. Please access the available login nodes by using the round-robin alias bw.ncsa.illinois.,edu or h2ologin.ncsa.illinois.edu. A followup email will be sent...
Read More -
Blue Waters filesystem issue resolved
Created by: jenos 2021-03-10 01:36:08 (Channels: unscheduledoutages)
The Blue Waters scratch filesystem has been serviced and resumed responsiveness by 01:30 AM on 3/10/2021. System impact from this issue may have been observed as early as 05:00 PM on 3/09/2021. Thank you for being patient while this issue was resolved.
Blue Waters Admins
-
Blue Waters filesystem issue
Created by: jenos 2021-03-10 00:02:29 (Channels: unscheduledoutages)
The Blue Waters /scratch filesystem is currently encountering an issue causing it to be slow or unresponsive. While this issue persists, login sessions may hang, and jobs may stop making progress or fail. Administrators are investigating and will post updates to the system blog.
Blue Waters Admins
-
Blue Waters Notice: shifter service issue
Created by: gbauer 2021-02-12 09:06:33 (Channels: unscheduledoutages)
The issue has been resolved.
-----
The shifter container service on Blue Waters is currently experiencing issues when pulling some new images through the shifter gateway.
Jobs that use currently pulled images should run as expected
We are looking into the issue and will update the blog periodically.,
We apologize for the inconvenience.
-Blue Waters
-
H2ologin4 had a system issue that required a reboot to resolve.
Created by: tbouvet 2020-12-16 15:23:51 (Channels: unscheduledoutages|systemnotices)
H2ologin4 was rebooted to resolve a system issue that it was experiencing today. We apologize for any inconvenience this may have caused.
-
Intermittent Login Issues
Created by: kingda 2020-12-09 15:54:08 (Channels: unscheduledoutages)
The Blue Waters login nodes were having issues that were identified and resolved. The issues started at 12:15pm CST and ended at 3:30pm CST.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue Resolved
Created by: mshow 2020-12-02 20:01:29 (Channels: unscheduledoutages)
The Blue Waters file system slowness issue has been resolved without a system interruption. The period where jobs or login nodes may have experienced slow filesystem symptoms spanned from 2:44 PM CST 12/2 to 7:55 PM CST 12/2.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue
Created by: mshow 2020-12-02 17:18:56 (Channels: unscheduledoutages)
The Blue Waters file system is experiencing an issue causing extreme slowness. Admins are investigating the issue and will update the portal blog with any updates prior to resolution. There is no estimated time for return to service yet.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue Resolved
Created by: jenos 2020-11-25 10:05:56 (Channels: unscheduledoutages)
The Blue Waters file system slowness issue has been resolved without a system interruption. The period where jobs or login nodes may have experienced slow filesystem symptoms spanned from 9:49 PM CST 11/24 to 6:00 AM CST 11/25.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters Filesystem Issue
Created by: jenos 2020-11-25 02:01:49 (Channels: unscheduledoutages)
The Blue Waters file system is experiencing an issue causing extreme slowness. Admins are investigating the issue and will update the portal blog with any updates prior to resolution. There is no estimated time for return to service yet.
Questions? Mail help+bw@ncsa.illinois.edu.
-
Blue Waters returned to service
Created by: jenos 2020-11-24 17:37:19 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service as of 5:35 pm CT after a full system reboot to correct the problem with system boot infrastructure. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: bbode 2020-11-24 11:14:49 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an issue with its primary boot infrastructure. Login node, scheduler, filesystem, and data mover components are not expected to be interrupted by this operation. Thank you for your patience while the issue is resolved. Estimated return to service for...
Read More -
Blue Waters upcoming login interrupts
Created by: tbouvet 2020-11-04 14:13:35 (Channels: scheduledoutages|systemnotices)
Login Maintenance Schedule As logins are rotated out of production, a wall message will be issued as a reminder that an interrupt is pending. h2ologin1 has already been serviced. h2ologin2 access restricted 11/02/2020 10AM Reboot and return to service 11/03/2020 10AM h2ologin3 access restricted 11/02/2020 10AM Reboot and return to service 11/03/2020 10AM h2ologin4 access restricted...
Read More -
Blue Waters returned to service
Created by: mshow 2020-10-15 18:35:37 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 6:30pm CT after a full system reboot to correct the problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: bbode 2020-10-15 09:49:04 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an HSN issue. Login node, scheduler, filesystem, and data mover components are not expected to be interrupted by this operation. During the reboot a set of accumulated security related software updates will be applied. Those updates are not...
Read More -
Blue Waters Notice: Software and Maintenance
Created by: jenos 2020-06-11 12:12:35 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Partners: Summary: Globus command line interface issue is resolved. Shifter software upgrade on 6/18/2020 Reminder: Portal dynamic data interruption on 6/12/2020 The Globus issue experienced when using the command line interface provided by bwpy/2.0.2 has been resolved without a need for any local updates, as changes performed by...
Read More -
Blue Waters Maintenance Notice
Created by: mshow 2020-06-09 15:08:42 (Channels: scheduledoutages|systemnotices)
Blue Waters users- On Friday June 12 startnig at 2 PM central, maintenance will be performed on a backend database supporting dynamic system information presented in the Blue Waters portal. The hardware maintenance should complete in less than an hour. During this timeframe, updated dynamic data regarding Blue Waters system status may...
Read More -
Blue Waters returned to service
Created by: jenos 2020-05-22 14:07:30 (Channels: unscheduledoutages)
Blue Waters has been returned to service as of 2:00pm CT after a full system reboot to correct the problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters System Reboot in Progress
Created by: jenos 2020-05-22 09:05:09 (Channels: unscheduledoutages)
The Blue Waters compute system is being rebooted this morning due to an unexpected complication with hardware and cooling maintenance. Login node, scheduler, filesystem, and data mover components are not expected to be interrupted by this operation. Thank you for your patience while the issue is resolved. Estimated return to...
Read More -
Blue Waters Maintenance Notice
Created by: jenos 2020-05-01 16:02:55 (Channels: scheduledoutages)
Blue Waters users- For the duration spanning May 5 through May 6, maintenance will be performed on a backend database supporting dynamic system information presented in the Blue Waters portal. During this timeframe, updated dynamic data regarding Blue Waters system status may not be available. The Blue Waters system is expected...
Read More -
Blue Waters returned to service
Created by: kingda 2020-03-29 18:41:43 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service after a full system reboot to correct a problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters Unscheduled Outage
Created by: jenos 2020-03-29 13:01:11 (Channels: unscheduledoutages)
Blue Waters is experiencing an issue with the high-speed network that began at 12:02 PM CT. System support staff are evaluating and attempting to restore normal service. Job scheduling is paused until the issue is resolved. The file systems and data transfer services are operating normally. Interim updates will be posted on...
Read More -
Blue Waters returned to service
Created by: bbode 2020-03-16 16:07:00 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been returned to service after a full system reboot to correct a problem on the high-speed network. All previously running jobs were lost.
The Blue Waters Team
-
Blue Waters Announcement: Nearline Retirement Approaching!
Created by: bbode 2020-03-02 09:28:06 (Channels: mandatory|scheduledoutages)
Hello Blue Waters Partners, The Blue Waters nearline (tape) subsystem will be permanently shutdown March 31, 2020. This is a hard deadline with no extension possible as NCSA is contractually obligated to remove the controlling software and metadata following the shutdown since we will no longer have the right...
Read More -
BlueWaters returned to service
Created by: mshow 2019-12-22 23:32:00 (Channels: mandatory|unscheduledoutages)
After a full system reboot and checkout, the system hs been retrned to full service operations
-
BlueWaters system reboot
Created by: mshow 2019-12-22 13:55:58 (Channels: unscheduledoutages|systemnotices)
While the filesystem issue has been resolved, a full system reboot will be required before returning to production status. It is our expectation that the system will return later this evening.
BW Admin Team
-
Blue Waters: UPDATE Scheduler Remains Paused
Created by: tbouvet 2019-12-22 08:21:11 (Channels: unscheduledoutages|systemnotices)
UPDATE: Scheduler Remains Paused as we continue to restore the scratch file system to service. The Blue Waters scheduler is currently paused due to a meda data server issue with the scratch file system. We are actively working the issue and new logins will likely hang without completion. Status updates will be...
Read More -
Blue Waters: Scheduler paused because of a scratch file system issue.
Created by: tbouvet 2019-12-21 20:06:21 (Channels: unscheduledoutages|systemnotices)
The Blue Waters scheduler is currently paused due to a meda data server issue with the scratch file system. We are actively working the issue and new logins will likely hang without completion.
Status updates will be posted to the blog on the Blue Waters portal.
The Blue Waters team.
-
Blue Waters Returned to Service
Created by: tbouvet 2019-12-21 19:13:07 (Channels: unscheduledoutages|systemnotices)
Blue Waters returned to service 12/21/2019 at 7:00PM following today's file system issue.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters: Scheduler paused due to file system issue
Created by: bbode 2019-12-21 13:08:13 (Channels: unscheduledoutages|systemnotices)
The Blue Waters scheduler is currently paused due to two down storage targets in the scratch file system. Staff are currently working to resolve the issue.
Status updates will be posted to the blog on the Blue Waters portal.
The Blue Waters team.
-
Blue Waters Returned to Service
Created by: bbode 2019-11-28 22:36:06 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been rebooted and returned to service at 10:35PM following an issue with the high-speed network earlier this afternoon. All running jobs were lost due to the outage.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters: HSN issues full reboot in progress
Created by: bbode 2019-11-28 17:59:37 (Channels: unscheduledoutages|systemnotices)
An issue with the high-speed network on Blue Waters has forced a full system reboot. We currently anticipate a return to service of 11PM CST.
Status updates will be posted to the blog on the Blue Waters portal.
The Blue Waters team.
-
Nearline Tape Library Has Returned To Service
Created by: briandi 2019-11-14 17:19:54 (Channels: unscheduledoutages|systemnotices)
The NCSA_Nearline storage subsystem issue on Blue Waters was resolved and the system returned to normal operations at 3:30 pm.
-
Nearline Tape Library Emergency System Maintenance
Created by: briandi 2019-11-14 12:06:32 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, Blue Waters is experiencing an issue on a subset of the HPSS storage subsystem (ncsa#Nearline) that began early this morning. System support staff are evaluating and attempting to restore it to normal service. The rest of Blue Waters subsystems remain in normal operation. Some transfers in and out of...
Read More -
Blue Waters Returned to Service
Created by: kingda 2019-10-01 20:46:38 (Channels: scheduledoutages|unscheduledoutages|systemnotices)
Blue Waters returned to service at 8:45PM following today's scheduler maintenance.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Scheduled systems testing period extended to 8PM Central
Created by: jenos 2019-10-01 16:54:57 (Channels: scheduledoutages|unscheduledoutages|systemnotices)
Update: This testing period will be extended to 8PM due to unanticipated system-related delays. A scheduled systems testing period will take place on Tuesday, October 1st from 7AM to 5PM 8PM Central, necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file...
Read More -
Blue Waters outage
Created by: mshow 2019-10-01 10:15:54 (Channels: unscheduledoutages|systemnotices)
Multiple cabinets have failed within the Blue Waters system. The failed area will be bypassed and operations will continue.
-
Reminder: Scheduled systems testing period on Tuesday, October 1st from 7AM to 5PM Central
Created by: kingda 2019-09-30 16:44:24 (Channels: scheduledoutages|systemnotices)
A scheduled systems testing period will take place on Tuesday, October 1st from 7AM to 5PM Central, necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file systems will remain accessible.
-
Scheduled systems testing period on Tuesday, October 1st from 7AM to 5PM Central
Created by: kingda 2019-09-23 20:36:26 (Channels: scheduledoutages|systemnotices)
A scheduled systems testing period will take place on Tuesday, October 1st from 7AM to 5PM Central, necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file systems will remain accessible.
-
Blue Waters Returned to Service
Created by: kingda 2019-09-12 16:28:51 (Channels: scheduledoutages|systemnotices)
Blue Waters returned to service at 4:00PM following today's scheduler maintenance.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Scheduled systems testing period on Thursday, September 12th from 10AM to 5PM Central
Created by: gbauer 2019-09-12 09:51:55 (Channels: scheduledoutages|systemnotices)
A scheduled systems testing period will take place on Thursday, September 12th from 10AM to 5PM Central, necessitating a shutdown of the job scheduler. Compute nodes will not be available during the test period. Login nodes and file systems will remain accessible.
-
Notice: Brief suspension of job scheduling Friday, 14:00-18:00
Created by: jenos 2019-07-17 12:05:02 (Channels: scheduledoutages)
Blue Waters will suspend job operations during a brief (4 hour) period on Friday, 7/19, from 14:00-18:00 CT. The system is expected to remain up and available during that period, but there will be a temporarily increased potential for an unexpected facility-related interrupt. This action is to minimize any risk to...
Read More -
Blue Waters: NPCF Power Issue Update
Created by: tbouvet 2019-07-06 03:45:32 (Channels: unscheduledoutages|systemnotices)
Blue Waters: NPCF Power Issue 7/5/2019 3PM
All Blue Waters Resources are available except for the compute nodes. Blue Waters Computes are being rebooted and all running jobs were lost. No RTS eta yet.
-
Blue Waters Returned to Service
Created by: bbode 2019-07-05 17:57:32 (Channels: unscheduledoutages|systemnotices)
Blue Waters has been rebooted and returned to service at 5:55PM following a power interuption earlier this afternoon. All running jobs were lost due to the outage.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters: NPCF Power Issue, Scheduler paused expect full reboot
Created by: tbouvet 2019-07-05 13:32:15 (Channels: unscheduledoutages|systemnotices)
A power outage at the building housing the Blue Waters system has caused a service interruption; the Login Nodes, Network, Storage, Compute Nodes, and Near-line Storage may be unavailable. It is unknown at this time when a return to service can be expected. Watch the Blue Waters portal blog for updates.
-
Blue Waters: NPCF Facility Power Maintenance Complete
Created by: tbouvet 2019-06-24 06:07:40 (Channels: scheduledoutages|systemnotices)
The NPCF facility power maintenance is complete and Blue Waters has returned to service at 06:00 hours.
Please email help+bw@ncsa.illinois.edu to report any issues.
-
Blue Waters Scheduled Maintence Extended until June 24 6AM CT
Created by: tbouvet 2019-06-24 05:18:12 (Channels: scheduledoutages|systemnotices)
The scheduled maintenance has been extended until June 24th 6AM due to technical difficulties. We apologize for the delay in Return to Service. Availability details: Subsystem / Services Projected Interruption Duration Compute June 23 05:00 - June 24 06:00 Login / Filesystem June 23 05:00 - June 24 06:00 Scheduler June 23 05:00 - June 24...
Read More -
Reminder: Blue Waters Scheduled Maintenance June 23rd 2019 5AM
Created by: tbouvet 2019-06-22 14:39:23 (Channels: scheduledoutages|systemnotices)
Reminder of Maintenance, On Sunday June 23, electrical maintenance is scheduled to take place at the NPCF Building which hosts Blue Waters. Therefore, beginning at 05:00 CT, all Blue Waters subsystems will be unavailable. The outage is expected to last until 03:00 the following morning, June 24. During this interruption, Blue Waters and...
Read More -
Blue Waters Scheduled Maintenance
Created by: jenos 2019-06-17 12:10:59 (Channels: scheduledoutages)
On Sunday June 23, electrical maintenance is scheduled to take place at the NPCF which hosts Blue Waters. Therefore, beginning at 05:00 CT, all Blue Waters subsystems will be unavailable. The outage is expected to last until 03:00 the following morning, June 24. During this interruption, Blue Waters and its subsystems will...
Read More -
Blue Waters: Nearline Tape Library Has Returned To Service
Created by: tbouvet 2019-05-31 12:58:05 (Channels: unscheduledoutages|systemnotices)
The Nearline subsystem reboot completed at 11:30AM today.
All existing transfers should resume as Nearline was returned to service.
-
Nearline Tape Library System Reboot
Created by: bbode 2019-05-31 09:28:04 (Channels: unscheduledoutages|systemnotices)
The Nearline subsystem is currently undergoing an emergancy reboot to clear multiple issues. It is expected to return to service by 1PM today.
All existing transfers will resume once Nearline returns to service.
-
Blue Waters Has Returned to Service
Created by: tbouvet 2019-05-16 17:15:55 (Channels: unscheduledoutages|systemnotices)
The storage issue on Blue Waters projects file system has been resolved and the system returned to normal operations at "5:07" PM CT. Any teams who were impacted by the file system issue have been contacted individually. The scheduler has resumed normal operations. Thank you for your patience while this was...
Read More -
Blue Waters Project File System Update
Created by: tbouvet 2019-05-16 09:29:40 (Channels: unscheduledoutages|systemnotices)
We are in the process of running a file system check and repair of a small portion of the projects file system. When that is complete we will access the results and take appropriate action. Update: File system repair continues and is expected to last until late this afternoon (5/16). If...
Read More -
Blue Waters Project File System Issue
Created by: tbouvet 2019-05-15 15:20:25 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, Blue Waters is currently experiencing a storage server issue for a portion of the projects filesystem. As a result, all I/O transactions targeting the affected storage server will block. A single storage server supplies a small fraction of the file system data and all remaining storage servers continue normal operation....
Read More -
Blue Waters OSS Return to Service
Created by: squaire3 2019-05-12 12:33:07 (Channels: unscheduledoutages|systemnotices)
The storage issue on Blue Waters was resolved and the system returned to normal operations at 12:30 PM CT. Scheduler has also been resumed.
-
Blue Waters OSS Failover
Created by: squaire3 2019-05-12 09:18:53 (Channels: unscheduledoutages|systemnotices)
Blue Waters is currently experiencing a storage server failover on the OSS file system that began at 8:08 AM CT. As a result, all I/O transactions targeting the affected storage server will block until the failover completes on all clients. A single storage server supplies a small fraction of the...
Read More -
Nearline Tape Library System Maintenance
Created by: glasgow 2019-04-29 16:55:33 (Channels: unscheduledoutages|systemnotices)
Nearline maintenance work is complete and the service has returned to full operations as of: 1600hrs, April 29th, 2019 --- Nearline is undergoing emergency service on one tape library. The work is related to fallout from last week's hardware service and is expected to take approximately 10 hours to complete. Some files may...
Read More -
Nearline Tape Library System Maintenance
Created by: glasgow 2019-04-29 09:42:49 (Channels: unscheduledoutages|systemnotices)
Nearline is undergoing emergency service on one tape library. The work is related to fallout from last week's hardware service and is expected to take approximately 10 hours to complete. Some files may not be accessible during that time.
Start time: 0945hrs to ~ 2000hrs, April 29, 2019
-
Nearline Tape Library System Maintenance
Created by: glasgow 2019-04-22 15:04:14 (Channels: scheduledoutages|systemnotices)
One of Nearline's four tape libraries will undergo maintenance to correct power control problems. This library system will be unavailable for four hours while the work is conducted. Files stored in the library will be unavailable for staging during that time. Retrieval jobs from Globus will wait for the files...
Read More -
Blue Waters Return to Service
Created by: squaire3 2019-04-12 15:41:38 (Channels: unscheduledoutages|systemnotices)
The high-speed network issue on Blue Waters was resolved and the system returned to normal operations at 3:34 PM CT. Scheduler has also been resumed.
-
Blue Waters Scheduler Paused - HSN Issue
Created by: squaire3 2019-04-12 15:02:55 (Channels: unscheduledoutages|systemnotices)
Blue Waters is experiencing an issue on the high-speed network that began at 2:10 PM CT. System support staff are evaluating and attempting to restore normal service. Job scheduling is paused until the issue is resolved. The file systems and data transfer services are operating normally. Logins have been occationally hanging...
Read More -
Nearline Endpoint Paused for Storage Maintenance
Created by: glasgow 2019-04-08 23:14:20 (Channels: unscheduledoutages|systemnotices)
The Nearline endpoint has now been returned to normal operations. The Blue Water's Nearline endpoint will be paused beginning at 1700hrs CDT. New and current user actions/requests will be paused and will resume normal activity when the endpoint is released. No user action is necessary. This maintenance...
Read More -
Nearline Endpoint Paused for Storage Maintenance
Created by: glasgow 2019-04-08 16:27:20 (Channels: unscheduledoutages|systemnotices)
The Blue Water's Nearline endpoint will be paused beginning at 1700hrs CDT. New and current user actions/requests will be paused and will resume normal activity when the endpoint is released. No user action is necessary. This maintenance window will be used to conduct resource management operations that have been deferred...
Read More -
Blue Waters Returned to Service
Created by: tbouvet 2019-04-07 15:14:36 (Channels: unscheduledoutages|systemnotices)
Blue WAters Users,
The storage server issue on the scratch file system is resolved. I/O transactions initiated during the outage should have resumed when the Lustre target returned. Blue Waters has resumed normal operations.
-
Blue Waters Scheduler is paused 9:30 AM
Created by: tbouvet 2019-04-07 10:30:00 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, Blue Waters is currently experiencing a storage server issue for a small portion of the scratch filesystem. As a result, all I/O transactions targeting the affected storage server will block. A single storage server supplies a small fraction of the file system data and all remaining storage servers continue normal...
Read More -
BlueWaters cabinet failure
Created by: mshow 2019-03-23 16:44:36 (Channels: unscheduledoutages|systemnotices)
The cabinet has been restored.
-
BlueWaters repair complete
Created by: mshow 2019-03-23 05:13:22 (Channels: unscheduledoutages|systemnotices)
The cabinet has been restored
-
BlueWaters cabinet failure
Created by: mshow 2019-03-23 03:46:37 (Channels: unscheduledoutages|systemnotices)
A cabinet has shutdown resulting in job loss and an incomplete network configuration. It is unknown at this time when a return to service can be expected for that cabinet. Watch the Blue Waters portal blog for updates.
-
Blue Waters returned to service
Created by: mshow 2019-02-24 03:41:58 (Channels: mandatory|unscheduledoutages|systemnotices)
The high-speed network issue on Blue Waters was resolved and the system returned to normal operations at 3:30 AM CT.
-
Blue Waters High Speed Network issue
Created by: mshow 2019-02-23 23:43:19 (Channels: unscheduledoutages|systemnotices)
Blue Waters is experiencing an issue on the high-speed network that began at 9:48 PM CT. System support staff are evaluating and attempting to restore normal service. Job scheduling is paused until the issue is resolved. The file systems and data transfer services are operating normally. Interim updates will be posted on...
Read More -
Completed: Blue Waters Near-line Storage Maintenance
Created by: jenos 2019-02-07 21:51:57 (Channels: scheduledoutages)
The Blue Waters Near-line storage system hardware and software upgrade process has completed successfully as of 5:30pm Feb 7. Please report any issues to help+bw@ncsa.illinois.edu . Availability details: Subsystem / Services Projected Interruption Duration Compute No interrupt Login / Filesystem No interrupt Scheduler No interrupt Globus online endpoint (ncsa#BlueWaters) No interrupt Globus nearline endpoint (ncsa#Nearline) Feb 7 5am...
Read More -
Blue Waters Notice: System returned to service
Created by: jenos 2019-02-06 15:31:27 (Channels: unscheduledoutages)
Blue Waters Users:
The reboot is complete and the system has returned to service as of 3:14pm CT. Tomorrow's near-line storage maintenance will proceed as planned.
We apologize for any inconvenience.
-
Blue Waters Unplanned Reboot
Created by: bbode 2019-02-06 10:00:16 (Channels: unscheduledoutages)
Blue Waters Users, We experienced an issue that has the high speed network in an unrecoverable state. We have to reboot the system to recover and all running jobs will be lost. The login nodes and endpoints (ncsa#Nearline ncsa#BlueWaters) will remain available during the reboot. The current estimate for return to...
Read More -
Blue Waters Near-line Storage Maintenance Feb 7-8
Created by: jenos 2019-01-25 17:09:55 (Channels: scheduledoutages)
Maintenance for the Blue Waters Nearline storage system has been rescheduled after passing regression tests: it will undergo maintenance starting on Thursday, February 7th, possibly extending through February 8th. The system will be undergoing a significant software and hardware update that is expected to enhance performance and function. All other Blue Waters subsystems,...
Read More -
Blue Waters Notice: System returned to service
Created by: tbouvet 2019-01-12 13:09:15 (Channels: scheduledoutages|unscheduledoutages)
Blue Waters Users:
The reboot is complete and the system has returned to service.
We apologize for any inconvenience.
-
Blue Waters Notice: Unplanned Reboot
Created by: tbouvet 2019-01-12 09:58:18 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users, We experienced an issue that has the high speed network in an unrecoverable state. We have to reboot the system to recover and all running jobs will be lost. The login nodes and endpoints (ncsa#Nearline ncsa#BlueWaters) will remain available during the reboot. The current estimate for return to...
Read More -
Blue Waters Notice: Nearline System (HPSS) return to service
Created by: tbouvet 2019-01-10 15:24:20 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users:
The ncsa#Nearline endpoint is available and has returned to service at 3PM CT. We apologize for any inconvenience.
-Blue Waters
-
Blue Waters Notice: Nearline System (HPSS) remains unavailable.
Created by: tbouvet 2019-01-10 09:38:30 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users:
The ncsa#Nearline endpoint is paused pending repairs suffered to systems during the data center power failure on 1/09/19. We apologize for any inconvenience.
Please check back for an update on the situtation.
-Blue Waters
-
Blue Waters Notice: System has returned to service
Created by: jenos 2019-01-10 09:37:52 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users: The power outage at the NPCF building has been resolved. All Blue Waters services are now available with the exception of the Nearline storage system, which will take a bit longer to recover. The job scheduler has been resumed and login nodes have access re-enabled. There will be...
Read More -
Blue Waters Near-line Storage Maintenance Jan 23-25
Created by: jenos 2019-01-09 16:59:42 (Channels: scheduledoutages)
The Blue Waters Nearline storage system will undergo maintenance starting on Wednesday, January 23th, possibly extending a service outage through January 25th. The system will be undergoing a significant software and hardware update that is expected to enhance performance and function. All other Blue Waters subsystems, as summarized below, will...
Read More -
Blue Waters Notice: Power disruption
Created by: jenos 2019-01-09 16:21:59 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users: Update 4:20pm: All systems up except Nearline. Access remains restricted while performance tests complete. A power outage at the building housing the Blue Waters system has caused a service interruption; all running jobs were consequently terminated. All Blue Waters subsystems have been affected and are currently out of...
Read More -
Blue Waters Notice: System returned to service
Created by: jenos 2018-12-27 23:43:11 (Channels: unscheduledoutages|systemnotices)
Blue Waters Users:
The power outage at the Blue Waters building has been resolved and the compute nodes have been restarted. The Blue Waters system has been returned to normal operations at 11:40 PM CT. We apologize for any inconvenience.
-
Blue Waters Return to Service
Created by: tbouvet 2018-11-14 18:35:11 (Channels: unscheduledoutages)
The meta data server failover on the Home file system completed at 6:23 PM CT. I/O transactions initiated during the failover should have resumed normal operation when failover completed. Blue Waters has resumed normal operations.
-
Blue Waters File System Issue
Created by: tbouvet 2018-11-14 17:45:36 (Channels: unscheduledoutages)
Blue Waters is currently experiencing a meta data server failover on the home file system that began at 5:30 PM CT. As a result, all I/O transactions for the home filesystem will block until the failover completes on all clients. The rest of Blue Waters including the other filesystems are operating...
Read More -
Blue Waters Return to Service
Created by: tbouvet 2018-11-14 11:32:42 (Channels: unscheduledoutages)
The meta data server failover on the Home file system completed at 11:26 AM CT. The file system issue start at 10:45 AM. I/O transactions initiated during the failover should have resumed normal operation when failover completed. Blue Waters has resumed normal operations.
-
Blue Waters File System Issue
Created by: tbouvet 2018-11-14 11:19:47 (Channels: unscheduledoutages)
Blue Waters is currently experiencing a meta data server failover on the home file system that began at 11:15 AM CT. As a result, all I/O transactions for the home filesystem will block until the failover completes on all clients. The rest of Blue Waters including the other filesystems are operating...
Read More -
BlueWaters returned to service
Created by: mshow 2018-11-05 06:18:32 (Channels: unscheduledoutages)
The system has been returned to full service operation after a brief unscheduled outage.
-
BlueWaters system issue
Created by: mshow 2018-11-05 01:21:18 (Channels: unscheduledoutages|general)
The system has experienced a fault that will require a full system shutdown/reboot. This will take several hours before the system is returned to full operations.
-
Blue Waters Scheduled Maintenance - Template
Created by: jenos 2018-10-17 13:55:56 (Channels: scheduledoutages)
Blue Waters will undergo urgent maintenance on (DATE:Monday, October 15th), from (TIME: 8am until 7pm) in order to apply a security patch, as well as some other accumulated patches that have been awaiting a system interruption. The compute system will be unavailable for the entire duration; the login nodes may return earlier. Check the...
Read More -
Blue Waters Scheduled Maintenance Complete - Returned to Service
Created by: jenos 2018-10-15 21:15:30 (Channels: scheduledoutages)
We are pleased to inform you that Blue Waters has been returned to service as of 8:50pm on Oct 15th. Job queue backlog has been significantly lower than usual over the past week, resulting in reduced wait time for jobs to run in most cases. We recommend taking advantage of such...
Read More -
Blue Waters Scheduled Maintenance - Delayed Return to 9pm
Created by: jenos 2018-10-15 20:29:46 (Channels: scheduledoutages)
< Delayed return to service from 7pm to 9pm > Blue Waters will undergo urgent maintenance on Monday, October 15th, from 8am until 9pm in order to apply a security patch, as well as some other accumulated patches that have been awaiting a system interruption. The compute system will be unavailable for the entire...
Read More -
Blue Waters Scheduled Maintenance - Monday, October 15th
Created by: jenos 2018-10-14 08:40:08 (Channels: scheduledoutages)
Blue Waters will undergo urgent maintenance on Monday, October 15th, from 8am until 7pm in order to apply a security patch, as well as some other accumulated patches that have been awaiting a system interruption. The compute system will be unavailable for the entire duration; the login nodes may return earlier. Check the portal blog...
Read More -
Blue Waters Scheduled Maintenance: Return to Service
Created by: gbauer 2018-07-16 19:43:38 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Users, Blue Waters has been returned to service after a security update and programming environment update. CUDA 9.1 is now available as the default cudatoolkit but with gcc/4.9.3 remaining as the default GNU compiler. The default programming environment otherwise did not change to allow for an extended transition to...
Read More -
Blue Waters Scheduled Maintenance Update
Created by: jenos 2018-07-13 11:57:54 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Users, Please note the following scheduled maintenance update: The scheduled maintenance for CUDA 9.1 deployment on Blue Waters has been expanded to a near full system service outage that will span 13 hours on Monday July 16th, from 9am to 10pm. This adjustment will eliminate the need for an additional...
Read More -
Blue Waters XK (GPU) resource and HPSS Nearline resources will be unavailable from 9am to 12pm on Monday July 16th
Created by: kingda 2018-07-09 09:08:10 (Channels: scheduledoutages|systemnotices|softwareupdate)
Blue Waters Users, Blue Waters XK compute resource and HPSS ncsa#Nearline resource will be undergoing maintenance Monday (July 16th) morning beginning at 9 AM. The XK compute nodes are anticipated to be unavailable for up to 3 hours. The login nodes and XE compute resource will remain in service. A new Programming Environment will be installed...
Read More -
Blue Waters Partial Scratch Outage
Created by: tbouvet 2018-06-23 21:31:02 (Channels: unscheduledoutages|systemnotices)
Blue Waters experienced a newtork switch failure that resulted in a partial outage of the scratch filesystem (ost168-179) from 7:44 PM CT to 7:59 PM CT. Jobs that ended during this time may have been impacted. I/O transactions targeting the affected storage server should block until the ost targets returned...
Read More -
Blue Waters returned to full service
Created by: mshow 2018-06-12 10:23:37 (Channels: mandatory|unscheduledoutages|systemnotices)
Blue Waters has returned to full service after recovery from a power event.
-
Blue Waters system power interruption
Created by: mshow 2018-06-12 04:56:14 (Channels: mandatory|unscheduledoutages|systemnotices)
Thunderstorms have resulted in a power interruption of the BlueWaters System. This outage imacts both the compute nodes and all filesystems. Therefore, a full reboot will be necessary.Return to service is estimated to be approximately 10 am Centeral time.
BW Admin
-
Blue Waters has Returned to Service
Created by: tbouvet 2018-06-07 14:27:12 (Channels: unscheduledoutages|systemnotices)
Blue Waters has returned to full service at 2:14 PM CT. The issue encountered required a full system reboot to resolve. All running jobs were lost so please resubmit your jobs from latest checkpoint file if your job exited prematurely.
-
Blue Waters System Issue
Created by: tbouvet 2018-06-07 10:12:41 (Channels: unscheduledoutages)
Blue Waters is experiencing a full system issue that began at 6:30 AM CT. System support staff are evaluating and attempting to restore normal service but may require a full system reboot. Job scheduling is paused until the issue is resolved. Interim updates will be posted on the Blue Waters...
Read More -
Blue Waters has Returned to Service
Created by: tbouvet 2018-02-27 02:54:45 (Channels: scheduledoutages)
The maintenance was a success and Blue Waters was returned to service February 27 at 1:35AM, with ncsa#Nearline endpoint (HPSS) returned earlier February 26 at 10PM.
-
Reminder: Blue Waters Maintenance Monday, February 26, 2018, 6AM for 24hrs
Created by: tbouvet 2018-02-25 21:21:26 (Channels: scheduledoutages)
Reminder of upcoming maintenance: Blue Waters will be unavailable during scheduled maintenance Monday (February 26th) beginning at 6 AM for a duration of 24hrs. All Blue Waters resources will be unavailable including the Globus Online endpoints and login nodes. Interim updates will be posted on the Blue Waters Message of the...
Read More -
UPDATE: Blue Waters Maintenance Outage Monday, February 26, 2018, 6AM for 24hrs
Created by: tbouvet 2018-02-18 14:45:11 (Channels: scheduledoutages)
Corrected month in body of email below. Outage will be February 26, 2018. Blue Waters will be unavailable during scheduled maintenance Monday (February 26th) beginning at 6 AM for a duration of 24hrs. All Blue Waters resources will be unavailable including the Globus Online endpoints and login nodes. Interim updates will...
Read More -
Re: Blue Waters Compute Maintenance Outage Monday, January 22, 2017, between 7:00am and 3:00pm CST
Created by: hwleong 2018-01-22 13:28:29 (Channels: scheduledoutages)
The compute nodes maintenance was a success and Blue Waters has returned to full service at 1:05PM today. We apologize for any inconvenience this may have caused.
Please kindly report any issue to help+bw@ncsa.illinois.edu.
-
Blue Waters Compute Maintenance Outage Monday, January 22, 2017, between 7:00am and 3:00pm CST
Created by: kingda 2018-01-19 15:42:06 (Channels: scheduledoutages)
Blue Waters Users, Blue Waters compute hardware will be undergoing maintenance Monday (Jan 22th) morning beginning at 7 AM. The compute nodes are anticipated to be unavailable for up to 8 hours. The login nodes and data transfer nodes will remain in service. Interim updates will be posted on the Blue Waters Message of...
Read More -
Re: Blue Waters Nearline Maintenance and Globus-wide service outage Saturday, December 9, 2017, between 10:00am and 4:00pm CST
Created by: gbauer 2017-12-09 21:07:24 (Channels: scheduledoutages)
Globus service resumed at 2PM Central time today. The Blue Waters endpoints ncsa#BlueWaters and ncsa#Nearline are again accessible via Globus Online. Transfers initiated prior to the service outage have resumed. Blue Waters compute and login services were unaffected by the Globus outage.
Please report any issues to help+bw@ncsa.illinois.edu.
-
Blue Waters Nearline Maintenance and Globus-wide service outage Saturday, December 9, 2017, between 10:00am and 4:00pm CST
Created by: gbauer 2017-12-04 15:34:21 (Channels: scheduledoutages)
Blue Waters Users, We will take advantage of the scheduled Globus outage (Saturday, December 9th from 10am to 2pm) to perform maintenance on the Blue...
Read More