From lance at osuosl.org Mon Jul 26 22:06:18 2021
From: lance at osuosl.org (Lance Albertson)
Date: Mon, 26 Jul 2021 15:06:18 -0700
Subject: [osuosl-openpower] Backend storage issues

All,

We encountered a momentary issue with the backend storage while rebooting
some of the nodes. This has caused several VMs to be stuck in an unstable
state. We're going through all of the affected VMs and forcing a reboot on
them to resolve the issue.

Apologies for any outages this may have caused.

Thanks!

-- 
Lance Albertson
Director
Oregon State University | Open Source Lab

From lance at osuosl.org Tue Jul 27 00:28:48 2021
From: lance at osuosl.org (Lance Albertson)
Date: Mon, 26 Jul 2021 17:28:48 -0700
Subject: [osuosl-openpower] Backend storage issues

It looks as though all of the VMs were affected by this issue and will
require a hard reset. Since I need to reboot the hypervisors anyway, I'm
going to go ahead and do that on each node and let it shut down the VMs
instead of doing live migrations. This will impact every VM on the
cluster. I will try to spot-check the VMs to ensure they're back online
after the reboot.

Again, sorry for the issues this is causing.

Thanks-

-- 
Lance Albertson
Director
Oregon State University | Open Source Lab

From lance at osuosl.org Tue Jul 27 03:13:06 2021
From: lance at osuosl.org (Lance Albertson)
Date: Mon, 26 Jul 2021 20:13:06 -0700
Subject: [osuosl-openpower] Backend storage issues

I just completed the reboots. All the systems should be back online. Please
let me know if you have any issue with your VM(s).

Thanks!

-- 
Lance Albertson
Director
Oregon State University | Open Source Lab