Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

macos11.0 release hosts are both offline #3179

Closed
targos opened this issue Feb 5, 2023 · 15 comments
Closed

macos11.0 release hosts are both offline #3179

targos opened this issue Feb 5, 2023 · 15 comments

Comments

@targos
Copy link
Member

targos commented Feb 5, 2023

release-nearform-macos11.0-arm64-1 is offline and its IP address is unreachable
release-macstadium-macos11.0-arm64-1 is offline and refuses ssh connections

@richardlau
Copy link
Member

I've sent an email to our NearForm contacts regarding the NearForm machine. (FYI @efrisby).

@mhdawson
Copy link
Member

mhdawson commented Feb 6, 2023

@UlisesGascon is

release-macstadium-macos11.0-arm64-1 is offline and #3084

Something you can help with?

@UlisesGascon
Copy link
Member

I can help and see if I can restore the machine, but I will need some support to re-ansible as we did for release-orka-10.15-x64-1 in the past

@targos
Copy link
Member Author

targos commented Feb 7, 2023

Before re-ansibling, maybe there is a secondary way to access the machine from Macstadium's infra/network?

@UlisesGascon
Copy link
Member

I will check the machine and share the status before I do a destructive or non-reversible action 👍

@UlisesGascon
Copy link
Member

Here is an update on what I have tried. Thanks to @targos for the great support:

  • I reviewed the internal documents about MacStadium but did not find a new way to access the machines.
  • The MacStadium web UI does not offer any option to access the machine, but the stats confirmed that they are connected and running.
  • I hard rebooted all the bare metal machines (including the test ones, macstadium macos11 immediate closed connection  #3084), but even after the machines were back online, none of them had improved connectivity.
  • I tried to VNC connect with them following an official tutorial (from 10 years ago) but had no luck, possibly because the username and password were not the same as in the internal documentation or the VNC Server is disabled (as expected).
  • I also reviewed the Bare Metal official documentation but had no luck.
  • I attempted to communicate to the machines from the Orka VPN but was unsuccessful.

I see two options: we can raise a ticket and ask for support, pointing to this issue, or we can add one or several additional release VMs in the Orka platform using the backups I made. We would only need to register the new machines in the inventory and re-Ansible them.

@richardlau
Copy link
Member

I see two options: we can raise a ticket and ask for support

Am I misremembering or did @AshCripps try that before (or perhaps it was a different VM)?

@UlisesGascon
Copy link
Member

In the meantime I will add a new macos11 in Orka for releases

UlisesGascon added a commit to UlisesGascon/build that referenced this issue Feb 7, 2023
- base image: `macos11-x64-1_11012023`
- Ref: nodejs#3179 (comment)
@AshCripps
Copy link
Member

I see two options: we can raise a ticket and ask for support

Am I misremembering or did @AshCripps try that before (or perhaps it was a different VM)?

Yes it was a very similar issue and took some effort to get them to acknowledge it - should still be in the system somewhere of what I asked for.

@richardlau
Copy link
Member

In the meantime I will add a new macos11 in Orka for releases

Ah, forgot that these are arm64 machines/VMs and therefore not in Orka.

@UlisesGascon
Copy link
Member

UlisesGascon commented Feb 7, 2023

🎉 Good News! I created a new bare metal instance, and the machine is up and running.

So... here is the plan:

So... here is the NEW plan

  • I will ask support to restore the credentials for the arm machines (ticket)
  • Check that SSH is working in all the machines with the new credentials
  • Add build SSH key to test-macstadium-macos11.0-arm64-3
  • Re-ansible test-macstadium-macos11.0-arm64-3
  • Re-ansible test-macstadium-macos11.0-arm64-4
  • I will ask release team to reansible the release one (release-macstadium-macos11.0-arm64-1)

UlisesGascon added a commit to UlisesGascon/build that referenced this issue Feb 7, 2023
Signed-off-by: Ulises Gascon <UlisesGascon@users.noreply.github.com>

---------

- Ref: nodejs#3179 (comment)

- There is a new machine with IP `207.254.55.235` that replace the IP `207.254.38.74`, but the machine keeps the same name `release-macstadium-macos11.0-arm64-1`
- The new machine has a different hardware `AS/M1/8C/16G/1T/SSD/10G` with product reference `Mac mini G5K`. The old machine has the hardware `AS/M1/8C/8G/256G/SSD/1G` with product reference `Mac mini G5A`
UlisesGascon added a commit to UlisesGascon/build that referenced this issue Feb 8, 2023
Signed-off-by: Ulises Gascon <UlisesGascon@users.noreply.github.com>

---------

- Ref: nodejs#3179 (comment)

- There is a new machine with IP `207.254.55.248` that replace the IP `207.254.38.89`, but the machine keeps the same name `test-macstadium-macos11.0-arm64-4`
- The new machine has a different hardware `AS/M1/8C/16G/256G/SSD/10G` with product reference `Mac mini G5J`. The old machine has the hardware `AS/M1/8C/8G/256G/SSD/1G` with product reference `Mac mini G5A`
@UlisesGascon UlisesGascon self-assigned this Feb 8, 2023
UlisesGascon added a commit to UlisesGascon/build that referenced this issue Feb 8, 2023
Signed-off-by: Ulises Gascon <UlisesGascon@users.noreply.github.com>

---------

- Ref: nodejs#3179 (comment)

- There is a new machine with IP `207.254.55.247` that replace the IP `207.254.38.86`, but the machine keeps the same name `test-macstadium-macos11.0-arm64-3`
- The new machine has a different hardware `AS/M1/8C/16G/256G/SSD/10G` with product reference `Mac mini G5J`. The old machine has the hardware `AS/M1/8C/8G/256G/SSD/1G` with product reference `Mac mini G5A`
@richardlau
Copy link
Member

FWIW the Nearform machine is now back online. (Thank you @efrisby.)

@richardlau
Copy link
Member

I've reansibled the release and two test macos11 arm64 macstadium machines.
Currently running a release rebuild on today's nightly (osx11-tar happened to fail) with just osx11-tar and osx11-pkg: https://ci-release.nodejs.org/job/iojs+release/9138/

@richardlau
Copy link
Member

Currently running a release rebuild on today's nightly (osx11-tar happened to fail) with just osx11-tar and osx11-pkg: https://ci-release.nodejs.org/job/iojs+release/9138/

Build passed. Release machine looks good.

@UlisesGascon
Copy link
Member

Thanks for the conformation @richardlau. I will close the issue then 😉

targos pushed a commit that referenced this issue Apr 12, 2023
- base image: `macos11-x64-1_11012023`
- Ref: #3179 (comment)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants