Postmortem September 17th outage and rollback

Buoyant Armiger · Sep 17, 2023

Spooky. Thank you for doing the needful.

Alaric the Visigoth · Sep 17, 2023

@Null If you're at liberty to say which raid where you using for your server?

Onni Kalsarikännit · Sep 17, 2023

Thanks for sacrificing your Sunday so we can say nigger and faggot, Null. You are my favorite niggerfaggot

It hasn't been adressed in this thread so far but tor is very broken.
See here (.st link)

Edit: posting and quoting works fine on tor again. all stickers also work again. Thx, Jersh!

Pill Cosby · Sep 17, 2023

Here's me right now thinking Liz-Fong-Jones who is a self admitted rapist; who also admitted to scum journalists being part of a collective who have committed to crimes to prevent a lawful US business from operating using tactics which are (I believe) a federal crime.

Pedophobe · Sep 17, 2023

The Hero of Kvatch said:
Kiwi Farms going down is like 9/11 for autistic people.

All those shitposts lost forever... like sneeds in the rain...

Nisse Hult · Sep 17, 2023

Nobody is better at taking down Kiwi Farms than Kiwi Farms.

Dollar Store Sentai · Sep 17, 2023

Real Fakeman said:
If four SSDs fail at the exact same time I would suspect that it's a problem with the firmware, controller, backplane or software.

I had two HDD's die within days of each other because they were the same batch, bought and installed at the same time. It is more common than you think.

Null · Sep 17, 2023

chat up

Nottafed · Sep 17, 2023

Assuming its a pci-e x8 nvme controller, that likely shit the bed, which would cause all 4 to show dead simultaneously. Sucks because it's not something people just have laying around.

Kendall Motor Oil · Sep 17, 2023

Now someone needs to buy him Optane sticks when they flood the used market.

keytar · Sep 17, 2023

I CAN sneed

Moths · Sep 17, 2023

Throw a coin to your slobbermutt
Great to see that your hardware always seems to have a lifetime of a few years

Don Yagon · Sep 17, 2023

Can't post normally since the post form doesn't load because of borked CSP. Had to use an addon that temporarily disables CSP just to post.

Upd: seems to be fixed now.

Patrick Bait-man · Sep 17, 2023

9/17
Never forget the day the quadruple drives failed and the posts that died.

potatoman · Sep 17, 2023

All of the drives at once is bad luck. Glad Null wasn’t so retarded he didn’t do backups.

Lake · Sep 17, 2023

Ayyyy the quick fix was an unexpected surprise. Thanks, Jersh.

WelperHelper99 · Sep 17, 2023

Nisse Hult said:
Nobody is better at taking down Kiwi Farms than Kiwi Farms.

The only thing that can kill the farms is itself

IT Dude · Sep 17, 2023

Made an account to clarify things. I'm the dude that helps Null with this stuff when he needs it. No, I don't monitor the server cause it's Null's. But maybe I'll setup some stuff for Null to monitor stuff including drive health.

The storage on the server use ZFS pools. The SATA SSD array (SNEED Pool) bypasses the RAID Controller and is entirely JBOD passthrough.
The 4 x NVMe drives are U.2 drives in the front and are in FEED ZFS Pool. The backplane handles SATA, SAS, and U.2. U.2 has it's own area for those drives and connects to the motherboard with a OCuLink cable to a JNVMe header.
The 4 x 1.6TB WD Ultrastar DC SN620 NVMe U.2 drives disappeared from the server last night. But before that, the kernel reported write errors to one of them.

There are a few reasons this could have happened, from most likely to less likely:
- BIOS/UEFI Firmware stopped communicating with the NVMe drives. This happened with a certain BIOS setting when it was initially setup
- The drives actually died from the workload. Unlikely considering these can handle 1.7 Drive Writes per day. But very feasible. These are 2nd hand enterprise drives
- The backplane/JNVMe headers exploded. Super unlikely

The drives are likely still alive, and the server's firmware probably took a shit.
We need to inspect the server's BIOS settings or possibly even update the firmware. Then we can determine if the drives are toast or useless.
There was no foul play at hand here. At best a firmware bug, at worse, the drives sudoku'd.

Toolbox · Sep 17, 2023

Don Yagon said:
Can't post normally since the post form doesn't load because of borked CSP. Had to use an addon that temporarily disables CSP just to post.
View attachment 5343500
View attachment 5343503
View attachment 5343501

That brokeness appears to be affecting the onion site and only the onion site. Tried to login multiple times there before hopping on .st, everything's good on the clear web version.

Quandaries Sage · Sep 17, 2023

Ty dear feeder for your hard work
We appreciate it

Postmortem September 17th outage and rollback

Buoyant Armiger

N'wahs in Bravil

Alaric the Visigoth

The Mumbler

Onni Kalsarikännit

VIOLENCE IS GOLDEN

Pill Cosby

my pronouns are fag/faggot

Pedophobe

Anime avatar

Nisse Hult

Vem fan är Nisse Hult?

Dollar Store Sentai

Null

Ooperator

Nottafed

Justice has a price. That price is freedom.

Kendall Motor Oil

keytar

Moths

Buzz Buzz

Don Yagon

Indoor Garloid Farmer

Patrick Bait-man

The Perfect Bait-er

potatoman

More Retarded Than The Average Bear

Lake

Now with 30% less dopamine!

WelperHelper99

Unlimited Sneed Works

IT Dude

Special Helper

Toolbox

Trusted the PlQn

Quandaries Sage

Lol, lmao even