Hello openSUSE forums!
I will apologize in advance for this being so long-winded but I want to be as thorough as possible for the sake of clarity & being guided to the correct place.
Hardware overview:
Ryzen 7 1700
32GB Corsair DDR4
EVGA GTX 980Ti
2x Corsair 500GB SATA SSD (Linux on one, the other as a data drive)
1x 1TB Crucial nVME (Windows 10 for games)
EDUP AX3000 PCIe WiFi/Bluetooh (Intel Wi-Fi 6E AX210 chipset)
For the majority of 2020 I distro hopped & tried to find a distro that worked for me. But I constantly ran into an ungainly problem: my computer would lock up, be completely unresponsive, and I would be forced to reboot. Would happen 3-5 times a week. My system is dual-booted between Linux & Windows 10 (separate physical drives) and would never happen on Windows; never had power issues of any kind. After hopping between various Pop!OS, Ubuntu, and Debian I began to think it was something with Debian-based distros. So I switched to Fedora at the advice of a friend (and Fedora advocate) only to find the issue was far worse (6-10 times/week). After swapping everything except the PSU, motherboard, and CPU, I began to suspect maybe it wasn’t hardware & decided to try openSUSE (I learned Linux using SLE in college, seemed like a good idea). My system was far, far more stable. Since making the switch in September it’s only locked up twice that I recall until this past week. One time it did display a curious error, as instead of locking up it went into an unprompted shutdown. It dropped X11 entirely, had a terminal-like view, and showed “Central Processor Hardware Error…” very briefly (with some other text I wasn’t able to read) before turning off. Turning it back on seemed like everything was fine again & I went on with my life. That was a couple weeks ago, no issues since.
And this is where my unwise decision making comes in.
I was attempting to run updates via zypper, but was getting a lot of errors. In doing some research I found a post on Stack Exchange that suggested clearing the /tmp and /var/tmp directories. I ran ls -l to see what was in there & sure enough lots of various things that look pretty safe to delete. So I ran **rm -rf *** … except I was in my home directory (/home/jim not /home), I hadn’t cd’d out to /var/tmp - so I blew away my home folder.
In getting some advice from my Discord Linux group I found Test Disk, installed it, and attempted to recover my deleted directories to a .dd file on my primary openSUSE drive. Well, my system lock-up issue reappeared during that time. While copying the image.dd file to my data drive the system locked up. I thought that maybe it was just the GUI locked up & I could just let it sit for a few hours to let Test Disk finish. Eventually, didn’t really have a choice except to reboot the thing and when I did openSUSE predictably couldn’t load the GUI, it dropped me to a prompt for the root password & showed the root terminal prompt. I can also still boot into Windows 10 fine, so GRUB seems to be intact.
If I can fix the openSUSE boot somehow that would be nice, but not critical.
What I am concerned about is that my data drive, which I had formatted to exFAT so I could read it everywhere without a 4GB file limit, is now unreadable on anything. In fact, it shows up as a BTRFS drive. I tried running a couple BTRFS and exFAT utils to see if I could fsck it or anything but no good. exFAT utils doesn’t recognize it as an exFAT drive and BTRFS utils states it can’t find a BTRFS partition. I also tried fooling around with it in Windows 10 and macOS, and on Manjaro via my Pinebook Pro & a USB adapter: no luck anywhere.
TLDR: I broke my openSUSE install & corrupted my data drive. Would anyone be willing to provide some advice, or even point me to another post here on the forums, about how to tackle this? I would’ve looked already but I feel my situation is somewhat unique in that the exFAT drive now thinks it’s BTRFS. Also since it’s a SATA SSD I’ve no idea if I really can do anything at all.
And yes, I’ve already contact AMD about seeing if I can have my CPU warrantied (I think in my 10+ years of IT work I’ve only ever seen two legitimate CPU failures, so I know it’s terribly uncommon).
Thank you for your time in reading this, and on any advice you can offer!