14
submitted 1 month ago* (last edited 1 month ago) by Passerby6497@lemmy.world to c/selfhosted@lemmy.world

So I had a micro PC that was running one of my core services and it only supports NVMe drives. Unfortunately, this little guy cooked itself and I'm not in a position to replace the drive. The system is still good and is fairly powerful, so I want to be able to reuse it.

I'm thinking I want to set up some kind of netboot appliance on another server to be able to allow me to boot the system without ever having a local disk. One thing I want to is run some docker images (specifically Frigate) but i wont be able to write anything to persistent storage locally. NFS shares are common in my setup.

Is it even possible to make a 'gold image' of a docker host and have it netboot? I expect that memory limitations (16GB) will be my main issue, but I'm just trying to think of how to bring this system back into use. I have two NAS appliances that I can use for backend long term storage (where I keep my docker files and non-database files anyway), so it shouldn't be too difficult to have some kind of easily editable storage solution. I don't want to use USB drives as persistent storage due to lifespan concerns from using them in production environments.

top 25 comments
sorted by: hot top controversial new old
[-] mosiacmango@lemm.ee 6 points 1 month ago* (last edited 1 month ago)

Id be pretty wary of using any system that "cooked" an nvme. That not the sign of an actual healthy system.

Was the failure just heat damage?

[-] Passerby6497@lemmy.world 3 points 1 month ago

I'm actually not 100% what killed the drive. It could have been an issue with the drive wearing out, but my services didn't write much locally and it wasn't super old so I assume its a heat issue with a fanless micro system. I try to write everything important to my NASs so I don't have to worry about random hardware failures, but this one didn't have backups configured before it failed. Other than the drive issue its been solid for 1.5-2 years of near constant uptime.

[-] Jimmycakes@lemmy.world 2 points 1 month ago

Unless you are writing petabytes the nvme did not just burn "wear" out. Probably shouldn't do anything until you figured out what caused this failure

[-] MangoPenguin@lemmy.blahaj.zone 1 points 1 month ago

Consumer SSDs generally only have a 200-600TBW rating, not petabytes. Its pretty easy to wear one out in a few years installed in a server.

[-] loganb@lemmy.world 1 points 1 month ago

Is the drive totally dead? Curious what SMART would report.

My gut feeling is that it's probably cheaper to buy a replacement m.2 than the hours of time to get netboot working but it could be a fun project!

[-] possiblylinux127@lemmy.zip 5 points 1 month ago

I wouldn't do this. If you are spending the time to do netboot you might as well get a proper boot drive.

[-] scrubbles@poptalk.scrubbles.tech 3 points 1 month ago

Kind of, but probably not. I started writing this and was like "totally it could be stateless". Docker runs stateless, and I believe when it starts it is still stateless (or at least could be mounted on a ramdrive) - but then I started thinking, and what about the images? Have to be downloaded and ran somewhere, and that's going to eat ram quickly. So I amend to you don't need it to be stateful, you could have an image like you talked about that is loaded every time (that's essentially what kubernetes does), but you will still need space somewhere as scratch drive. A place docker will places images and temporary file systems while it's running.

For state, check out docker's volume backings here: https://docs.docker.com/engine/storage/volumes/. You could use nfs to another server as an example for your volumes. Your volumes would never need to be on your "app server", but instead could be loaded via nfs from your storage server.

This is all nearing into kubernetes territory though. If you're thinking about netboot and automatically starting containers, and handling stateless volumes and storing volumes in a way that are synced with a storage server.... it might be time for kubernetes.

[-] umami_wasbi@lemmy.ml 1 points 1 month ago

I guess you can also use NFS/iSCSI for images too?

[-] DarkDarkHouse@lemmy.sdf.org 1 points 1 month ago

Correct, I run docker on a compute host that has no local storage. The host’s disks are on iSCSI LUNs.

[-] Passerby6497@lemmy.world 1 points 1 month ago

That's really good to know. Do you ever have issues writing database files on those disks? Database files on nfs mounts have been the bane of my existence.

[-] DarkDarkHouse@lemmy.sdf.org 2 points 1 month ago

FWIW I run only very small databases e.g., sqlite ones shipped with applications, but haven’t had any problems in about a year now, and nothing that wasn’t recoverable from backup.

[-] Passerby6497@lemmy.world 0 points 1 month ago

So I amend to you don’t need it to be stateful, you could have an image like you talked about that is loaded every time (that’s essentially what kubernetes does), but you will still need space somewhere as scratch drive. A place docker will places images and temporary file systems while it’s running.

Putting the image somewhere is easy. I've got TBs of space available on my NAS drives, especially right now with not acquiring any additional linux ISOs.

For state, check out docker’s volume backings here: https://docs.docker.com/engine/storage/volumes/. You could use nfs to another server as an example for your volumes. Your volumes would never need to be on your “app server”, but instead could be loaded via nfs from your storage server.

I'll check that out. If that allows me to actually write databases to disk on the nfs backing volume, that would be amazing. That's the biggest issue I run into (regularly).

This is all nearing into kubernetes territory though. If you’re thinking about netboot and automatically starting containers, and handling stateless volumes and storing volumes in a way that are synced with a storage server… it might be time for kubernetes.

I don't think I've ever looked into kubernetes. I'll have to look into that at some point... Any good beginner resources?

[-] scrubbles@poptalk.scrubbles.tech 1 points 1 month ago

Personally I suggest k3s, setting up a test cluster and playing with it. For volume management I use longhorn. It's a HUGE learning curve, but it's officially something companies will shell out big money for too if you're willing to learn it. Soup to nuts from setting up test cluster and playing with it all the way to all of my services running was about 4 months of tinkering for me- but I'll never go back

[-] OneCardboardBox@lemmy.sdf.org 3 points 1 month ago

You can host docker volumes over NFS, but the actual container images need to exist on a filesystem that supports overlay (which NFS does not) unless you want things to be slow as shit. And I really do mean miserably slow. A container image shared over NFS will take forever to spin up because it has to duplicate the entire container filesystem instead of using overlays, and then it'll blow up your disk usage by copying all these files around instead of overlaying them. It's truly unusable.

[-] mlaga97@lemmy.mlaga97.space 2 points 1 month ago

I've done something extremely similar with a custom NixOS iso for my docker VMs to make versioning and backups easier (golden image live disk with SSH+Docker+Dockge shared between all VMs + local persistent storage specific to each VM).

You can configure frigate via OCI container with custom config, as well as NFS mounts, SSH server, etc and then have a read-only live disk that boots up, mounts NFS share, and then starts up frigate.

[-] Passerby6497@lemmy.world 1 points 1 month ago

Do you have any info on the custom setup? Sounds like a fun project/learning experience.

And do you mean OCI like oracle cloud?

[-] mlaga97@lemmy.mlaga97.space 1 points 1 month ago

https://git.mlaga97.space/mlaga97/persistent-live-docker-flake has a builder for a live disk that will mount /dev/sda as ext4 to /persistent, and then start up dockge and whatever containers are present from the previous boot automagically.

OCI as in Open Container Initiative.

[-] thumdinger@lemmy.world 2 points 1 month ago* (last edited 1 month ago)

You mention frigate specifically. Were you running this on the system when the drive failed, or is this a future endeavour?

I bring this up because I also use frigate, and for some time I was running with a misconfigured docker compose that drove my SSD wearout to 40% in a matter of months.

Make sure that the tmpfs is configured per the frigate documentation and example config. If misconfigured like mine was, all of that IO is on disk. I believe the ramdisk is used for temp storage of camera streams, until an event occurs and the corresponding clip is committed to disk.

Good luck!

[-] Passerby6497@lemmy.world 1 points 1 month ago

Interesting, it was running on this system, so it may actually have been wear that killed the drive. I'll have to look into that config and see if it's worth getting a new nvme to throw into the cook box.

Thanks for that info!

[-] sugar_in_your_tea@sh.itjust.works 1 points 1 month ago

It's possible, but I recommend more RAM because you won't have any swap.

But adding a network boot is a hassle, I recommend buying another drive, it'll be cheaper than RAM esp since you don't need much storage.

[-] fruitycoder@sh.itjust.works 1 points 1 month ago

I do it with k3s right now on fedora. I like it personally.

Nice thing if you use k8s settings up persistent net storage with something like longhorn is an option too.

[-] Passerby6497@lemmy.world 3 points 1 month ago

Do you have any resources on that kind of setup? I appreciate that constructive advice!

[-] just_another_person@lemmy.world 0 points 1 month ago

Run a livecd of whatever Linux distro you like, and get a USB drive. Store persistent files on the USB stick.

[-] Passerby6497@lemmy.world 0 points 1 month ago

I don't want to use a USB for storage, because those aren't going to have a great lifespan in my experience. I've used them as the install media for something like ESX, but I'd rather not run a 'real' OS from a disk because I wasn't impressed with overall lifespan on some of the systems we managed at work.

[-] possiblylinux127@lemmy.zip 1 points 1 month ago* (last edited 1 month ago)

Then get a proper drive. Maybe a USB adapter?

this post was submitted on 05 Jan 2025
14 points (100.0% liked)

Selfhosted

41868 readers
354 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS