submitted 2 weeks ago* (last edited 2 weeks ago) by Tea@programming.dev to c/technology@lemmy.world
[-] NeoNachtwaechter@lemmy.world 36 points 2 weeks ago

The truth is stored on their harddisks. But the truth may become very illegal very soon.

They'd better move the whole thing out of the USA now.

[-] pogmommy@lemmy.ml 26 points 2 weeks ago

I love the IA but they need to be infinitely more decentralized like yesterday

[-] douglasg14b@lemmy.world 5 points 2 weeks ago

And funded by who?

It's nice to say that it should be decentralized, but who is funding the development of that? Are you donating to IA?

[-] SoftestSapphic@lemmy.world 11 points 2 weeks ago

TBH this is an important enough resource the UN should fund it.

They won't but they should.

[-] pogmommy@lemmy.ml 3 points 2 weeks ago

I mean, yeah, like another user said, ideally it would be in the interest of groups which claim to have an interest in some form of democracy. But additionally, the ability to set up browsable partial mirrors, which could be hosted by miscellaneous nonprofits and individuals both within and outside of the US, would be a massive first step to preserving the information that IA stores. The fact that attacks on their servers can eradicate all access to the information they store is troubling given how many enemies they've made simply through the work they do.

[-] douglasg14b@lemmy.world 3 points 2 weeks ago

The actual volume of data is kind of insane for distribution. You start running into many scale problems.

At ~70PB of storage, and assuming it's stored redundantly (two full copies, so ~140PB of raw disk), at ~$15/TB JUST for the HDDs you're talking $2.1 million in hard drives alone.

Installation, hardware, and facility costs will at least quintuple that number, if we're being crazy conservative. Making the cost to stand up an archive around $10.5 million?
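
The back-of-the-envelope numbers above can be reproduced with a few lines of Python. This is only a sketch of that estimate: the function names are made up for illustration, and the factor of 2 for redundancy is an assumption chosen because it is what makes 70PB at $15/TB come out to $2.1 million.

```python
TB_PER_PB = 1000  # decimal units, as drive vendors count them


def drive_cost_usd(capacity_pb, usd_per_tb, copies):
    """Cost of the raw hard drives only, for `copies` full copies of the data."""
    return capacity_pb * TB_PER_PB * usd_per_tb * copies


def build_cost_usd(drive_cost, overhead_multiplier):
    """Scale the drive cost by a rough multiplier for installation, hardware, facilities."""
    return drive_cost * overhead_multiplier


# Figures from the comment; copies=2 is an assumed redundancy level.
drives = drive_cost_usd(capacity_pb=70, usd_per_tb=15, copies=2)
total = build_cost_usd(drives, overhead_multiplier=5)

print(f"drives only: ${drives:,}")
print(f"stand-up estimate: ${total:,}")
```

Changing `copies` or `overhead_multiplier` shows how sensitive the total is to those two guesses.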


During this process I found out that their finances are public and there is more reliable information out there:

  • $2/GB for permanent storage, overall ($2,000/TB)

The cost to store the data and run the archive is a whopping $36 million per year at the moment.

Which, if you consider what they do, is incredibly cheap. And easily fundable by even a small municipality, never mind a large nation.

[-] turmacar@lemmy.world 2 points 2 weeks ago* (last edited 2 weeks ago)

It would be interesting to have encrypted blobs scattered around volunteer computers/servers, like a storage version of BOINC / @HOME.

People tend to have dramatically less spare storage space than spare compute time though, and it would need to be very redundant to be guaranteed not to lose data.
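
To put a rough number on "very redundant", here is a small sketch. It assumes each volunteer node holding a copy of a blob disappears independently with the same probability, which real volunteer networks will not strictly satisfy, and the probabilities used are illustrative, not measurements of anything.

```python
def loss_probability(p_node_lost, replicas):
    """Chance that every replica of one blob is gone, assuming independent node losses."""
    return p_node_lost ** replicas


def replicas_needed(p_node_lost, target_loss):
    """Smallest replica count that pushes the per-blob loss chance to target_loss or below."""
    if not 0 < p_node_lost < 1:
        raise ValueError("p_node_lost must be strictly between 0 and 1")
    if not 0 < target_loss < 1:
        raise ValueError("target_loss must be strictly between 0 and 1")
    replicas = 1
    while loss_probability(p_node_lost, replicas) > target_loss:
        replicas += 1
    return replicas


# Hypothetical inputs: half of all volunteers vanish, and we want at most
# a one-in-a-million chance of losing any given blob.
print(replicas_needed(p_node_lost=0.5, target_loss=1e-6))
```

With flaky volunteers the replica count climbs quickly, which multiplies the total storage the network has to provide.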

[-] General_Effort@lemmy.world 2 points 2 weeks ago

Well, not to Europe. They've always been illegal here. I don't know where they could even go.

[-] rickrolled767@ttrpg.network 11 points 2 weeks ago

They're illegal in Europe? Could you elaborate a bit on that?

[-] General_Effort@lemmy.world 9 points 2 weeks ago

In practice, copyright would be the big problem. There is no Fair Use in Europe. There is no difference between what they do and Anna's Archive or LibGen. As far as copyright people are concerned, this is just "theft" on a gigantic scale.

Then there's the GDPR. As far as the EU is concerned, this is one huge human rights violation. The GDPR does allow for archives, but figuring out how the IA should operate would take some litigation. I doubt they would be allowed to provide the Wayback Machine.

[-] rickrolled767@ttrpg.network 3 points 2 weeks ago

Gotcha. Thanks for explaining it. I'm in the US, so I was really curious about what was different in the EU that would cause problems for them.

[-] dan@upvote.au 34 points 2 weeks ago

I didn't realise they do tours every Friday at 1pm. I'll have to visit some time!

I really hope the lawsuits don't kill the Internet Archive. It's an important resource.

[-] arafatknee@lemmy.dbzer0.com 27 points 2 weeks ago* (last edited 2 weeks ago)

The Internet archive is like the digital equivalent of the Svalbard Global seed vault.

https://en.m.wikipedia.org/wiki/Svalbard_Global_Seed_Vault

[-] witx@lemmy.sdf.org 7 points 2 weeks ago

Is there a way we can help? E.g., torrent seeding of the content?

[-] General_Effort@lemmy.world 12 points 2 weeks ago

http://warrior.archiveteam.org/


The ArchiveTeam Warrior is a virtual archiving appliance. You can run it to help with the ArchiveTeam archiving efforts. It will download sites and upload them to our archive — and it’s really easy to do!

The warrior is a virtual machine, so there is no risk to your computer. The warrior will only use your bandwidth and some of your disk space.

The warrior runs on Windows, OS X and Linux. You’ll need VirtualBox (recommended), VMware or a similar program to run the virtual machine.

[-] killeronthecorner@lemmy.world 3 points 2 weeks ago

I wonder if I can run a resource-constrained instance of this on ESXi... something to look into this weekend, thank you.

[-] boonhet@lemm.ee 4 points 2 weeks ago

It barely uses any resources. You can have up to 6 active jobs and most of the time you'll be waiting for an upload slot to open up so you can get one of your 6 uploaded.

You can just set it and forget it unless you have a bandwidth cap and set it on a video site.

[-] dan@upvote.au 3 points 2 weeks ago

They push the VM images, but there's a Docker container available too.

[-] Appoxo@lemmy.dbzer0.com 2 points 2 weeks ago

That's the whole reason it's 100TB uploaded...

this post was submitted on 23 Mar 2025
251 points (97.7% liked)
