Introducing Bitmagnet: A self-hosted BitTorrent indexer, DHT crawler, content classifier and torrent search engine with web UI, GraphQL API and Servarr stack integration (bitmagnet.io)

submitted 1 year ago by mgdigital@lemmy.world to c/selfhosted@lemmy.world

65 comments fedilink hide all child comments

I'm excited to announce the first alpha preview of this project that I've been working on for the past 4 months. I'm initially posting about this in a few small communities, and hoping to get some input from early adopters and beta testers.

What is a DHT crawler?

The DHT crawler is Bitmagnet’s killer feature that (currently) makes it unique. Well, almost unique, read on…

So what is it? You might be aware that you can enable DHT in your BitTorrent client, and that this allows you find peers who are announcing a torrent’s hash to a Distributed Hash Table (DHT), rather than to a centralized tracker. DHT’s lesser known feature is that it allows you to crawl the info hashes it knows about. This is how Bitmagnet’s DHT crawler works works - it crawls the DHT network, requesting metadata about each info hash it discovers. It then further enriches this metadata by attempting to classify it and associate it with known pieces of content, such as movies and TV shows. It then allows you to search everything it has indexed.

This means that Bitmagnet is not reliant on any external trackers or torrent indexers. It’s a self-contained, self-hosted torrent indexer, connected via the DHT to a global network of peers and constantly discovering new content.

The DHT crawler is not quite unique to Bitmagnet; another open-source project, magnetico was first (as far as I know) to implement a usable DHT crawler, and was a crucial reference point for implementing this feature. However that project is no longer maintained, and does not provide the other features such as content classification, and integration with other software in the ecosystem, that greatly improve usability.

Currently implemented features of Bitmagnet:

A DHT crawler
A generic BitTorrent indexer: Bitmagnet can index torrents from any source, not only the DHT network - currently this is only possible via the /import endpoint; more user-friendly methods are in the pipeline
A content classifier that can currently identify movie and television content, along with key related attributes such as language, resolution, source (BluRay, webrip etc.) and enriches this with data from The Movie Database
An import facility for ingesting torrents from any source, for example the RARBG backup
A torrent search engine
A GraphQL API: currently this provides a single search query; there is also an embedded GraphQL playground at /graphql
A web user interface implemented in Angular: currently this is a simple single-page application providing a user interface for search queries via the GraphQL API
A Torznab-compatible endpoint for integration with the Serverr stack

Interested?

If this project interests you then I'd really appreciate your input:

How did you get along with following the documentation and installation instructions? Were there any pain points?
There's a roadmap of high-priority features on the website - what do you see as the highest priority for near-term development?
If you're a developer, are you interested in contributing to the project?

Thanks for your attention. If you're interested in this project and would like to help it gain momentum then please give it a star on GitHub, and expect further updates soon!

you are viewing a single comment's thread
view the rest of the comments

[-] prim3r@lemmy.ca 10 points 1 year ago

This looks really cool! How resource intensive is this? What sort of storage requirements are there for this to be a reasonably reliable method of acquiring media? I'm probably just gonna find out myself. I've recently fully switched over to usenet, but this could make torrents pretty compelling again.

[-] kautau@lemmy.world 3 points 1 year ago* (last edited 1 year ago)

As someone interested in Usenet, what’s the best provider and client to start with in your opinion?

[-] prim3r@lemmy.ca 4 points 1 year ago

I've been using easynews/nzbgeek/nzbget with an arr stack on debian and it's worked well for me. I'm fairly new to usenet, so take this with a giant grain of salt.

[-] kautau@lemmy.world 1 points 1 year ago

Cool, thanks for the reply!

[-] Kushan@lemmy.world 2 points 1 year ago

Sabnzbd is probably the best choice of download client, fyi.

[-] CosmicApe@kbin.social 5 points 1 year ago

Linux program names are fucking wild

load more comments (2 replies)

load more comments (4 replies)

this post was submitted on 04 Oct 2023

521 points (98.5% liked)

Selfhosted

39677 readers

186 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

selfh.st Newsletter and index of selfhosted software and apps
awesome-selfhosted software
awesome-sysadmin resources
Self-Hosted Podcast from Jupiter Broadcasting

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 1 year ago

MODERATORS

HybridSarcasm@lemmy.world

HybridSarcasm@lemmy.hybridsarcasm.xyz