1373

submitted 2 years ago by Oiconomia@feddit.de to c/memes@lemmy.ml

43 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] CodeInvasion@sh.itjust.works 17 points 2 years ago

This is done by combining a Diffusion model with ControlNet interface. As long as you have a decently modern Nvidia GPU and familiarity with Python and Pytorch it's relatively simple to create your own model.

The ControlNet paper is here: https://arxiv.org/pdf/2302.05543.pdf

I implemented this paper back in March. It's as simple as it is brilliant. By using methods originally intended to adapt large pre-trained language models to a specific application, the author's created a new model architecture that can better control the output of a diffusion model.

this post was submitted on 01 Oct 2023

1373 points (97.8% liked)

Memes

54889 readers

1234 users here now

Rules:

Be civil and nice.
Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.

founded 6 years ago

MODERATORS

gary_host_laptop@lemmy.ml

cyclohexane@lemmy.ml

cypherpunks@lemmy.ml