just p-hack it. (mander.xyz)

submitted 1 month ago by fossilesque@mander.xyz to c/science_memes@mander.xyz

[-] uuldika@lemmy.ml 4 points 1 month ago

if they existed they'd be killer for RL. RL is insanely unstable when the distribution shifts as the policy starts exploring different parts of the state space. you'd think there'd be some clean approach to learning P(Xs|Ys) that can handle continuous shift of the Ys distribution in the training data, but there doesn't seem to be. just replay buffers and other kludges.
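for anyone who hasn't run into the replay buffer kludge mentioned above, here is a minimal sketch of one, assuming the plain uniform-sampling variant. the class name `ReplayBuffer`, its methods, and the transition layout are illustrative choices, not any particular library's API. the idea is only that each training batch mixes transitions collected under older and newer policies, which softens the shift in the data distribution rather than solving it.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size FIFO store of transitions, sampled uniformly at random.

    Hypothetical illustration: names and layout are assumptions.
    """

    def __init__(self, capacity, seed=None):
        # deque with maxlen drops the oldest transition once capacity is hit
        self.buffer = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state, done):
        # store one transition as a tuple
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # uniform sampling without replacement; a batch can contain
        # transitions gathered under several past versions of the policy
        batch = self.rng.sample(list(self.buffer), batch_size)
        states, actions, rewards, next_states, dones = zip(*batch)
        return states, actions, rewards, next_states, dones

    def __len__(self):
        return len(self.buffer)
```

usage is roughly: call `add` after every environment step, and once `len(buf)` exceeds the batch size, call `sample` to get a training batch. the capacity sets how much stale data stays in the mix, which is exactly the tuning knob that makes it feel like a kludge.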

this post was submitted on 24 Jun 2025

323 points (98.2% liked)

Science Memes

Welcome to c/science_memes @ Mander.xyz!

A place for majestic STEMLORD peacocking, as well as memes about the realities of working in a lab.

Rules

Don't throw mud. Behave like an intellectual and remember the human.
Keep it rooted (on topic).
No spam.
Infographics welcome, get schooled.

This is a science community. We use the Dawkins definition of meme.

Research Committee

!spiders@lemmy.world

founded 2 years ago

MODERATORS

fossilesque@mander.xyz

SciBot@mander.xyz

fossilesque@lemmy.dbzer0.com