[-] calcopiritus@lemmy.world -1 points 1 day ago

I don't think Microsoft invented scrapping. Or LLM training.

Also, GitHub doesn't have an issue with Microsoft scraping its data. They can just directly access whatever data they want. And rate-limiting non logged in accounts won't affect Microsoft's LLM training at all.

I'm not defending a monopolist because of monopolist actions. First of all because GitHub doesn't have any kind of monopoly. There are plenty of git forges. And second of all. How does this make their position on the market stronger? If anything, it makes it weaker.

[-] calcopiritus@lemmy.world 0 points 1 day ago

No. I cannot find the flaws in my reasoning. Because you are not attacking my reasoning, you are saying that i am on the side of the bad people, and the bad people are bad, and you are opposed to the bad people, therefore you are right.

The world is more than black or white. GitHub rate-limiting non-logged-in users makes sense, and is the expected result in the age of web scrapping LLM training.

Yes, the parent company of GitHub also does web scrapped for the purpose of training LLMs. I don't see what that has to do with defending themselves from other scrappers.

[-] calcopiritus@lemmy.world 1 points 1 day ago

It's not the same making API costs unbearable for a social media user and limiting the rate non-logged-in users.

You can still use GitHub without being logged in. You can still use GitHub without almost any limit on a free account.

You cannot even use reddit on a third party app with an account with reddit gold.

[-] calcopiritus@lemmy.world -1 points 2 days ago

Or we just realize that GitHub without logging in is a service we are getting for free. And when there's something free, there's someone trying to exploit it. Using GitHub while logged in is also free and has none of these limits, while allowing them to much easier block exploiters.

[-] calcopiritus@lemmy.world 1 points 2 days ago

If they can charge for it. It means they can block it. https://www.wired.com/story/stack-overflow-will-charge-ai-giants-for-training-data/

You can also rate-limit. Blacklist known scrapper IPs.

And if it doesn't work. You make signing-in not optional. Which makes rate-limiting way easier.

The rate of human data consumption is much lower than LLM's. The humans won't even notice that they have a rate limit. At most they would only notice the need to create a stack overflow account.

[-] calcopiritus@lemmy.world 11 points 2 days ago

Yes. But not just in the "obvious" way.

I first started to contribute back when LLMs first appeared. Then SO allowed became LLM training grounds. Which made me stop contributing instantly.

I guess a not-insignificant amount of people stopped answering questions, which means less search results, which ends in less traffic.

I'm sure the fall wouldn't be as big as it is if they didn't allow LLMs to train on their data.

[-] calcopiritus@lemmy.world 4 points 2 days ago

As I said in the comment. You couldn't direct the pigs back then. There weren't carrots in a stick. There weren't even carrots. Saddles are very old items.

[-] calcopiritus@lemmy.world 8 points 2 days ago* (last edited 2 days ago)

At the start, horses didn't exist. But saddles did. Why? Because pigs can use saddles too. You couldn't even direct them, you'd just be on top of them while they wandered on their own.

Saddles weren't a tool, they were just a fun useless toy. Which makes sense you couldn't craft them. It was just a silly reward for finding a dungeon.

Then they introduced horses, which used the same saddles, but forgot why they weren't craftable in the first place.

EDIT: they did have a use. It was for the achievement of falling to death from a pig. Which makes it make extra-sense it being uncraftable. That is, until they introduced horses.

[-] calcopiritus@lemmy.world 2 points 3 days ago

You have to pay that mortgage, it doesn't matter if your house can cover it or not.

What are you gonna do? Sell your house to pay off your mortgage? And then where do you live?

If you own a single house, the synchronized raise/fall of house prices only affect the speed at which you can "upgrade" to a more expensive home. So prices going down benefit you.

[-] calcopiritus@lemmy.world 9 points 3 days ago

Everyone needs one house. When you sell your house, you have to buy (or rent) another one. If the value of your house drops by the same amount as everyone else's, then you lost nothing.

In fact, you probably gained because if you plan to buy a more expensive house, you have to pay less.

The only people for whom the fall of housing prices would be negative are those that plan on having less houses. That is, you have multiple, and want to sell some.

The median citizen is no real state investor.

[-] calcopiritus@lemmy.world 3 points 5 days ago

There is no chance you can make a consumer-facing product using imgui. The closest to that I've seen is imhex, which admittedly is way better looking than I thought was possible with imgui. But it is a tool for mainly developers, not a consumer-facing product.

[-] calcopiritus@lemmy.world 5 points 6 days ago

Just make a file system that maps each file name to 2 files. The 0 file and the 1 file.

Now with just a filename and 1 bit, you can have any file! The file is just 1 bit. It's the filesystems that needs more than that.

view more: next ›

calcopiritus

joined 2 years ago