[-] bouncing@partizle.com 3 points 1 year ago

I wouldn't say you need no dependencies in a Java project, but by all means check the average number of dependencies you get with Java or Python and compare it to almost any Node project.

You could probably sample projects on GitHub, look at the dependency graph, and compare.

[-] bouncing@partizle.com 3 points 1 year ago

Are we going to magically assume the traffic just vanished?

It's an underground highway. Out of sight, out of mind. I imagine they probably also improved the overall road design, like Seattle, Denver, and Boston have done (or are doing) with their projects to bury highways below-grade.

[-] bouncing@partizle.com 3 points 1 year ago

Her lawsuit doesn't say that. It says,

when ChatGPT is prompted, ChatGPT generates summaries of Plaintiffs’ copyrighted works—something only possible if ChatGPT was trained on Plaintiffs’ copyrighted works

That's an absurd claim. ChatGPT has surely read hundreds, perhaps thousands of reviews of her book. It can summarize it just like I can summarize Othello, even though I've never seen the play.

[-] bouncing@partizle.com 2 points 1 year ago

I very much agree.

[-] bouncing@partizle.com 5 points 1 year ago

Maybe you don't care, but the OSI definition does.

[-] bouncing@partizle.com 4 points 1 year ago

In fairness, they didn't release anything open at all.

[-] bouncing@partizle.com 4 points 1 year ago

You're getting lost in the weeds here and completely misunderstanding both copyright law and the technology used here.

First of all, copyright law does not care about the algorithms used and how well they map what a human mind does. That's irrelevant. There's nothing in particular about copyright that applies only to humans but not to machines. Either a work is transformative or it isn't. Either it's derivative of it isn't.

What AI is doing is incorporating individual works into a much, much larger corpus of writing style and idioms. If a LLM sees an idiom used a handful of times, it might start using it where the context fits. If a human sees an idiom used a handful of times, they might do the same. That's true regardless of algorithm and there's certainly nothing in copyright or common sense that separates one from another. If I read enough Hunter S Thompson, I might start writing like him. If you feed an LLM enough of the same, it might too.

Where copyright comes into play is in whether the new work produced is derivative or transformative. If an entity writes and publishes a sequel to The Road, Cormac McCarthy's estate is owed some money. If an entity writes and publishes something vaguely (or even directly) inspired by McCarthy's writing, no money is owed. How that work came to be (algorithms or human flesh) is completely immaterial.

So it's really, really hard to make the case that there's any direct copyright infringement here. Absorbing material and incorporating it into future works is what the act of reading is.

The problem is that as a consumer, if I buy a book for $12, I'm fairly limited in how much use I can get out of it. I can only buy and read so many books in my lifetime, and I can only produce so much content. The same is not true for an LLM, so there is a case that Congress should charge them differently for using copyrighted works, but the idea that OpenAI should have to go to each author and negotiate each book would really just shut the whole project down. (And no, it wouldn't be directly negotiated with publishers, as authors often retain the rights to deny or approve licensure).

[-] bouncing@partizle.com 4 points 1 year ago* (last edited 1 year ago)

Isn’t learning the basic act of reading text?

not even close. that’s not how AI training models work, either.

Of course it is. It's not a 1:1 comparison, but the way generative AI works and the we incorporate styles and patterns are more similar than not. Besides, if a tensorflow script more closely emulated a human's learning process, would that matter for you? I doubt that very much.

Thousands of authors demand payment from AI companies for use of copyrighted works::Thousands of published authors are requesting payment from tech companies for the use of >> their copyrighted works in training artificial intelligence tools

Having to individually license each unit of work for a LLM would be as ridiculous as trying to run a university where you have to individually license each student reading each textbook. It would never work.

What we're broadly talking about is generative work. That is, by absorbing one a body of work, the model incorporates it into an overall corpus of learned patterns. That's not materially different from how anyone learns to write. Even my use of the word "materially" in the last sentence is, surely, based on seeing it used in similar patterns of text.

The difference is that a human's ability to absorb information is finite and bounded by the constraints of our experience. If I read 100 science fiction books, I can probably write a new science fiction book in a similar style. The difference is that I can only do that a handful of times in a lifetime. A LLM can do it almost infinitely and then have that ability reused by any number of other consumers.

There's a case here that the renumeration process we have for original work doesn't fit well into the AI training models, and maybe Congress should remedy that, but on its face I don't think it's feasible to just shut it all down. Something of a compulsory license model, with the understanding that AI training is automatically fair use, seems more reasonable.

0
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com

Just something fun

0
submitted 1 year ago* (last edited 1 year ago) by bouncing@partizle.com to c/privacy@lemmy.world

The EU-US Data Privacy Framework (DPF) is the third attempt between the trading bloc and the US to iron out privacy kinks in the flow of data about their citizens. This latest agreement marks the EU's determination that "the United States ensures an adequate level of protection – comparable to that of the European Union – for personal data transferred from the EU to US companies under the new framework," the Commission said in a statement.

Key to today's decision [PDF] was an October executive order signed by US President Joe Biden that the Commission said adds new safeguards that address the problems raised with the second attempt at a transatlantic data agreement, Privacy Shield.

0
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com

The EU-US Data Privacy Framework (DPF) is the third attempt between the trading bloc and the US to iron out privacy kinks in the flow of data about their citizens. This latest agreement marks the EU's determination that "the United States ensures an adequate level of protection – comparable to that of the European Union – for personal data transferred from the EU to US companies under the new framework," the Commission said in a statement.

Key to today's decision [PDF] was an October executive order signed by US President Joe Biden that the Commission said adds new safeguards that address the problems raised with the second attempt at a transatlantic data agreement, Privacy Shield.

0

"Other platforms cannot replace it," said a senior member of the Taliban in a tweet, explaining that Meta is "intolerant."

0
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com

TL/DR: Google used flimsier parts that are more likely to break over time (aluminum over stainless steel, etc).

1
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com
[-] bouncing@partizle.com 4 points 1 year ago

Is that site still around?

[-] bouncing@partizle.com 2 points 1 year ago

Posting your band's tour dates on Facebook doesn't really even change your privacy status that much.

Whether you have a Facebook account or not, Facebook tracks you around the web. Data brokers sell your data. Your cell phone company sells your location and browsing history, etc.

People over-estimate how much not using any given social media app really matters.

Now granted, installing it on your phone gives them a level of data they wouldn't have from a web browser. That's probably why Threads is phone-only.

[-] bouncing@partizle.com 2 points 1 year ago

They’re defederating smaller entities because the network got consumed by corpos. And abuse, but lots of that comes from big services and they don’t defed those.

It's tempting to believe the email issue really is some conspiracy to keep the little guy down, but it really is just that a new domain, with low volume, is a strong signal for abuse. That is true with or without trouble from Gmail, Yahoo, etc. If you wrote a machine learning algorithm to find spam, your ML would come to the same conclusion. There's no obvious solution to that.

Fediverse instances aren’t just providers, they’re communities.

Just like email list serves. Should a listserv block gmail subscriptions? I would again argue not.

This is in essence what FB/Meta is doing, all the time, except it’s not individual spam it’s an algorithmically backed manipulation mechanism using it’s users as tools ^.^

Presumably people using Threads want that. Or they'll tolerate it.

1
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com

This has probably been pointed out before, but it's not like Twitter is hard to build, especially for a company like Facebo—I mean, "Meta."

What's hard is content moderation and community building.

1
submitted 1 year ago* (last edited 1 year ago) by bouncing@partizle.com to c/tech@partizle.com
1
1
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com

One of the features seems to be a "hide my email" feature, akin to Apple's hide my email or Fastmail's masked email feature.

Having used both of those, I would say one downside is that occasionally, a site will detect that I used the Apple one, which is strange because it's just an iCloud email address. Perhaps they're looking for a specific pattern.

I haven't yet seen the Fastmail one blocked.

One concern with the Proton one is that it seems like its masked emails are all at passmail.com. I've already found some sites block protonmail, so they'll surely block passmail like they do Mailinator and other sites. That could be a limitation that's less likely to affect Fastmail's service.

1
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com

It does seem like sooner or later, if someone is able to build a reliable AI model of my face and voice, they could even phish my own relatives by video call.

Seems like a Philip K. Dick novel—objective reality is something you could only see around you, while the machine would be completely untrustworthy.

1
Kagi raises $670K (blog.kagi.com)
submitted 1 year ago by bouncing@partizle.com to c/tech@partizle.com

Kagi is a paid search engine. Instead of getting ads, you just pay for the privilege of using it.

I've been using it for a while and overall I think for most searches it's better than Google. It isn't necessarily that the content is always better (sometimes it isn't) but the signal is far easier to find through the noise.

view more: ‹ prev next ›

bouncing

joined 1 year ago
MODERATOR OF